
Evaluation on public dataset #94

Open

Blank-z0 opened this issue Oct 11, 2024 · 0 comments
Hi, great work!
I'm trying to reproduce the results on the public datasets. However, I only found the training code, in which the model is evaluated on the eval set (or do you use only a train/test split rather than a train/eval/test split?). I'd like to know whether you partitioned the public datasets into a separate test set, and whether the results reported in the paper correspond to the test set or to the eval set.
If I want to carve out a test set myself, should I set ignore_last_n=2, 1, and 0 when loading the train, eval, and test datasets, respectively? For example:

train_dataset = DatasetV2(
    ratings_file=dp.output_format_csv(),
    padding_length=max_sequence_length + 1,  # +1 reserves a slot for the target item
    ignore_last_n=2,  # hold out the last two interactions (eval + test)
    chronological=chronological,
)
eval_dataset = DatasetV2(
    ratings_file=dp.output_format_csv(),
    padding_length=max_sequence_length + 1,  # +1 reserves a slot for the target item
    ignore_last_n=1,  # hold out the last interaction (test)
    chronological=chronological,
)
test_dataset = DatasetV2(
    ratings_file=dp.output_format_csv(),
    padding_length=max_sequence_length + 1,  # +1 reserves a slot for the target item
    ignore_last_n=0,  # use the full sequence; the last interaction is the test target
    chronological=chronological,
)
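
For what it's worth, my mental model of ignore_last_n is the leave-one-out scheme sketched below. The split_sequence helper is hypothetical and only illustrates the truncation I'm assuming; DatasetV2 may implement it differently:

from typing import List

def split_sequence(item_ids: List[int], ignore_last_n: int) -> List[int]:
    # Drop the last `ignore_last_n` interactions from a user's chronological history.
    return item_ids if ignore_last_n == 0 else item_ids[:-ignore_last_n]

history = [10, 42, 7, 99, 3]        # one user's interactions, oldest to newest
train = split_sequence(history, 2)  # [10, 42, 7]        -> 7 would be the train target
eval_ = split_sequence(history, 1)  # [10, 42, 7, 99]    -> 99 would be the eval target
test  = split_sequence(history, 0)  # [10, 42, 7, 99, 3] -> 3 would be the test target

Is this the intended usage for producing a test split?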