Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to reproduce the results in the paper. #2

Open
xssstory opened this issue Jan 2, 2024 · 3 comments
Open

How to reproduce the results in the paper. #2

xssstory opened this issue Jan 2, 2024 · 3 comments

Comments

@xssstory
Copy link

xssstory commented Jan 2, 2024

Hello, I have downloaded the released model and followed the inference command you provided.

However, it seems that the strict accuracy is not matched with the number you reported in the paper.

My inference command is:

python  src/inference.py JingyaoLi/MoTCoder-15B-v1.0/ apps/test.jsonl ./output/generation.jsonl FORMAT_PROMPT

After evaluation, the accuracy on the competition level is:

img_v3_026k_87a2bf54-f748-4082-8fef-13683fe91ddg

Could you please help me do inference correctly?

@JulietLJY
Copy link
Member

Hi, our reported pass@1 is the average/normalized pass@1. You can refer to this benchmark paper for the detailed metric definition.

@xssstory
Copy link
Author

xssstory commented Jan 5, 2024

Thanks for your reply!

I noticed that the pass@1 and pass@5 of GPT-Neo (Tab.4 in your paper) are strict accuracies.

I believe it would be better to report the numbers using the consistent metric in Tab.4.

@JulietLJY
Copy link
Member

Thank you for bringing this to our attention. We have verified that what you mentioned is correct. Due to an error in reporting in line with previous work, we have inaccuracies in the performance metrics of competitive methods in the paper. We will rectify our mistake as soon as possible.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants