PR fixing the issue #1391 (wrong contexts in the mgsm task) #1440

leocnj · 2024-02-18T13:41:50Z

The issue reported in #1391 has been fixed.

For example, for mgsm_direct_en, new yaml file will be

!!@@##@@!! -- Example 1
Question: Roger has 5 tennis balls. He buys 2 more cans of tennis balls. Each can has 3 tennis balls. How many tennis balls does he have now?
Answer:Roger started with 5 balls. 2 cans of 3 tennis balls each is 6 tennis balls. 5 + 6 = 11. The answer is 11.

Question: If there are 3 cars in the parking lot and 2 more cars arrive, how many cars are in the parking lot?
Answer:There are 3 cars in the beginning, 2 more arrive, so now there should be 3 + 2 = 5 cars. The answer is 5.

Question: There were nine computers in the server room. Five more computers were installed each day, from monday to thursday. How many computers are now in the server room?
Answer:

For mgsm_native_cot_zh, the new yaml file will be

!!@@##@@!! -- Example 1
问题：罗杰有 5 个网球。他又买了 2 罐网球。每罐有 3 个网球。他现在有多少个网球？
逐步解答: 杰一开始有 5 个球。2 罐各 3 个网球就是 6 个网球。5 + 6 = 11。答案是 11。

问题：如果停车场里有 3 辆车，又来了 2 辆车，停车场里有多少辆车？
逐步解答: 开始有 3 辆车，又来了 2 辆，所以现在应该有 3 + 2 = 5 辆车。答案是 5。

问题：服务器机房里有九台电脑。从周一到周四，每天又安装了五台电脑。服务器机房里现在有多少台电脑？
逐步解答:

…keep the one with a space (default)

- change naming so that file name will match with task name - task|file follows a consistent naming way, mgsm_(mode)_(lang) for three modes, i.e., direct, en_cot, and native_cot

CLAassistant · 2024-02-18T13:41:55Z

All committers have signed the CLA.

haileyschoelkopf

Thanks very much for this PR! looks good to me, modulo the one nit I had on target delimiter.

Also flagging that we will want to re-look at MGSM to apply some of the better answer extraction from GSM in #1356 to it.

lm_eval/tasks/mgsm/en_cot/cot_yaml

lm_eval/tasks/mgsm/native_cot/mgsm_native_cot_zh.yaml

lm_eval/tasks/mgsm/native_cot/mgsm_native_cot_ja.yaml

haileyschoelkopf · 2024-02-22T14:42:02Z

Thank you for this!

thnkinbtfly · 2024-02-23T07:26:52Z

Thanks very much for this PR! looks good to me, modulo the one nit I had on target delimiter.

Also flagging that we will want to re-look at MGSM to apply some of the better answer extraction from GSM in #1356 to it.

Working on it!

…EleutherAI#1440) * fix the issue EleutherAI#1391, wrong contexts in mgsm tasks * fix yaml issue for having two target_delimiter lines. For COT tasks, keep the one with a space (default) * regenerate all task yaml files - change naming so that file name will match with task name - task|file follows a consistent naming way, mgsm_(mode)_(lang) for three modes, i.e., direct, en_cot, and native_cot * English CoTs should have a space as target_delimiter * Update utils.py * Apply suggestions from code review --------- Co-authored-by: Hailey Schoelkopf <[email protected]>

leocnj added 3 commits February 17, 2024 10:37

fix the issue EleutherAI#1391, wrong contexts in mgsm tasks

a5042e2

fix yaml issue for having two target_delimiter lines. For COT tasks, …

418da68

…keep the one with a space (default)

regenerate all task yaml files

778a804

- change naming so that file name will match with task name - task|file follows a consistent naming way, mgsm_(mode)_(lang) for three modes, i.e., direct, en_cot, and native_cot

leocnj requested review from haileyschoelkopf and lintangsutawika as code owners February 18, 2024 13:41

haileyschoelkopf approved these changes Feb 19, 2024

View reviewed changes

lm_eval/tasks/mgsm/en_cot/cot_yaml Show resolved Hide resolved

haileyschoelkopf added the bug Something isn't working. label Feb 19, 2024

haileyschoelkopf approved these changes Feb 22, 2024

View reviewed changes

English CoTs should have a space as target_delimiter

4e43d16

haileyschoelkopf reviewed Feb 22, 2024

View reviewed changes

lm_eval/tasks/mgsm/native_cot/mgsm_native_cot_zh.yaml Show resolved Hide resolved

Update utils.py

9c4030b

haileyschoelkopf reviewed Feb 22, 2024

View reviewed changes

lm_eval/tasks/mgsm/native_cot/mgsm_native_cot_ja.yaml Show resolved Hide resolved

Apply suggestions from code review

ab4b682

haileyschoelkopf merged commit a72babb into EleutherAI:main Feb 22, 2024
7 of 8 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

PR fixing the issue #1391 (wrong contexts in the mgsm task) #1440

PR fixing the issue #1391 (wrong contexts in the mgsm task) #1440

leocnj commented Feb 18, 2024

CLAassistant commented Feb 18, 2024 •

edited

Loading

haileyschoelkopf left a comment

haileyschoelkopf commented Feb 22, 2024

thnkinbtfly commented Feb 23, 2024

PR fixing the issue #1391 (wrong contexts in the mgsm task) #1440

PR fixing the issue #1391 (wrong contexts in the mgsm task) #1440

Conversation

leocnj commented Feb 18, 2024

CLAassistant commented Feb 18, 2024 • edited Loading

haileyschoelkopf left a comment

Choose a reason for hiding this comment

haileyschoelkopf commented Feb 22, 2024

thnkinbtfly commented Feb 23, 2024

CLAassistant commented Feb 18, 2024 •

edited

Loading