Add KoCommonGEN v2 benchmark #2208

metterian · 2024-08-12T07:50:27Z

Description:
This PR adds support for the KoCommonGEN v2 benchmark, a new dataset for evaluating Korean commonsense reasoning in large language models.

Changes:

Added KoCommonGEN v2 task definition
Updated task list to include ko_commongen_v2
Added citation information for the benchmark

KoCommonGEN v2 Details:

Paper: "KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models"
Accepted to Findings of ACL 2024
GitHub: https://github.com/J-Seo/KoCommonGEN-V2
Dataset: https://huggingface.co/datasets/nlpai-lab/ko_commongen_v2

This benchmark provides a valuable resource for evaluating Korean language models on commonsense reasoning tasks. Adding it to our evaluation suite will help broaden our coverage of multilingual NLP capabilities.

Please review and let me know if any changes or additional information are needed.

CLAassistant · 2024-08-12T07:50:33Z

Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you sign our Contributor License Agreement before we can accept your contribution.
You have signed the CLA already but the status is still pending? Let us recheck it.

lintangsutawika · 2024-08-15T16:39:27Z

lm_eval/tasks/ko_commongen_v2/code_switching/utils.py

+            # "choices": [f"{doc[str(i+1)]}" for i in range(4)],
+            # "choices": [f'{str(i+1)}. ' + doc['{i}'.format(i=i + 1)] for i in range(4)],  # The list of choices.
+            # "choices": [str(i+1) for i in range(4)],  # The list of choices.


Just want to check if this is an alternative option (which is why it's commented but left in)?


haileyschoelkopf

Thanks for the PR! Just a few small changes and then we can merge this.

haileyschoelkopf · 2024-08-22T16:53:29Z

lm_eval/tasks/ko_commongen_v2/code_switching/_default

@@ -0,0 +1,19 @@
+task: ko_commongen_v2


Suggested change

task: ko_commongen_v2

Let's remove the task: field, since this is a template/stub config.
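To illustrate why a shared template is better off without a task: entry, here is a minimal sketch of template-plus-child config resolution. It is a simplified model, not the harness's actual loader: resolve_config is a hypothetical helper, and the output_type value is an assumed placeholder. Only the dataset path and the task name come from this PR.

```python
def resolve_config(template, child):
    """Overlay a concrete task config on top of a shared template.

    Hypothetical helper: keys in `child` override keys in `template`.
    """
    merged = dict(template)  # copy so the template itself is not mutated
    merged.update(child)
    return merged


# Shared template: no "task" key, so it cannot be mistaken for a runnable task.
template = {
    "dataset_path": "nlpai-lab/ko_commongen_v2",
    "output_type": "multiple_choice",  # assumed value, for illustration only
}

# Concrete config: the only place where the task name is declared.
child = {"task": "ko_commongen_v2"}

resolved = resolve_config(template, child)
print(resolved["task"])
```

Under this model, the name is declared once in the concrete config and the template only carries shared settings.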

haileyschoelkopf · 2024-08-22T16:54:07Z

lm_eval/tasks/ko_commongen_v2/code_switching/utils.py

+            # "choices": [f"{doc[str(i+1)]}" for i in range(4)],
+            # "choices": [f'{str(i+1)}. ' + doc['{i}'.format(i=i + 1)] for i in range(4)],  # The list of choices.
+            # "choices": [str(i+1) for i in range(4)],  # The list of choices.


+1, if this comment is safe to delete let's do so!

haileyschoelkopf · 2024-08-22T16:54:16Z

lm_eval/tasks/ko_commongen_v2/utils.py

+        out_doc = {
+            "query": query,
+            "choices": [f"{i+1}. {doc[str(i+1)]}" for i in range(4)],
+            # "choices": [f"{doc[str(i+1)]}" for i in range(4)],


Same here.
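For reference, a minimal sketch of what the document-processing step could look like once the commented-out alternatives are deleted. The function name build_out_doc and the "query" field on the input document are assumptions for illustration. Only the "choices" expression and the "1" to "4" keys are taken from the diff above.

```python
def build_out_doc(doc):
    """Build the processed document with numbered answer choices.

    Assumes `doc` stores its four options under the string keys "1".."4",
    as the diff suggests, and a prompt under "query" (hypothetical key).
    """
    return {
        "query": doc["query"],
        # Prefix each option with its 1-based index, e.g. "1. <option text>".
        "choices": [f"{i + 1}. {doc[str(i + 1)]}" for i in range(4)],
    }


# Hypothetical document, for illustration only.
example = {"query": "Q", "1": "a", "2": "b", "3": "c", "4": "d"}
print(build_out_doc(example)["choices"])
# → ['1. a', '2. b', '3. c', '4. d']
```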

haileyschoelkopf · 2024-08-28T13:52:54Z

Hi @metterian, just following up to see if you'd be able to make these final few changes so we can merge this task! If not, we'll try to get to them ourselves.

Note also that we'd ideally have an entry in https://github.com/EleutherAI/lm-evaluation-harness/blob/main/lm_eval/tasks/README.md describing the task as well, so users know about your task!

implementing ko_commongen_v2

b7f8920

metterian requested review from haileyschoelkopf and lintangsutawika as code owners August 12, 2024 07:50

lintangsutawika reviewed Aug 15, 2024


haileyschoelkopf requested changes Aug 22, 2024
