Commit

running pre-commit
shahrzads committed Sep 3, 2024
1 parent b0366af commit 33237fe
Showing 336 changed files with 474 additions and 509 deletions.
6 changes: 2 additions & 4 deletions lm_eval/tasks/arabic_leaderboard_complete/README.md
@@ -59,7 +59,7 @@ Homepage: https://huggingface.co/spaces/OALL/Open-Arabic-LLM-Leaderboard
abstract = "Recent advances in the space of Arabic large language models have opened up a wealth of potential practical applications. From optimal training strategies, large scale data acquisition and continuously increasing NLP resources, the Arabic LLM landscape has improved in a very short span of time, despite being plagued by training data scarcity and limited evaluation resources compared to English. In line with contributing towards this ever-growing field, we introduce AlGhafa, a new multiple-choice evaluation benchmark for Arabic LLMs. For showcasing purposes, we train a new suite of models, including a 14 billion parameter model, the largest monolingual Arabic decoder-only model to date. We use a collection of publicly available datasets, as well as a newly introduced HandMade dataset consisting of 8 billion tokens. Finally, we explore the quantitative and qualitative toxicity of several Arabic models, comparing our models to existing public Arabic LLMs.",
}
@misc{huang2023acegpt,
-title={AceGPT, Localizing Large Language Models in Arabic},
+title={AceGPT, Localizing Large Language Models in Arabic},
author={Huang Huang and Fei Yu and Jianqing Zhu and Xuening Sun and Hao Cheng and Dingjie Song and Zhihong Chen and Abdulmohsen Alharthi and Bang An and Ziche Liu and Zhiyi Zhang and Junying Chen and Jianquan Li and Benyou Wang and Lian Zhang and Ruoyu Sun and Xiang Wan and Haizhou Li and Jinchao Xu},
year={2023},
eprint={2309.12053},
@@ -239,7 +239,7 @@ Homepage: https://huggingface.co/spaces/OALL/Open-Arabic-LLM-Leaderboard
- `arabic_leaderboard_acva_computer_and_phone`
- `arabic_leaderboard_acva_daily_life`
- `arabic_leaderboard_acva_entertainment`

### Checklist

For adding novel benchmarks/datasets to the library:
@@ -252,5 +252,3 @@ If other tasks on this dataset are already supported:
* [ ] Is the "Main" variant of this task clearly denoted?
* [ ] Have you provided a short sentence in a README on what each new variant adds / evaluates?
* [ ] Have you noted which, if any, published evaluation setups are matched by this variant?


@@ -20,4 +20,4 @@ aggregate_metric_list:
     aggregation: mean
     weight_by_size: true
 metadata:
-  version: 1.0
+  version: 1.0
@@ -20,4 +20,4 @@ metric_list:
     aggregation: mean
     higher_is_better: true
 metadata:
-  version: 1.0
+  version: 1.0
@@ -20,4 +20,4 @@ metric_list:
     aggregation: mean
     higher_is_better: true
 metadata:
-  version: 1.0
+  version: 1.0
@@ -20,4 +20,4 @@ metric_list:
     aggregation: mean
     higher_is_better: true
 metadata:
-  version: 1.0
+  version: 1.0
@@ -20,4 +20,4 @@ metric_list:
     aggregation: mean
     higher_is_better: true
 metadata:
-  version: 1.0
+  version: 1.0
@@ -20,4 +20,4 @@ metric_list:
     aggregation: mean
     higher_is_better: true
 metadata:
-  version: 1.0
+  version: 1.0
@@ -20,4 +20,4 @@ metric_list:
     aggregation: mean
     higher_is_better: true
 metadata:
-  version: 1.0
+  version: 1.0
@@ -20,4 +20,4 @@ metric_list:
     aggregation: mean
     higher_is_better: true
 metadata:
-  version: 1.0
+  version: 1.0
@@ -20,4 +20,4 @@ metric_list:
     aggregation: mean
     higher_is_better: true
 metadata:
-  version: 1.0
+  version: 1.0
@@ -20,4 +20,4 @@ metric_list:
     aggregation: mean
     higher_is_better: true
 metadata:
-  version: 1.0
+  version: 1.0
@@ -7,7 +7,9 @@ def _process_doc(doc):
         question = doc["query"]
         answer_index = int(doc["label"])
         # Dynamically determining the choices by excluding '__few_shots', 'query' and 'label'
-        choices_keys = [key for key in doc.keys() if key not in ["query", "label", "__few_shots"]]
+        choices_keys = [
+            key for key in doc.keys() if key not in ["query", "label", "__few_shots"]
+        ]
         choices = [doc[key] for key in choices_keys]
 
         instruction = "الأسئلة التالية هي أسئلة متعددة الإختيارات مع الجواب الصحيح\n\n"
@@ -16,9 +18,6 @@ def _process_doc(doc):
             query += f"{index}) {choice}\n"
         query += "الإجابة:"
 
-        return {
-            "query": query,
-            "choices": choices,
-            "gold": answer_index
-        }
-    return dataset.map(_process_doc)
+        return {"query": query, "choices": choices, "gold": answer_index}
+
+    return dataset.map(_process_doc)
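For reference, the per-document step touched by this hunk can be sketched as a standalone function on a plain dict. This is a minimal sketch under stated assumptions, not the file's actual contents: the lines between the instruction string and the choice loop are hidden by the diff, so the way the question is appended to `query` and the `enumerate` loop header are guesses, and the sample record with `sol1`/`sol2` keys is hypothetical.

```python
# Keys that are metadata rather than answer choices (taken from the diff).
EXCLUDED_KEYS = ["query", "label", "__few_shots"]

INSTRUCTION = "الأسئلة التالية هي أسئلة متعددة الإختيارات مع الجواب الصحيح\n\n"


def process_doc(doc: dict) -> dict:
    """Turn one raw record into the query/choices/gold layout used in the diff."""
    question = doc["query"]
    answer_index = int(doc["label"])

    # Every remaining key is treated as an answer choice, in insertion order.
    choices = [doc[key] for key in doc.keys() if key not in EXCLUDED_KEYS]

    # Assumption: the hidden lines prepend the instruction and the question.
    query = f"{INSTRUCTION}{question}\n"
    # Assumption: the hidden loop header enumerates the choices from 0.
    for index, choice in enumerate(choices):
        query += f"{index}) {choice}\n"
    query += "الإجابة:"

    return {"query": query, "choices": choices, "gold": answer_index}


# Hypothetical record; the real dataset's choice keys may be named differently.
sample = {"query": "ما هي عاصمة مصر؟", "sol1": "القاهرة", "sol2": "الرياض", "label": "0"}
processed = process_doc(sample)
```

In the diff itself this logic lives in a nested `_process_doc` that is applied to every row with `dataset.map(_process_doc)`; the commit only reflows the list comprehension and the return statement, so behavior is unchanged.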
@@ -20,4 +20,4 @@ metric_list:
     aggregation: mean
     higher_is_better: true
 metadata:
-  version: 1.0
+  version: 1.0
@@ -10,4 +10,4 @@ aggregate_metric_list:
     aggregation: mean
     weight_by_size: true
 metadata:
-  version: 1.0
+  version: 1.0
@@ -1,6 +1,7 @@
 import datasets
 import numpy as np
 
+
 # fmt: off
 LETTER_INDICES_AR = ["أ", "ب", "ج", "د", "هـ", "و", "ز", "ح", "ط", "ي", "ك", "ل", "م", "ن", "س", "ع", "ف", "ص", "ق", "ر", "ش", "ت", "ث", "خ", "ذ", "ض", "ظ", "غ"]
 # fmt: on
@@ -10,12 +11,15 @@
 LETTER_INDICES = ["A", "B", "C", "D", "E", "F", "G", "H", "I", "J", "K", "L", "M", "N", "O", "P", "Q", "R", "S", "T", "U", "V", "W", "X", "Y", "Z"]
 # fmt: on
 
+
 def process_docs(dataset: datasets.Dataset):
     def _process_doc(doc):
         topic = doc["subject"]
         question = doc["question"]
         choices = [doc["A"], doc["B"], doc["C"], doc["D"]]
-        choices_formatted = [f" {LETTER_INDICES_AR[i]}) {choice}\n" for i, choice in enumerate(choices)]
+        choices_formatted = [
+            f" {LETTER_INDICES_AR[i]}) {choice}\n" for i, choice in enumerate(choices)
+        ]
         answer = doc["answer"]
         answer_index = LETTER_INDICES.index(answer)
 
@@ -24,9 +28,6 @@ def _process_doc(doc):
         query += "\n".join(choices_formatted)
         query += "\nالإجابة:"
 
-        return {
-            "query": query,
-            "choices": LETTER_INDICES_AR[:4],
-            "gold": answer_index
-        }
-    return dataset.map(_process_doc)
+        return {"query": query, "choices": LETTER_INDICES_AR[:4], "gold": answer_index}
+
+    return dataset.map(_process_doc)
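The letter-mapping step in this file can be illustrated with a small standalone sketch. It assumes a record with `question`, `A` to `D`, and `answer` fields as shown in the diff; the line that starts `query` is hidden by the hunk boundary, so the header used here (just the question, without the `topic` value) is an assumption, and the sample record is hypothetical. Only the first four letters of each list are reproduced.

```python
# First four entries of the letter lists shown in the diff.
LETTER_INDICES_AR = ["أ", "ب", "ج", "د"]
LETTER_INDICES = ["A", "B", "C", "D"]


def process_doc(doc: dict) -> dict:
    """Format a four-option question with Arabic option letters."""
    question = doc["question"]
    choices = [doc["A"], doc["B"], doc["C"], doc["D"]]
    choices_formatted = [
        f" {LETTER_INDICES_AR[i]}) {choice}\n" for i, choice in enumerate(choices)
    ]

    # The gold label arrives as a Latin letter and is converted to an index.
    answer_index = LETTER_INDICES.index(doc["answer"])

    # Assumption: the hidden line builds the prompt header from the question.
    query = f"{question}\n"
    query += "\n".join(choices_formatted)
    query += "\nالإجابة:"

    # The model is scored on the Arabic letters, not on the answer texts.
    return {"query": query, "choices": LETTER_INDICES_AR[:4], "gold": answer_index}


# Hypothetical record for illustration only.
sample = {"question": "2 + 2 = ؟", "A": "3", "B": "4", "C": "5", "D": "6", "answer": "B"}
processed = process_doc(sample)
```

Note that each formatted choice already ends in `\n` and the list is then joined with `"\n"`, so options are separated by a blank line; the sketch keeps that behavior because the diff shows it unchanged.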
@@ -65,4 +65,4 @@ aggregate_metric_list:
     aggregation: mean
     weight_by_size: true
 metadata:
-  version: 1.0
+  version: 1.0
@@ -20,4 +20,4 @@ metric_list:
     aggregation: mean
     higher_is_better: true
 metadata:
-  version: 1.0
+  version: 1.0
@@ -20,4 +20,4 @@ metric_list:
     aggregation: mean
     higher_is_better: true
 metadata:
-  version: 1.0
+  version: 1.0
@@ -20,4 +20,4 @@ metric_list:
     aggregation: mean
     higher_is_better: true
 metadata:
-  version: 1.0
+  version: 1.0
@@ -20,4 +20,4 @@ metric_list:
     aggregation: mean
     higher_is_better: true
 metadata:
-  version: 1.0
+  version: 1.0
@@ -20,4 +20,4 @@ metric_list:
     aggregation: mean
     higher_is_better: true
 metadata:
-  version: 1.0
+  version: 1.0
@@ -20,4 +20,4 @@ metric_list:
     aggregation: mean
     higher_is_better: true
 metadata:
-  version: 1.0
+  version: 1.0
@@ -20,4 +20,4 @@ metric_list:
     aggregation: mean
     higher_is_better: true
 metadata:
-  version: 1.0
+  version: 1.0
@@ -20,4 +20,4 @@ metric_list:
     aggregation: mean
     higher_is_better: true
 metadata:
-  version: 1.0
+  version: 1.0
@@ -20,4 +20,4 @@ metric_list:
     aggregation: mean
     higher_is_better: true
 metadata:
-  version: 1.0
+  version: 1.0
@@ -20,4 +20,4 @@ metric_list:
     aggregation: mean
     higher_is_better: true
 metadata:
-  version: 1.0
+  version: 1.0
@@ -20,4 +20,4 @@ metric_list:
     aggregation: mean
     higher_is_better: true
 metadata:
-  version: 1.0
+  version: 1.0
@@ -20,4 +20,4 @@ metric_list:
     aggregation: mean
     higher_is_better: true
 metadata:
-  version: 1.0
+  version: 1.0
@@ -20,4 +20,4 @@ metric_list:
     aggregation: mean
     higher_is_better: true
 metadata:
-  version: 1.0
+  version: 1.0
@@ -20,4 +20,4 @@ metric_list:
     aggregation: mean
     higher_is_better: true
 metadata:
-  version: 1.0
+  version: 1.0
@@ -20,4 +20,4 @@ metric_list:
     aggregation: mean
     higher_is_better: true
 metadata:
-  version: 1.0
+  version: 1.0
@@ -20,4 +20,4 @@ metric_list:
     aggregation: mean
     higher_is_better: true
 metadata:
-  version: 1.0
+  version: 1.0
@@ -20,4 +20,4 @@ metric_list:
     aggregation: mean
     higher_is_better: true
 metadata:
-  version: 1.0
+  version: 1.0
@@ -20,4 +20,4 @@ metric_list:
     aggregation: mean
     higher_is_better: true
 metadata:
-  version: 1.0
+  version: 1.0
@@ -20,4 +20,4 @@ metric_list:
     aggregation: mean
     higher_is_better: true
 metadata:
-  version: 1.0
+  version: 1.0
@@ -20,4 +20,4 @@ metric_list:
     aggregation: mean
     higher_is_better: true
 metadata:
-  version: 1.0
+  version: 1.0
@@ -20,4 +20,4 @@ metric_list:
     aggregation: mean
     higher_is_better: true
 metadata:
-  version: 1.0
+  version: 1.0
@@ -20,4 +20,4 @@ metric_list:
     aggregation: mean
     higher_is_better: true
 metadata:
-  version: 1.0
+  version: 1.0
@@ -20,4 +20,4 @@ metric_list:
     aggregation: mean
     higher_is_better: true
 metadata:
-  version: 1.0
+  version: 1.0
@@ -20,4 +20,4 @@ metric_list:
     aggregation: mean
     higher_is_better: true
 metadata:
-  version: 1.0
+  version: 1.0
@@ -20,4 +20,4 @@ metric_list:
     aggregation: mean
     higher_is_better: true
 metadata:
-  version: 1.0
+  version: 1.0
@@ -20,4 +20,4 @@ metric_list:
     aggregation: mean
     higher_is_better: true
 metadata:
-  version: 1.0
+  version: 1.0
@@ -20,4 +20,4 @@ metric_list:
     aggregation: mean
     higher_is_better: true
 metadata:
-  version: 1.0
+  version: 1.0
@@ -20,4 +20,4 @@ metric_list:
     aggregation: mean
     higher_is_better: true
 metadata:
-  version: 1.0
+  version: 1.0
@@ -20,4 +20,4 @@ metric_list:
     aggregation: mean
     higher_is_better: true
 metadata:
-  version: 1.0
+  version: 1.0
@@ -20,4 +20,4 @@ metric_list:
     aggregation: mean
     higher_is_better: true
 metadata:
-  version: 1.0
+  version: 1.0
@@ -20,4 +20,4 @@ metric_list:
     aggregation: mean
     higher_is_better: true
 metadata:
-  version: 1.0
+  version: 1.0