fix the leaderboard doc to reflect the tasks (#2219)
NathanHB committed Aug 20, 2024
1 parent 97327e4 commit 221c7d7
Showing 1 changed file with 1 addition and 4 deletions.
5 changes: 1 addition & 4 deletions lm_eval/tasks/leaderboard/README.md
@@ -51,15 +51,13 @@ In this work, we focus on a suite of 23 challenging BIG-Bench tasks which we cal
- `leaderboard_bbh_causal_judgement`
- `leaderboard_bbh_date_understanding`
- `leaderboard_bbh_disambiguation_qa`
-- `leaderboard_bbh_dyck_languages`
- `leaderboard_bbh_formal_fallacies`
- `leaderboard_bbh_geometric_shapes`
- `leaderboard_bbh_hyperbaton`
- `leaderboard_bbh_logical_deduction_five_objects`
- `leaderboard_bbh_logical_deduction_seven_objects`
- `leaderboard_bbh_logical_deduction_three_objects`
- `leaderboard_bbh_movie_recommendation`
-- `leaderboard_bbh_multistep_arithmetic_two`
- `leaderboard_bbh_navigate`
- `leaderboard_bbh_object_counting`
- `leaderboard_bbh_penguins_in_a_table`
@@ -73,7 +71,6 @@ In this work, we focus on a suite of 23 challenging BIG-Bench tasks which we cal
- `leaderboard_bbh_tracking_shuffled_objects_seven_objects`
- `leaderboard_bbh_tracking_shuffled_objects_three_objects`
- `leaderboard_bbh_web_of_lies`
-- `leaderboard_bbh_word_sorting`

## GPQA

@@ -215,7 +212,7 @@ Eprint = {arXiv:2206.14858},
- `leaderboard_math_intermediate_algebra_hard`
- `leaderboard_math_num_theory_hard`
- `leaderboard_math_prealgebra_hard`
-- `leaderboard_math_precalc_hard`
+- `leaderboard_math_precalculus_hard`


## MMLU-Pro