Skip to content

Actions: EleutherAI/lm-evaluation-harness

Tasks Modified

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
2,930 workflow runs
2,930 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

fix writeout script
Tasks Modified #3419: Pull request #2350 opened by baberabb
September 25, 2024 12:25 16s writeout
September 25, 2024 12:25 16s
Support pipeline parallel with OpenVINO models
Tasks Modified #3418: Pull request #2349 opened by sstrehlk
September 25, 2024 11:30 Action required sstrehlk:sstrehlk-ov-parallelizm
September 25, 2024 11:30 Action required
Fix float limit override
Tasks Modified #3417: Pull request #2325 synchronize by cjluo-omniml
September 24, 2024 23:15 14s cjluo-omniml:patch-1
September 24, 2024 23:15 14s
Merge New Tasks
Tasks Modified #3416: Pull request #2341 opened by ToluClassics
September 24, 2024 15:16 4m 22s ToluClassics:main
September 24, 2024 15:16 4m 22s
add a note for missing dependencies (#2336)
Tasks Modified #3415: Commit bc50a9a pushed by baberabb
September 24, 2024 14:13 4m 4s main
September 24, 2024 14:13 4m 4s
Mathvista
Tasks Modified #3414: Pull request #2321 synchronize by baberabb
September 24, 2024 13:29 1m 38s mathvista
September 24, 2024 13:29 1m 38s
Mathvista
Tasks Modified #3413: Pull request #2321 synchronize by baberabb
September 24, 2024 13:15 1m 43s mathvista
September 24, 2024 13:15 1m 43s
Added metric aggregation for leaderboard tasks.
Tasks Modified #3412: Pull request #2340 opened by Am1n3e
September 24, 2024 12:34 1m 47s Am1n3e:add-leaderboard-aggregation
September 24, 2024 12:34 1m 47s
Fixed dummy model (#2339)
Tasks Modified #3411: Commit d7734d1 pushed by baberabb
September 24, 2024 12:08 13s main
September 24, 2024 12:08 13s
Fixed dummy model
Tasks Modified #3410: Pull request #2339 opened by Am1n3e
September 24, 2024 11:58 16s Am1n3e:fix-dummy-model
September 24, 2024 11:58 16s
Add a note for missing dependencies
Tasks Modified #3407: Pull request #2336 opened by eldarkurtic
September 24, 2024 05:14 4m 1s eldarkurtic:fix-leaderboard-docs
September 24, 2024 05:14 4m 1s
mmlu-pro: add newlines to task descriptions (not leaderboard)
Tasks Modified #3406: Pull request #2334 synchronize by baberabb
September 23, 2024 19:46 5m 20s mmlupro_
September 23, 2024 19:46 5m 20s
change glianorex to test split
Tasks Modified #3405: Pull request #2332 synchronize by baberabb
September 23, 2024 16:39 1m 33s glia
September 23, 2024 16:39 1m 33s
change glianorex to test split
Tasks Modified #3404: Pull request #2332 synchronize by baberabb
September 23, 2024 16:36 4m 27s glia
September 23, 2024 16:36 4m 27s
change glianorex to test split
Tasks Modified #3403: Pull request #2332 synchronize by baberabb
September 23, 2024 16:28 1m 39s glia
September 23, 2024 16:28 1m 39s
mmlu-pro: add newlines to task descriptions (not leaderboard)
Tasks Modified #3402: Pull request #2334 opened by baberabb
September 23, 2024 16:08 1m 49s mmlupro_
September 23, 2024 16:08 1m 49s
add newlines to mmlu_pro task descriptions (not leaderboard)
Tasks Modified #3401: Pull request #2333 synchronize by baberabb
September 23, 2024 15:58 1m 57s mmlu_
September 23, 2024 15:58 1m 57s
add newlines to mmlu_pro task descriptions (not leaderboard)
Tasks Modified #3400: Pull request #2333 opened by baberabb
September 23, 2024 15:56 2m 33s mmlu_
September 23, 2024 15:56 2m 33s
change glianorex to test split
Tasks Modified #3399: Pull request #2332 opened by baberabb
September 23, 2024 14:43 1m 39s glia
September 23, 2024 14:43 1m 39s
openai: better error messages; fix greedy matching
Tasks Modified #3398: Pull request #2327 opened by baberabb
September 20, 2024 07:38 11s openai
September 20, 2024 07:38 11s
Fix float limit override
Tasks Modified #3397: Pull request #2325 opened by cjluo-omniml
September 19, 2024 19:48 17s cjluo-omniml:patch-1
September 19, 2024 19:48 17s
Ifeval: Dowload punkt_tab on rank 0
Tasks Modified #3396: Pull request #2267 synchronize by baberabb
September 19, 2024 19:09 2m 4s ifeval_rank
September 19, 2024 19:09 2m 4s
Ifeval: Dowload punkt_tab on rank 0
Tasks Modified #3395: Pull request #2267 synchronize by baberabb
September 19, 2024 19:02 4m 2s ifeval_rank
September 19, 2024 19:02 4m 2s
Mathvista
Tasks Modified #3394: Pull request #2321 synchronize by baberabb
September 19, 2024 07:03 2m 13s mathvista
September 19, 2024 07:03 2m 13s
Mathvista
Tasks Modified #3393: Pull request #2321 synchronize by baberabb
September 18, 2024 21:57 1m 49s mathvista
September 18, 2024 21:57 1m 49s