Skip to content

Actions: EleutherAI/lm-evaluation-harness

Tasks Modified

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
2,935 workflow runs
2,935 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Minor features
Tasks Modified #3284: Pull request #2249 synchronize by artemorloff
August 29, 2024 18:26 2m 3s artemorloff:feature/small_fixes
August 29, 2024 18:26 2m 3s
API: fix maxlen; vllm: prefix_token_id bug
Tasks Modified #3283: Pull request #2262 synchronize by baberabb
August 29, 2024 18:15 13s maxlen
August 29, 2024 18:15 13s
API: fix maxlen; vllm: prefix_token_id bug
Tasks Modified #3282: Pull request #2262 opened by baberabb
August 29, 2024 18:14 14s maxlen
August 29, 2024 18:14 14s
Fix loglikelihood_rolling caching ( #1821 ) (#2187)
Tasks Modified #3279: Commit 8138fd5 pushed by baberabb
August 28, 2024 18:51 1m 39s main
August 28, 2024 18:51 1m 39s
Fix loglikelihood_rolling caching ( #1821 )
Tasks Modified #3278: Pull request #2187 synchronize by haileyschoelkopf
August 28, 2024 18:43 1m 37s 1821-fix-rolling-cache
August 28, 2024 18:43 1m 37s
update nltk version to require 3.9.1 (#2259)
Tasks Modified #3277: Commit 2de3688 pushed by baberabb
August 28, 2024 17:17 1m 59s main
August 28, 2024 17:17 1m 59s
Update NLTK version in *ifeval tasks ( #2210 )
Tasks Modified #3276: Pull request #2259 opened by haileyschoelkopf
August 28, 2024 17:07 1m 46s 2210-nltk-punkt-fix
August 28, 2024 17:07 1m 46s
Fix loglikelihood_rolling caching ( #1821 )
Tasks Modified #3275: Pull request #2187 synchronize by baberabb
August 28, 2024 16:43 1m 35s 1821-fix-rolling-cache
August 28, 2024 16:43 1m 35s
Fix loglikelihood_rolling caching ( #1821 )
Tasks Modified #3274: Pull request #2187 synchronize by baberabb
August 28, 2024 16:17 1m 43s 1821-fix-rolling-cache
August 28, 2024 16:17 1m 43s
[Draft] More descriptive simple_evaluate() LM TypeError (#2258)
Tasks Modified #3273: Commit 40010ec pushed by baberabb
August 28, 2024 15:43 15s main
August 28, 2024 15:43 15s
Introduce perplexity per token in loglikelihood_rolling
Tasks Modified #3263: Pull request #2132 synchronize by dtamayo-nlp
August 26, 2024 13:12 1m 45s dtamayo-nlp:main
August 26, 2024 13:12 1m 45s
[Draft] llm-as-judge
Tasks Modified #3259: Pull request #2251 synchronize by baberabb
August 26, 2024 03:10 2m 20s bjudge
August 26, 2024 03:10 2m 20s
[Draft] llm-as-judge
Tasks Modified #3258: Pull request #2251 synchronize by baberabb
August 26, 2024 03:02 1m 42s bjudge
August 26, 2024 03:02 1m 42s
[Draft] llm-as-judge
Tasks Modified #3257: Pull request #2251 opened by baberabb
August 25, 2024 21:46 1m 38s bjudge
August 25, 2024 21:46 1m 38s
chat template hotfix (#2250)
Tasks Modified #3256: Commit ebe7226 pushed by lintangsutawika
August 25, 2024 19:55 1m 37s main
August 25, 2024 19:55 1m 37s
chat template hotfix
Tasks Modified #3255: Pull request #2250 synchronize by baberabb
August 25, 2024 19:54 2m 17s hotfix
August 25, 2024 19:54 2m 17s
chat template hotfix
Tasks Modified #3254: Pull request #2250 opened by baberabb
August 25, 2024 19:53 12s hotfix
August 25, 2024 19:53 12s
Minor features
Tasks Modified #3253: Pull request #2249 opened by artemorloff
August 25, 2024 14:25 1m 28s artemorloff:feature/small_fixes
August 25, 2024 14:25 1m 28s
Created new task for testing Llama on Asdiv (#2236)
Tasks Modified #3244: Commit aab42ba pushed by haileyschoelkopf
August 23, 2024 18:47 2m 4s main
August 23, 2024 18:47 2m 4s
Created new task for testing Llama on Asdiv
Tasks Modified #3243: Pull request #2236 synchronize by haileyschoelkopf
August 23, 2024 18:47 2m 1s Cameron7195:main
August 23, 2024 18:47 2m 1s
fix group args of mmlu and mmlu_pro (#2245)
Tasks Modified #3235: Commit 5ad23ec pushed by lintangsutawika
August 23, 2024 11:55 3m 47s main
August 23, 2024 11:55 3m 47s
fix group args of mmlu and mmlu_pro
Tasks Modified #3234: Pull request #2245 opened by eyuansu62
August 23, 2024 10:58 2m 9s baai-open-internal:mmlu_pro
August 23, 2024 10:58 2m 9s