Actions: EleutherAI/lm-evaluation-harness

Tasks Modified

1,672 workflow run results

Cont metrics
Tasks Modified #1834: Pull request #1475 synchronize by lintangsutawika
February 26, 2024 15:44 1m 50s cont-metrics
Cont metrics
Tasks Modified #1833: Pull request #1475 synchronize by lintangsutawika
February 26, 2024 14:56 1m 39s cont-metrics
Create a means for caching task registration and request building. Ad…
Tasks Modified #1832: Commit 1e6c927 pushed by haileyschoelkopf
February 26, 2024 14:54 2m 22s main
Create a means for caching task registration and request building. Ad…
Tasks Modified #1831: Pull request #1372 synchronize by haileyschoelkopf
February 26, 2024 14:36 1m 50s inf3rnus:main
Revert "setting trust_remote_code (#1467)" (#1474)
Tasks Modified #1830: Commit f6befdb pushed by lintangsutawika
February 26, 2024 14:21 1m 35s main
Add Gemma support (Add flag to control BOS token usage) (#1465)
Tasks Modified #1828: Commit 4c51111 pushed by haileyschoelkopf
February 26, 2024 14:02 1m 48s main
[Refactor] Continuous Metrics (#969)
Tasks Modified #1827: Commit 967eb4f pushed by lintangsutawika
February 26, 2024 13:29 1m 55s big-refactor
[Refactor] Continuous Metrics
Tasks Modified #1826: Pull request #969 synchronize by lintangsutawika
February 26, 2024 13:17 2m 10s cont-metrics
add arabic mmlu (#1402)
Tasks Modified #1825: Commit 7de7b27 pushed by lintangsutawika
February 26, 2024 13:14 2m 54s main
Add Gemma support (Add flag to control BOS token usage)
Tasks Modified #1824: Pull request #1465 synchronize by lintangsutawika
February 26, 2024 13:06 1m 42s add-bos-token
setting trust_remote_code (#1467)
Tasks Modified #1823: Commit c1145df pushed by lintangsutawika
February 26, 2024 13:05 2m 3s main
Apply code autoformatting with Ruff to tasks/*.py an *__init__.py (#1…
Tasks Modified #1822: Commit d27c0c0 pushed by lintangsutawika
February 26, 2024 13:00 2m 19s main
Refactor evaluater.evaluate
Tasks Modified #1814: Pull request #1441 synchronize by baberabb
February 24, 2024 17:31 1m 50s baberabb:eval
Refactor evaluater.evaluate
Tasks Modified #1813: Pull request #1441 synchronize by baberabb
February 24, 2024 17:18 2m 19s baberabb:eval
Add environment and transformers version logging in results dump (#1464)
Tasks Modified #1812: Commit f78e2da pushed by haileyschoelkopf
February 24, 2024 17:01 16s main
Add environment and transformers version logging in results dump
Tasks Modified #1811: Pull request #1464 synchronize by LSinev
February 23, 2024 22:34 15s LSinev:more-env-logging
Adding documentation for Weights and Biases CLI interface (#1466)
Tasks Modified #1810: Commit eacb74e pushed by haileyschoelkopf
February 23, 2024 20:36 15s main
Add environment and transformers version logging in results dump
Tasks Modified #1808: Pull request #1464 synchronize by LSinev
February 23, 2024 18:48 17s LSinev:more-env-logging
Add Gemma support (Add flag to control BOS token usage)
Tasks Modified #1807: Pull request #1465 opened by haileyschoelkopf
February 23, 2024 18:39 2m 13s add-bos-token
Refactor evaluater.evaluate
Tasks Modified #1805: Pull request #1441 synchronize by baberabb
February 23, 2024 16:51 1m 50s baberabb:eval