Skip to content

Actions: mosaicml/llm-foundry

Docker

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
269 workflow run results
269 workflow run results

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Update README.md (#693)
Docker #256: Commit c60657b pushed by j316chuck
October 26, 2023 03:57 1m 39s main
October 26, 2023 03:57 1m 39s
Make default for cuda_load_lazy false (#694)
Docker #255: Commit 08611b0 pushed by dakinggg
October 26, 2023 02:21 1m 47s main
October 26, 2023 02:21 1m 47s
Add fixtures (#673)
Docker #254: Commit ea3279a pushed by irenedea
October 25, 2023 22:33 2m 7s main
October 25, 2023 22:33 2m 7s
Fix mlflow model logging bug (#692)
Docker #253: Commit bc687b7 pushed by dakinggg
October 25, 2023 18:45 2m 3s main
October 25, 2023 18:45 2m 3s
Allow flash attention 2 and upgrade to transformers 4.34.1 (#672)
Docker #252: Commit d72902a pushed by dakinggg
October 24, 2023 18:25 2m 10s main
October 24, 2023 18:25 2m 10s
Attempt to fix flaky test (#688)
Docker #251: Commit 091ddca pushed by dakinggg
October 23, 2023 21:45 2m 13s main
October 23, 2023 21:45 2m 13s
Tiktoken wrapper add_eos_token option (#681)
Docker #250: Commit f65b07e pushed by rajammanabrolu
October 20, 2023 22:31 54s main
October 20, 2023 22:31 54s
Adding Mosaic logger + logging data validated event (#670)
Docker #249: Commit 459947c pushed by jjanezhang
October 20, 2023 22:00 15m 29s main
October 20, 2023 22:00 15m 29s
add |---| to render tables (#686)
Docker #248: Commit 3e5b960 pushed by vchiley
October 20, 2023 04:33 2m 33s main
October 20, 2023 04:33 2m 33s
Update_pretrain_benchmarks (#543)
Docker #247: Commit b2a43a1 pushed by vchiley
October 19, 2023 23:34 2m 9s main
October 19, 2023 23:34 2m 9s
Add profiler support in llm foundry (#678)
Docker #246: Commit 92bd673 pushed by j316chuck
October 18, 2023 23:25 1m 56s main
October 18, 2023 23:25 1m 56s
Small changes to HF repo update script (#680)
Docker #245: Commit f11483f pushed by dakinggg
October 18, 2023 21:04 2m 34s main
October 18, 2023 21:04 2m 34s
add load_strict_model_weights as an optional config parameter (#655)
Docker #244: Commit 2c5965e pushed by dakinggg
October 18, 2023 01:18 6m 24s main
October 18, 2023 01:18 6m 24s
Add support for automatically registering models to UC at the end of …
Docker #243: Commit cc238a3 pushed by dakinggg
October 17, 2023 04:38 1m 53s main
October 17, 2023 04:38 1m 53s
Convert to DataSpec and add token counts that include padding (#676)
Docker #242: Commit 4fa2dd8 pushed by dakinggg
October 17, 2023 01:23 17m 26s main
October 17, 2023 01:23 17m 26s
small typos in eval readme (#671)
Docker #241: Commit aecadc9 pushed by maxisawesome
October 12, 2023 23:43 2m 5s main
October 12, 2023 23:43 2m 5s
Do not update past_key_values in place (#652)
Docker #240: Commit 3c7421c pushed by irenedea
October 12, 2023 22:07 15m 4s main
October 12, 2023 22:07 15m 4s
Point to composer.callback.Generate (#631)
Docker #239: Commit db2233e pushed by aspfohl
October 12, 2023 00:32 2m 6s main
October 12, 2023 00:32 2m 6s
Fix typo in image name (#669)
Docker #238: Commit 8e4c30a pushed by dakinggg
October 11, 2023 21:59 1m 37s main
October 11, 2023 21:59 1m 37s
Adding Simplified Coding Tasks (#645)
Docker #237: Commit cdb1c28 pushed by bmosaicml
October 11, 2023 20:08 2m 2s main
October 11, 2023 20:08 2m 2s
Add test suite for flash attention 2 (#666)
Docker #236: Commit 0045ae6 pushed by dakinggg
October 11, 2023 17:06 2m 7s main
October 11, 2023 17:06 2m 7s
Inverse Square Root LR Schedule (#657)
Docker #235: Commit 6c98276 pushed by codestar12
October 11, 2023 16:24 2m 10s main
October 11, 2023 16:24 2m 10s
put target back (#668)
Docker #234: Commit bdac4c7 pushed by dakinggg
October 10, 2023 17:07 2m 25s main
October 10, 2023 17:07 2m 25s
fix (#667)
Docker #233: Commit a128340 pushed by dakinggg
October 10, 2023 16:52 1m 33s main
October 10, 2023 16:52 1m 33s
Add images with flash attention 2 (#651)
Docker #232: Commit ba6b880 pushed by dakinggg
October 10, 2023 16:13 17m 17s main
October 10, 2023 16:13 17m 17s