Various fixes for training #570
Triggered via pull request
July 17, 2024 09:32
Status: Cancelled
Total duration: 6h 0m 20s
Artifacts: –
test_trainium_distributed.yml
on: pull_request
Run distributed tests on Trainium 1 (5h 59m)
Annotations
2 errors
Run distributed tests on Trainium 1
Canceling since a higher priority waiting request for 'Optimum Neuron - Test optimum.neuron.distributed on Trainium-fix_mpmd_at_end_of_epoch' exists
Run distributed tests on Trainium 1
The operation was canceled.