Fix GQA permutation computation and sequential weight initialization / loading when doing TP #362


Triggered via pull request March 27, 2024 14:27
Status: Cancelled
Total duration: 25m 19s
Artifacts
optimum-neuron-tests
0s

Annotations

1 error
optimum-neuron-tests
Canceling since a higher priority waiting request for 'Optimum Neuron - Test optimum.neuron.distributed on Trainium-fix_gqa_compute_query_indicies' exists