FP8 AllGather Support in Fairscale #1185

Open
wants to merge 21 commits into base: ngoyal_changes_for_pp_fp8_jiecaoyu_debug

Commits on Mar 29, 2024

  1. added option for no PG validation for faster init (#1161); see the sketch after this commit list.

    Co-authored-by: Naman Goyal <[email protected]>
    2 people authored and levendlee committed Mar 29, 2024
    Commit: 73ce4b4
  2. Mirrors Jiecao's change.

    levendlee committed Mar 29, 2024
    Commit: 33457b3
  3. Debug non-determinism issues.

    This commit works with a 4-GPU run on the SMALL model with FSDP and PP
    enabled.
    levendlee committed Mar 29, 2024
    Commit: 70f5ff5
  4. Commit 16c682d
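
The first commit above adds an option to skip process-group (PG) validation during initialization. Below is a minimal sketch of what such an option might look like; `validate_process_group`, `skip_pg_validation`, and `ShardedWrapperSketch` are hypothetical names, not FairScale's actual API, and the check shown (a blocking all-reduce at construction time) is only an illustration of why skipping it makes init faster.

```python
import torch
import torch.distributed as dist


def validate_process_group(device: torch.device, process_group) -> None:
    # Functional sanity check: every rank contributes a one, and the reduced
    # sum must equal the group's world size. It is a blocking collective run
    # at construction time, which is the cost the opt-out flag avoids.
    world_size = dist.get_world_size(process_group)
    t = torch.ones(1, device=device)
    dist.all_reduce(t, group=process_group)
    if int(t.item()) != world_size:
        raise ValueError(
            f"Process group check failed: expected {world_size}, got {int(t.item())}"
        )


class ShardedWrapperSketch:
    def __init__(self, process_group, skip_pg_validation: bool = False,
                 device: torch.device = torch.device("cuda")):
        # skip_pg_validation=True drops the blocking collective above,
        # trading the sanity check for a faster __init__.
        if not skip_pg_validation:
            validate_process_group(device, process_group)
        self.process_group = process_group
```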

Commits on Apr 1, 2024

  1. Cleans up code.

    levendlee committed Apr 1, 2024
    Commit: 24a769f

Commits on Apr 2, 2024

  1. Fix main_grad attribute checking.

    - Clean up the flatten and non_flatten parameter generation logic.
    - Avoid checking whether the `main_grad` attribute is all equal to zeros
      (see the sketch below).
    levendlee committed Apr 2, 2024
    Commit: 3e2e77f
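
A minimal sketch of the check described above, assuming `main_grad` is an extra attribute (for example an FP32 gradient buffer) that the training loop attaches to parameters. The helper names are illustrative, not FairScale's code; the point is to test for the attribute's presence rather than comparing its values to zero.

```python
import torch


def has_main_grad(p: torch.nn.Parameter) -> bool:
    # Presence check only: rely on whether the attribute exists and is
    # populated, instead of testing whether its values are all zero. A
    # value comparison launches an extra device-side reduction per
    # parameter and would misclassify a gradient that is legitimately zero.
    return getattr(p, "main_grad", None) is not None


def collect_grads(params):
    # Prefer p.main_grad when it is present, otherwise fall back to p.grad.
    return [p.main_grad if has_main_grad(p) else p.grad for p in params]
```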

Commits on Apr 9, 2024

  1. Fix no-PP hanging error.

    - Clean up the amax and scale update logic. Amax and scale updates should
      be done for both weights and parameters, so they should happen in the
      forward pass of each microbatch.

    - Consolidate the `cast_params` and `all_gather` streams (see the sketch
      below).
    levendlee committed Apr 9, 2024
    Commit: 1be7aa0
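
A hedged sketch of the two fixes described above: refreshing amax and scale in the forward pass of every microbatch, and running the FP8 cast and the all-gather on one consolidated stream rather than separate `cast_params` and `all_gather` streams. `FP8Meta`, `cast_and_all_gather`, and the scaling recipe are illustrative assumptions, not FairScale's implementation; the snippet also assumes an initialized process group, a CUDA device, and PyTorch 2.1+ for `torch.float8_e4m3fn`.

```python
import torch
import torch.distributed as dist


class FP8Meta:
    """Per-tensor amax/scale state (illustrative)."""

    def __init__(self, margin: int = 0):
        self.margin = margin
        self.amax = torch.zeros(1)
        self.scale = torch.ones(1)

    def update(self, tensor: torch.Tensor) -> None:
        # Refreshed in the forward pass of *every* microbatch, not once per
        # optimizer step, so the scale tracks the current weight values.
        self.amax = tensor.abs().max().float()
        fp8_max = 448.0  # e4m3 max representable value (sketch assumption)
        self.scale = fp8_max / (2.0 ** self.margin) / self.amax.clamp(min=1e-12)


# One stream carries both the cast and the all-gather, so the two operations
# are ordered on a single queue instead of being split across a `cast_params`
# stream and an `all_gather` stream that must be cross-synchronized.
fp8_comm_stream = torch.cuda.Stream()


def cast_and_all_gather(shard: torch.Tensor, meta: FP8Meta, group=None) -> torch.Tensor:
    world_size = dist.get_world_size(group)
    with torch.cuda.stream(fp8_comm_stream):
        meta.update(shard)
        # Stand-in for the real FP8 cast: scale, narrow to e4m3, view as bytes
        # so the collective sees a plain uint8 buffer.
        casted = (shard * meta.scale).to(torch.float8_e4m3fn).view(torch.uint8)
        out = torch.empty(world_size * casted.numel(), dtype=torch.uint8,
                          device=casted.device)
        dist.all_gather_into_tensor(out, casted, group=group)
    # The default stream waits on the consolidated comm stream before using
    # the gathered weights.
    torch.cuda.current_stream().wait_stream(fp8_comm_stream)
    return out
```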

Commits on Apr 10, 2024

  1. Commit 57eb557

Commits on Apr 17, 2024

  1. Commit e9e8f8e
  2. Commit 21f8e05

Commits on May 20, 2024

  1. added option for no PG validation for faster init (#1161)

    Co-authored-by: Naman Goyal <[email protected]>
    2 people authored and levendlee committed May 20, 2024
    Commit: 8ec7c1d
  2. Mirrors Jiecao's change.

    levendlee committed May 20, 2024
    Commit: f27ab17
  3. Debug non-determinism issues.

    This commit works with a 4-GPU run on the SMALL model with FSDP and PP
    enabled.
    levendlee committed May 20, 2024
    Commit: fa9cf77
  4. Commit 6fa19e0
  5. Cleans up code.

    levendlee committed May 20, 2024
    Commit: 80ffd54
  6. Fix main_grad attribute checking.

    - Clean up the flatten and non_flatten parameter generation logic.
    - Avoid checking whether the `main_grad` attribute is all equal to zeros.
    levendlee committed May 20, 2024
    Commit: afb2ca1
  7. Fix no-PP hanging error.

    - Clean up the amax and scale update logic. Amax and scale updates should
      be done for both weights and parameters, so they should happen in the
      forward pass of each microbatch.

    - Consolidate the `cast_params` and `all_gather` streams.
    levendlee committed May 20, 2024
    Commit: 25b2322
  8. Commit 0d1502b
  9. Commit 5edb109
  10. Commit 2df199f
  11. Merge branch 'shikaili_fp8_allgather_no_pp_fix' of
      github.com:facebookresearch/fairscale into shikaili_fp8_allgather_no_pp_fix
    levendlee committed May 20, 2024
    Commit: da36e31