
[Feature] LISA multi GPU support #899

Open · wants to merge 2 commits into base: main

Conversation

@wheresmyhair (Collaborator) commented on Sep 25, 2024

Description

LISA now supports multi-GPU training.
Key points:

  1. When models are initialized, DeepSpeed wraps all model parameters with the optimizer, which consumes a huge amount of GPU memory. Under LISA's logic, however, we do not need to wrap every parameter, since only the activated layers are updated. We therefore hack the initialization so that the trainer wraps only one layer at the beginning (to avoid a CUDA OOM error); a sketch of the idea follows this list.
  2. Training arguments currently cannot be saved when using LISATrainer, because pickling them raises an error. Model weights are saved correctly.
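
For reference, here is a minimal sketch of the one-layer-wrap idea from point 1. It is illustrative only, not this PR's actual code: it assumes a LLaMA-style Hugging Face model whose transformer blocks live at `model.model.layers` (the `_get_blocks` helper is hypothetical), and it omits the extra work needed to keep DeepSpeed's optimizer state in sync when the active layers change.

```python
import random
from torch import nn

def _get_blocks(model: nn.Module) -> nn.ModuleList:
    # Hypothetical helper: this path is an assumption for LLaMA-like
    # models; other architectures keep their blocks elsewhere.
    return model.model.layers

def freeze_all_but_one_layer(model: nn.Module) -> None:
    """Run before the trainer builds its optimizer: leave exactly one
    block trainable, so optimizer state is allocated for one layer
    instead of the whole model (the source of the CUDA OOM above)."""
    for p in model.parameters():
        p.requires_grad = False
    for p in _get_blocks(model)[0].parameters():
        p.requires_grad = True

def switch_active_layers(model: nn.Module, n_active: int = 2) -> None:
    """LISA step, run every K optimizer steps: re-sample which blocks
    are trainable. Keeping the optimizer's parameter groups consistent
    with the newly activated layers is the part the PR has to hack."""
    blocks = _get_blocks(model)
    for p in model.parameters():
        p.requires_grad = False
    for idx in random.sample(range(len(blocks)), n_active):
        for p in blocks[idx].parameters():
            p.requires_grad = True
```

In real LISA the embedding and head layers typically stay trainable throughout; the sketch skips that detail for brevity.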

@wheresmyhair mentioned this pull request on Sep 25, 2024
@research4pan (Contributor) left a comment

src/lmflow/pipeline/finetuner.py

  • [Style] lines 423-424, 428, 486: remove the print calls or change them to logger.info (see the sketch below).
  • [Question] It is highly recommended to add unit tests for this script.
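
For the style point, a hedged before/after illustration; the variable and message are placeholders, not the actual contents of those lines:

```python
import logging

logger = logging.getLogger(__name__)

active_layers = [3, 7]  # placeholder value for illustration

# Before (flagged in review): print(f"Activating layers {active_layers}")
# After: route the message through the module-level logger instead.
logger.info("Activating layers %s", active_layers)
```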
