Our finetuning focus has been a bit lacking lately. Let's change that, starting with LoRA!
1. Run a finetuning baseline with a Pythia model that saturates VRAM; measure its TFLOPS and VRAM usage in wandb and link it here. Choose a small finetuning dataset for this that requires little compute to converge without being trivial.
2. Add prototype LoRA support to gpt-neox.
3. Compare to the baseline in step 1 and ensure the TFLOPS/VRAM changes make sense.
4. Compare to the baseline in step 1 and ensure loss is maintained.
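For reference on step 2, the core LoRA idea can be sketched as below. This is a minimal numpy illustration under assumed shapes (`d_in`, `d_out`, rank `r`, scaling `alpha` are placeholders), not the eventual gpt-neox implementation: a frozen weight `W` is augmented with a trainable low-rank update `(alpha / r) * B @ A`, with `B` zero-initialized so the adapted layer starts out identical to the pretrained one.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical shapes for illustration only.
d_in, d_out, r, alpha = 8, 8, 2, 4

W = rng.standard_normal((d_out, d_in))      # frozen pretrained weight
A = rng.standard_normal((r, d_in)) * 0.01   # trainable down-projection, small init
B = np.zeros((d_out, r))                    # trainable up-projection, zero init

def lora_forward(x):
    # y = W x + (alpha / r) * B (A x); only A and B receive gradients.
    return W @ x + (alpha / r) * (B @ (A @ x))

x = rng.standard_normal(d_in)
# At init B == 0, so the LoRA layer reproduces the frozen layer exactly,
# which is what makes the loss comparison in step 4 meaningful.
assert np.allclose(lora_forward(x), W @ x)
```

This also suggests why the TFLOPS/VRAM deltas in step 3 should be small: the extra parameters are `r * (d_in + d_out)` per adapted matrix rather than `d_in * d_out`.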
Thanks for creating the feature request! As discussed, I'm happy to take this one on :) I expect to start work on it late this week / early next week.
As discussed on Discord, @mkerin.