LoRA finetuning tutorial #671

Merged: 11 commits into main from lora_tutorial on Sep 18, 2024

Conversation

@michaelbenayoun (Member) commented on Jul 29, 2024:

What does this PR do?

  • Features required for the tutorial
  • Training script available
  • First draft of the tutorial ready
  • Able to properly fine-tune a Llama model
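
For orientation, below is a minimal, hedged sketch of the kind of LoRA + SFT setup the tutorial covers. It uses the generic `peft`/`trl` APIs rather than the Neuron-specific trainer from optimum-neuron, and the dataset, hyperparameters, and output path are illustrative assumptions, not the tutorial's actual values.

```python
# Illustrative only: generic LoRA + SFTTrainer setup, not the tutorial's exact code.
from datasets import load_dataset
from peft import LoraConfig
from trl import SFTConfig, SFTTrainer

# Placeholder dataset with a "text" column; the tutorial uses its own data.
dataset = load_dataset("imdb", split="train")

# LoRA adapter configuration: low-rank updates on the attention projections.
peft_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],
    task_type="CAUSAL_LM",
)

# Basic training arguments; real runs on Trainium go through optimum-neuron.
training_args = SFTConfig(
    output_dir="llama3-8b-lora-sft",
    per_device_train_batch_size=1,
    num_train_epochs=1,
)

trainer = SFTTrainer(
    model="meta-llama/Meta-Llama-3-8B",  # gated model; requires Hub access
    args=training_args,
    train_dataset=dataset,
    peft_config=peft_config,
)
trainer.train()
```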

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@JingyaHuang (Collaborator) left a comment:

The tutorials are great! I left some nits and questions to better understand it, thanks Michael!

Review threads (outdated, resolved) on:
  • docs/source/training_tutorials/finetune_llm.mdx
  • docs/source/training_tutorials/sft_lora_finetune_llm.mdx

To overcome this, we added a [model cache repository](https://huggingface.co/docs/optimum-neuron/guides/cache_system), which allows us to use precompiled models from the Hugging Face Hub to skip the compilation step. But be careful: every change in the model configuration might lead to a new compilation, which could result in some cache misses.

_Note: If your model configuration is not cached, please open an issue on [GitHub](https://github.com/huggingface/optimum-neuron/issues); we are happy to include it._
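
As a hedged illustration of the cache guide linked above, a training script could select a custom cache repository on the Hub before training. `CUSTOM_CACHE_REPO` is the environment variable described in that guide; the repository id below is a hypothetical placeholder.

```python
import os

# Assumption from the linked cache guide: CUSTOM_CACHE_REPO selects the Hub
# repository that holds precompiled Neuron artifacts. Placeholder repo id.
os.environ["CUSTOM_CACHE_REPO"] = "my-org/optimum-neuron-cache"

# Training then proceeds as usual: if the model configuration matches a cached
# compilation, the precompiled graphs are fetched from the Hub; otherwise the
# model is recompiled locally (a cache miss).
```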
Collaborator:

Shall we point readers to the cache repo on the Hub, or not?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I added a link!

Review thread (outdated, resolved): docs/source/training_tutorials/sft_lora_finetune_llm.mdx
@@ -14,6 +14,8 @@
     title: Fine-tune BERT for Text Classification on AWS Trainium
   - local: training_tutorials/finetune_llm
     title: Fine-tune Llama 3 8B on AWS Trainium
+  - local: training_tutorials/sft_lora_finetune_llm
+    title: Fine-tune Llama 3 8B on with LoRA and the SFTTrainer
Member:

Suggested change:
-    title: Fine-tune Llama 3 8B on with LoRA and the SFTTrainer
+    title: Fine-tune Llama 3.1 8B with LoRA and the SFTTrainer

Member:

Should we do 3.1?

Member (author):

I think we could not, because of the RoPE thing? Or because of the transformers version.
Let's keep it like that. In any case we will move to 70B ASAP, and I can try to do 3.1 then.

@JingyaHuang (Collaborator) left a comment:

LGTM!
(Don't worry about the failing inf2 CIs, they will be fixed via #691.)

michaelbenayoun merged commit d55d3ad into main on Sep 18, 2024 (5 of 10 checks passed).
michaelbenayoun deleted the lora_tutorial branch on September 18, 2024 at 09:08.
4 participants