You are calling save_pretrained to a 4-bit converted model, but your bitsandbytes version doesn't support it. #3951

Open
shripadk opened this issue Mar 3, 2024 · 4 comments
Labels: llm (Large Language Model related)

shripadk commented Mar 3, 2024

Describe the bug

I have enabled 4-bit quantization for fine-tuning mistralai/Mistral-7B-v0.1. Ludwig 0.10.1 appears to pin bitsandbytes < 0.41.0, and when I run the trainer I get the following warning:

```
You are calling `save_pretrained` to a 4-bit converted model, but your `bitsandbytes` version doesn't support it.
If you want to save 4-bit models, make sure to have `bitsandbytes>=0.41.3` installed.
```
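
This warning is emitted by Hugging Face transformers, which requires bitsandbytes >= 0.41.3 to serialize 4-bit weights, so a bitsandbytes pinned below 0.41.0 can never satisfy it. For reference, a quick diagnostic sketch to confirm which versions the environment actually resolved:

```python
# Diagnostic sketch: print the installed versions to confirm the conflict
# between Ludwig's bitsandbytes pin (< 0.41.0) and the >= 0.41.3 that
# transformers needs to save 4-bit checkpoints.
import importlib.metadata

for pkg in ("ludwig", "transformers", "bitsandbytes"):
    try:
        print(f"{pkg}=={importlib.metadata.version(pkg)}")
    except importlib.metadata.PackageNotFoundError:
        print(f"{pkg} is not installed")
```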

To Reproduce
Steps to reproduce the behavior:

1. Install Ludwig:

```
pip install ludwig[full]
```

2. Create the config file (model.yaml):

```yaml
model_type: llm
base_model: mistralai/Mistral-7B-v0.1

quantization:
  bits: 4

adapter:
  type: lora

prompt:
  template: |
    ### Instruction:
    {instruction}

    ### Input:
    {input}

    ### Response:

input_features:
  - name: prompt
    type: text

output_features:
  - name: output
    type: text

generation:
  temperature: 0.1

trainer:
  type: finetune
  epochs: 3
  optimizer:
    type: paged_adam
  batch_size: 1
  eval_steps: 100
  learning_rate: 0.0002
  eval_batch_size: 2
  steps_per_checkpoint: 1000
  learning_rate_scheduler:
    decay: cosine
    warmup_fraction: 0.03
  gradient_accumulation_steps: 16
  enable_gradient_checkpointing: true

preprocessing:
  sample_ratio: 0.1
```

3. Train the model:

```
ludwig train --config model.yaml --dataset "ludwig://alpaca"
```
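
The same run can also be driven from the Ludwig Python API. A minimal sketch, assuming the model.yaml from step 2 is in the working directory:

```python
# Sketch of the programmatic equivalent of `ludwig train`.
# Assumes model.yaml (step 2 above) is in the current directory.
from ludwig.api import LudwigModel

model = LudwigModel(config="model.yaml")
results = model.train(dataset="ludwig://alpaca")
```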

Expected behavior
The trainer should not warn that the installed bitsandbytes version does not support save_pretrained for 4-bit quantized models.

Environment (please complete the following information):

  • OS: Linux
  • Version: 6.7.6-arch1-1
  • Python: 3.10.8
  • Ludwig: v0.10.1

@alexsherstinsky

yogeshhk commented Mar 4, 2024

Here is the notebook showing the run. The first run asked for a RESTART; after restarting and running all the cells, the output is here: https://colab.research.google.com/drive/1kmZhQKBzpHBJRJvvp9PEdPEUMfMu6dh7?usp=sharing. Just FYI: the output of the model is "","", but that's most likely an issue with the base model! @shripadk @alexsherstinsky

alexsherstinsky (Collaborator) commented

@shripadk Are you still having this issue? A new version of Ludwig will be released next week, so you may wish to try again. Please keep an eye on the release announcement in our Discord. Thank you!

alexsherstinsky added the llm (Large Language Model related) label on Jul 26, 2024
shripadk (Author) commented

@alexsherstinsky Thanks for the heads-up. I'll definitely take a look at it and get back to you on this, and I'll keep an eye out for the release announcement. Thanks again 🎉
