The ckpt of the quantized OPT model cannot be found #53
Comments
We have released the ckpts of OPT. You can find them at https://huggingface.co/ChenMnZ/OmniQuant/tree/main.
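For anyone else looking for them, here is a minimal sketch of fetching the released parameters from that repo with `huggingface_hub`. The file name below is a placeholder (check the repo listing for the exact checkpoint), and how `main.py` consumes the file is up to OmniQuant's own loading code:

```python
# Minimal sketch: download released OmniQuant parameters from the repo above.
from huggingface_hub import hf_hub_download
import torch

ckpt_path = hf_hub_download(
    repo_id="ChenMnZ/OmniQuant",
    filename="opt-13b-w4a4.pth",  # placeholder name; verify against the repo listing
)

# The checkpoint contents are whatever OmniQuant saves (e.g. its learned
# quantization parameters); inspect the keys before wiring it into main.py.
state = torch.load(ckpt_path, map_location="cpu")
print(ckpt_path, type(state))
```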
The results obtained by loading the ckpt are not consistent with the results reported in the paper.
Can you reproduce the results of models other than OPT-30B?
@linloong Can you provide the training script?
Sure:

```bash
CUDA_VISIBLE_DEVICES=0 python main.py \
    --model facebook/opt-13b \
    --epochs 20 --output_dir ./log/opt-13b-w4a4 \
    --wbits 4 --abits 4 --lwc --let --alpha 0.75
```
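As a reading aid (not from the thread): `--wbits`/`--abits` set the weight and activation bit-widths, `--lwc` and `--let` enable OmniQuant's learnable weight clipping and learnable equivalent transformation, and the exact role of `--alpha` is best checked against the argument parser in `main.py`.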
We have tried to reproduce the quantization of OPT-30B, but it is still difficult for us. Would you mind releasing the details of the procedure?