
Fix maximum seqlen for gptq quantization #1748

Merged
merged 1 commit into from
Mar 18, 2024

Conversation

SunMarc
Member

@SunMarc SunMarc commented Mar 7, 2024

What does this PR do?

This PR sets a maximum seqlen for creating the calibration dataset. Models like Mistral have a seqlen of 32768, which causes issues when building the calibration dataset; the calibration samples do not need such a large seqlen anyway.
Fixes huggingface/transformers#29494
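The idea behind the fix can be sketched as follows. This is a minimal illustration, not the actual optimum code: the cap value (2048) and the function name `effective_seqlen` are assumptions chosen for the example.

```python
# Hedged sketch: cap the model's advertised seqlen before building the
# GPTQ calibration dataset. The cap (2048) and function name are
# illustrative assumptions, not the values used in the merged PR.

MAX_CALIBRATION_SEQLEN = 2048  # assumed cap, plenty for calibration samples

def effective_seqlen(model_seqlen: int, cap: int = MAX_CALIBRATION_SEQLEN) -> int:
    """Return the sequence length to use when creating calibration data."""
    # Clamp: a model advertising a 32768-token context still calibrates
    # on much shorter windows.
    return min(model_seqlen, cap)

print(effective_seqlen(32768))  # clamped to the cap
print(effective_seqlen(1024))   # shorter models are unaffected
```

With this clamp, a Mistral-style 32768-token context no longer forces the calibration dataset to allocate oversized samples.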

@SunMarc SunMarc requested a review from fxmarty March 7, 2024 20:39
@fxmarty fxmarty merged commit 9ff5ea8 into huggingface:main Mar 18, 2024
37 of 45 checks passed
young-developer pushed a commit to young-developer/optimum that referenced this pull request May 10, 2024
Development

Successfully merging this pull request may close these issues.

AutoGPTQ quantization stucks without any progress
2 participants