Issues: turboderp/exllamav2
Issues list
[BUG] Failed to quantize Qwen2.5-Math-72B-Instruct: Measurement/inference error (3): hidden_states
bug
Something isn't working
#627
opened Sep 19, 2024 by
Orion-zhen
3 tasks done
[BUG] Quantization of Qwen return garbage
bug
Something isn't working
#621
opened Sep 10, 2024 by
fahadh4ilyas
3 tasks done
Q8 or unquantized cache with what context length for llama 3.1-8b 5.0 bpw exl2?
#575
opened Jul 27, 2024 by
lovebeatz
[ERROR] Worker (pid:25134) was sent SIGKILL! Perhaps out of memory?
#556
opened Jul 18, 2024 by
UTSAV-44