Hi everyone!
I was reading about BitNet (1.58-bit) quantization, and I was wondering whether it would improve inference time for this model. That would help with real-time translation.
Let me know what you think; I'm happy to help with that!
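For context, 1.58-bit quantization constrains each weight to the ternary set {-1, 0, +1}, so matrix multiplies reduce to additions, subtractions, and skips plus a single scale. Below is a minimal sketch of the absmean quantization scheme described in the BitNet b1.58 paper; the function name and shapes are illustrative, not part of any existing codebase:

```python
import numpy as np

def absmean_ternary_quantize(W: np.ndarray):
    """Quantize a weight matrix to {-1, 0, +1} (absmean scheme,
    as described in the BitNet b1.58 paper). Returns the ternary
    matrix and the per-tensor scale needed to dequantize."""
    gamma = np.mean(np.abs(W)) + 1e-8   # per-tensor absmean scale
    Wq = np.clip(np.round(W / gamma), -1, 1)
    return Wq, gamma

# Toy example: the dequantized matmul approximates the full-precision one.
rng = np.random.default_rng(0)
W = rng.normal(size=(4, 8)).astype(np.float32)
x = rng.normal(size=(8,)).astype(np.float32)

Wq, gamma = absmean_ternary_quantize(W)
y_approx = (Wq * gamma) @ x   # at inference, Wq @ x needs no multiplications
y_full = W @ x
```

The potential speedup comes from replacing floating-point multiplies with integer adds and from the much smaller memory footprint (well under 2 bits/weight), which matters for memory-bandwidth-bound decoding. Note that BitNet models are trained with these ternary weights from scratch; simply post-training-quantizing an existing checkpoint this way would likely hurt quality significantly.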