[Model] Support Qwen2.5 Instruct and Coder #564

CharlieFRuan · 2024-09-19T06:57:36Z

This PR adds the following models to prebuilt list:

Qwen2.5-0.5B - q0f16, q0f32, q4f16_1, q4f32_1
Qwen2.5-1.5B, 3B, 7B - q4f16_1, q4f32_1
Qwen2.5-Coder-1.5B, 7B - q4f16_1, q4f32_1

Previous Qwen2 models are not deprecated.

All models reuse previous Qwen2 WASMs (except Qwen2.5-3B since Qwen2 does not have a 3B variant, which we compile at head with mlc-ai/binary-mlc-llm-libs#139)

This new version only introduces new models: - #558 - Adds `Hermes-3-Llama-3.1-8B-q4f32_1-MLC` and `Hermes-3-Llama-3.1-8B-q4f16_1-MLC` to the prebuilt - #564 - Add the following models to prebuilt: - Qwen2.5-0.5B - q0f16, q0f32, q4f16_1, q4f32_1 - Qwen2.5-1.5B, 3B, 7B - q4f16_1, q4f32_1 - Qwen2.5-Coder-1.5B, 7B - q4f16_1, q4f32_1 ### TVMjs - Updated to current head: apache/tvm@a242046 - Main change is apache/tvm#17371; without it, it prevents us from installing dependencies when building web-llm

[Model] Support Qwen2.5 instruct and coder

1a85840

CharlieFRuan merged commit 1684e0f into main Sep 19, 2024
1 check passed

CharlieFRuan deleted the pr0918-qwen2_5 branch September 19, 2024 07:02

CharlieFRuan mentioned this pull request Sep 19, 2024

[Version] Bump version to 0.2.63 #565

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Model] Support Qwen2.5 Instruct and Coder #564

[Model] Support Qwen2.5 Instruct and Coder #564

CharlieFRuan commented Sep 19, 2024

[Model] Support Qwen2.5 Instruct and Coder #564

[Model] Support Qwen2.5 Instruct and Coder #564

Conversation

CharlieFRuan commented Sep 19, 2024