This repository has been archived by the owner on May 28, 2024. It is now read-only.
What's Changed
- Add tensorrt-llm backend (v0.6.1).
- Add embedding backend.
- Add Mixtral serve config.
- Upgrading vllm support to (v0.2.5)
- Upgrading ray to v2.9.1
Thanks for contributions from:
@avnishn
@csivanich
@sihanwang41
@Yard1
@tterrysun