fix Hyperbolic / Qwen message format #4008

Closed
wants to merge 4 commits into from

Conversation

enyst
Collaborator

@enyst enyst commented Sep 23, 2024

Short description of the problem this fixes or functionality that this introduces. This may be used for the CHANGELOG
Fix Qwen non-vision expected message format.


Give a summary of what the PR does, explaining any non-trivial design decisions

This addresses a regression from #3832. Hyperbolic supports both the vision and the non-vision message formats, but only separately: each LLM accepts the format that matches it. If the LLM doesn't have vision enabled, the API returns 400 for role=user messages sent in the vision-like format, so the compromise that #3832 tried to make doesn't work.

This PR proposes to support both formats for now, with flags that select the format: vision_enabled and cache_enabled. It continues to use the Pydantic serialization mechanism, restructured a bit. The flags are set per Message.
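
To illustrate the idea, here is a minimal sketch of per-message format selection. It is not the code in this PR: it uses stdlib dataclasses as a stand-in for the Pydantic models, and the class names, the `serialize` method, and the exact payload shapes (a plain string for the non-vision format, a list of typed parts for the vision-like format, an Anthropic-style `cache_control` marker) are assumptions made for the example.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, List


@dataclass
class TextContent:
    # Hypothetical content part; cache_prompt marks a part as cacheable.
    text: str
    cache_prompt: bool = False


@dataclass
class Message:
    role: str
    content: List[TextContent] = field(default_factory=list)
    # Per-message flags, named after the ones in the PR description.
    vision_enabled: bool = False
    cache_enabled: bool = False

    def serialize(self) -> Dict[str, Any]:
        # Assumption: either flag requires the list-of-parts format;
        # otherwise fall back to the plain string format.
        if self.vision_enabled or self.cache_enabled:
            return self._list_format()
        return self._string_format()

    def _string_format(self) -> Dict[str, Any]:
        # Non-vision format: content is a single string.
        text = "\n".join(part.text for part in self.content)
        return {"role": self.role, "content": text}

    def _list_format(self) -> Dict[str, Any]:
        # Vision-like format: content is a list of typed parts.
        parts: List[Dict[str, Any]] = []
        for part in self.content:
            item: Dict[str, Any] = {"type": "text", "text": part.text}
            if self.cache_enabled and part.cache_prompt:
                # Assumed Anthropic-style cache marker.
                item["cache_control"] = {"type": "ephemeral"}
            parts.append(item)
        return {"role": self.role, "content": parts}
```

With both flags off, a message serializes to a plain string, which is what a non-vision model would presumably accept; setting `vision_enabled` on the same message switches it to the list form without touching other messages.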


Link of any specific issues this addresses
#4004

@enyst enyst changed the title fix Hyperbolic Qwen message format fix Hyperbolic / Qwen message format Sep 24, 2024
@enyst enyst closed this Sep 28, 2024