fix Hyperbolic / Qwen message format #4008

enyst · 2024-09-23T18:11:02Z

Short description of the problem this fixes or functionality that this introduces. This may be used for the CHANGELOG
Fix Qwen non-vision expected message format.

Give a summary of what the PR does, explaining any non-trivial design decisions

This addresses a regression from #3832: Hyperbolic supports both vision and non-vision formats, but separately, each for the respective LLM. If the LLM doesn't have vision enabled, the API returns 400 for role=user messages in vision-like format. So the compromise that PR 3832 tried to make doesn't work.

This PR proposes to support the two formats for now with flags for the type of format: vision_enabled and cache_enabled. It continues to use Pydantic serialization mechanism, restructured a bit. The flag is set per Message.

Link of any specific issues this addresses
#4004

enyst added 4 commits September 23, 2024 19:57

fix Hyperbolic Qwen message format

673a92e

add test, fix test

67de279

add test

0bec9cd

Merge branch 'main' of github.com:All-Hands-AI/OpenHands into enyst/qwen

66efa14

enyst mentioned this pull request Sep 24, 2024

Vision and prompt caching fixes #4014

Merged

enyst changed the title ~~fix Hyperbolic Qwen message format~~ fix Hyperbolic / Qwen message format Sep 24, 2024

enyst closed this Sep 28, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix Hyperbolic / Qwen message format #4008

fix Hyperbolic / Qwen message format #4008

enyst commented Sep 23, 2024 •

edited

Loading

fix Hyperbolic / Qwen message format #4008

fix Hyperbolic / Qwen message format #4008

Conversation

enyst commented Sep 23, 2024 • edited Loading

enyst commented Sep 23, 2024 •

edited

Loading