
Refactor messages serialization #3832

Merged: 19 commits merged into main from enyst/messages on Sep 18, 2024
Conversation

@enyst (Collaborator) commented Sep 11, 2024

CHANGELOG
Refactor the serialization of messages for vision, prompt caching, and Groq incompatibilities.

  • fix a potential bug with a shared list
  • refactor the formatting we need to do into the serialization methods on the Message classes

Give a summary of what the PR does, explaining any non-trivial design decisions
Groq documents on their site that their API is OpenAI-compatible, listing only a few limitations that don't affect us. Sadly, that's not quite right: with the vision format, their API returns 400 errors because it requires both system and assistant messages to be simple strings (as opposed to lists of text/image content). Cc: @tobitege. I had a wild hope that it was only the system message, but the fact that the assistant message is also unsupported makes it impossible to fully merge the two implementations for now...

It's possible that the vision models are still too new, and Groq and/or liteLLM haven't yet adapted to all the changes. I'll follow up on that... In the meantime, we have a potential bug in the Message class, and I took the opportunity to synchronize the two implementations (vision and non-vision) we have.

Tested with: Groq/Llama 3.1 70B, Gemini 1.5 Flash (AI Studio), o1, Sonnet 3.5

Part of #3812

@tobitege (Collaborator) commented:

The format_message method is as big as it is for a reason: it took hours of tweaking and testing to make it work with the cases at hand. You also need to try a full integration regenerate test with this. 😬

@@ -78,6 +77,29 @@ def get_log_id(prompt_log_name):
return match.group(1)


def _format_messages(messages):
Contributor commented:

What's the thought process behind mocking this out, compared to the previous usage that called the actual format_messages?

@enyst (Collaborator, Author) replied:

🤔 The reason something like this is used in integration tests is that they compare the prompt created and sent to a mock LLM by the agent against existing log files of the same thing - files that log prompts. So this is imitating the logging into a file...
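
For context, here is a minimal sketch of what such a test-side helper could look like. The name _format_messages comes from the diff above, but the body and the exact log layout are assumptions for illustration, not the actual OpenHands code: it flattens each message into the plain-text shape the prompt log files use, so logged prompts can be compared.

# Hypothetical sketch: flatten messages into the plain-text form the
# prompt logs use. Separators and exact layout are assumptions.
def _format_messages(messages):
    lines = []
    for message in messages:
        content = message.get('content', '')
        if isinstance(content, list):
            # vision-style content: keep only the text parts
            content = '\n'.join(
                part.get('text', '')
                for part in content
                if part.get('type') == 'text'
            )
        lines.append(f"{message['role']}:\n{content}")
    return '\n\n'.join(lines)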

@li-boxuan (Collaborator) left a comment:

I don't understand the context of this PR, so my comments are just style nitpicks.

config.template.toml (review thread resolved)
'role': role,
'content': content_str,
}
elif self.role == 'assistant' and not self.contains_image:
Collaborator commented:

nit: this branch seems mergeable with the previous branch

@enyst (Collaborator, Author) replied:

You're right. I'd suggest keeping it, at least for now, though: the branching helps my brain see how we serialize each of the three roles, and I suspect we'll continue to need to look at / tweak this code...

@tobitege (Collaborator) commented Sep 17, 2024

Before we merge this, I'd like this PR to get tested against Gemini, a Llama 3.1 model on Groq, and of course Sonnet.
All, if possible, with vision on and off (where supported) and caching on and off (Sonnet).

I guess this could be a manual workflow file calling a single .py test file (in the /tests folder) that we can improve upon and run manually from CI.
I'm just not sure how to deal with different LLM configs in the CLI (no toml, but coded?), or whether we can use the all-hands proxy for all models?

What do you guys think? @enyst @xingyaoww

@neubig (Contributor) commented Sep 17, 2024

Hey @tobitege, I understand the feeling, but I also think that might be a bit of a heavy lift for getting this PR integrated. Maybe we could just make a best effort for this PR and put the rest on the list of enhancements for the future.

@tobitege (Collaborator) left a comment:

LGTM

@neubig (Contributor) commented Sep 18, 2024

@enyst : please feel free to merge if you think this is ready.

@enyst (Collaborator, Author) commented Sep 18, 2024

To clarify the issue here a bit, for the record: since the introduction of vision support, we have changed the format in which we send messages to the LLM. We started sending the OpenAI-compatible vision format, like:

{'content': [{'type': 'text', 'text': 'Ask me what your task is'}, ...], 'role': 'user'}

It has a list of dicts of content types, instead of the old, simpler format where 'content' is just a single string. The content types are 'text' and 'image_url'.
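
For comparison, the same message in the old, simpler format (content as a single string):

{'content': 'Ask me what your task is', 'role': 'user'}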

Things seemed to work, until they didn't: in reality, as @tobitege already found and fixed, multiple providers don't support this format, or don't support it fully. They appear to support it for role: user, but not for role: system and, sadly, not for role: assistant. The fix did what was necessary: this situation basically requires us to support two kinds of serialization, pending future fixes from litellm/providers. Tobi did the hard work on this and restored the non-vision format so that things work.

This PR merely aims to simplify that first take:

  • it settles on a compromise: serialize user messages in the vision-like format, but the other roles as plain strings when vision isn't enabled (see the sketch after this list). This works in all cases I've seen so far.
  • it refactors how we do serialization: it was in two places, now it's in one place, pydantic-decorated Message serializers.
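
A minimal sketch of the shape this takes, assuming pydantic v2's model_serializer; the class fields and the vision flag are illustrative assumptions, not the exact OpenHands classes:

from pydantic import BaseModel, model_serializer

class TextContent(BaseModel):
    type: str = 'text'
    text: str

class Message(BaseModel):
    role: str  # 'system' | 'user' | 'assistant'
    content: list[TextContent]
    vision_enabled: bool = False  # illustrative flag, not the real field name

    @model_serializer
    def serialize_model(self) -> dict:
        # user messages keep the vision-style list-of-dicts shape;
        # system/assistant fall back to plain strings unless vision is on
        if self.role == 'user' or self.vision_enabled:
            return {
                'role': self.role,
                'content': [c.model_dump() for c in self.content],
            }
        return {
            'role': self.role,
            'content': '\n'.join(c.text for c in self.content),
        }

With something like this, Message(...).model_dump() yields the provider-ready dict in one place, instead of formatting messages at each call site.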

Note: I think we had some code in serialization that was used only in integration tests. This PR moves it to the tests. IMHO the code had become way too complex, and we'll be happier if we keep the core code doing core stuff and the tests doing test stuff.

@enyst merged commit 8fdfece into main on Sep 18, 2024
13 checks passed
@enyst deleted the enyst/messages branch on September 18, 2024 at 21:48