feat: self discover agent #3437

sb-git-cloud · 2024-08-17T17:22:37Z

What is the problem that this fixes or functionality that this introduces? Does it fix any open issues?

Implements a Self Discover agent, which provides a step-by-step plan to CodeActAgent. This is related to the open PR #46.

Give a summary of what the PR does, explaining any non-trivial design decisions
The agent first chooses reasoning modules that may be relevant for the user-given task. Then it adapts those to the specific task and finally it creates a plan, which is then sent to CodeAct for execution.

As of now it is in beta version.

Other references

tobitege · 2024-08-17T17:32:01Z

Thanks for building and contributing this! 🤗

agenthub/self_discover_agent/reasoning_action_parser.py

agenthub/self_discover_agent/prompt.py

agenthub/self_discover_agent/reasoning_action.py

tobitege · 2024-08-18T19:30:34Z

agenthub/self_discover_agent/prompt.py

+
+SYSTEM_SUFFIX = """Let the following principles guide your response:
+- Before responding read all information carefully.
+- Take time to think.


Just commenting: I wonder whether this instruction was ever followed. 😂

Yeah good question. I don't have any data to answer...

tobitege

Would love to see some test for this, if possible. Maybe in a follow up PR.

tobitege · 2024-08-18T19:31:15Z

agenthub/self_discover_agent/prompt.py

+You MUST NOT include any other text besides the JSON response.
+"""
+
+# TODO: modify example so that it is consistent with prompting


Do we want to keep this commented out code?

Okay, will remove the commented-out code for now and just leave the Todo comment

li-boxuan · 2024-08-18T21:58:29Z

Can you showcase some simple examples? Screenshots would suffice. Then maybe we could run some evaluation against it. C.C. @xingyaoww

xingyaoww

Thanks for this! Overall this looks good to me! I don't see major road block to get this merged. A few things:

We probably need to generalize the Jinja2 template (used in CodeAct now https://github.com/OpenDevin/OpenDevin/blob/14a4e45cbbbb01eba01b47e5d874c2583decdfd8/agenthub/codeact_agent/codeact_agent.py#L76-L80) at some point to better support different needs - this agent requires multiple different ways to compose prompt. I think we can merge this as is, and figure out ways to optimize later. (I think that's probably the downside of abstraction - you'd step on your own toe 😅)
We should remove the comments.

Btw, @sb-git-cloud if you are interested in running some SWE-Bench eval to validate the agent, i'd be happy to sponsor some API credits - just lmk in the slack :D

xingyaoww · 2024-08-19T01:41:58Z

agenthub/codeact_agent/codeact_agent.py

@@ -195,6 +195,11 @@ def _get_messages(self, state: State) -> list[Message]:
            ),
        ]

+        if 'task' in state.inputs:


Is this absolutely necessary for CodeAct to be delegated by other agents? cc @li-boxuan

I think so, at least last I checked. The task was not accessible to the agent via an event.

I've been thinking we could return it from get_current_user_intent() though. It would be easier to use like goal=state.get_user_intent() for everything, delegate or not.

The issue is, if I recall correctly, that children start with zero history. The task is included in the DelegateAction, but the child doesn't have access to that action. We'll need Li Boxuan to confirm, though.

The contract is messy and there lacks a clear API between agents.

It seems like for now the best is to leave it as is, and once the API between agents is defined we can update the agent with a follow-up PR.

xingyaoww · 2024-08-19T01:42:51Z

agenthub/self_discover_agent/agent.py

+    implement_example = IMPLEMENT_EXAMPLE.format(
+        implement_state_key=SelfDiscoverState.IMPLEMENT.value,
+        task_key=TASK_KEY,
+    )


It will be nice if we can use the newly implemented Jinja template for this agent - but no pressure - we can do it follow up PR.

https://github.com/OpenDevin/OpenDevin/blob/14a4e45cbbbb01eba01b47e5d874c2583decdfd8/agenthub/codeact_agent/codeact_agent.py#L76-L80

If the template is too rigid, we can figure out a way to generalize it a bit

Okay, given your comment below let's defer to a future PR.

xingyaoww · 2024-08-19T01:46:59Z

agenthub/self_discover_agent/prompt.py

+"""
+
+
+def get_prompt(


On a second thought: the prompt here is probably a little bit too complex to support for the Jinja2 template in its current shape - we can defer that to future PR.

xingyaoww · 2024-08-19T01:48:22Z

agenthub/self_discover_agent/prompt.py

+You MUST NOT include any other text besides the JSON response.
+"""
+
+# TODO: modify example so that it is consistent with prompting


tobitege

This PR still needs to register the agent in the agenthub registry (agenthub/__init__.py)

yufansong · 2024-08-19T20:15:56Z

agenthub/self_discover_agent/__init__.py

BTW, can you add some README.md under this folder to explain more context about self-discover agent? I also think we need some evalution result on SWE bench before merge.

A little curious on this, what result are we looking for?

Yes, sounds good. Thanks for the suggestion. I will add a readme and run some evaluations.

deferring to other maintainers

sb-git-cloud · 2024-08-26T00:41:05Z

Thank you all for your reviews! Will work on the related changes.

mamoodi · 2024-09-07T17:37:41Z

Hi @sb-git-cloud thanks so much for your contribution. Just wanted to check if this is something that is generally on your radar?

sb-git-cloud · 2024-09-07T18:02:45Z

Hi @sb-git-cloud thanks so much for your contribution. Just wanted to check if this is something that is generally on your radar?

Thanks for checking in. Yes. Haven't had the chance to run evaluations. Hopefully I will get to it next week.

neubig · 2024-09-23T16:59:47Z

Hey @sb-git-cloud , if you're interested in running evaluations, things are now a bit easier with our remote runtime: https://github.com/All-Hands-AI/OpenHands/tree/main/evaluation/swe_bench#run-inference-on-remoteruntime-experimental

If you'd like to use that we'd be happy to help out.

github-actions · 2024-10-24T01:58:54Z

This PR is stale because it has been open for 30 days with no activity. Remove stale label or comment or this will be closed in 7 days.

github-actions · 2024-10-31T02:00:12Z

This PR was closed because it has been stalled for over 30 days with no activity.

sb-git-cloud changed the title ~~Self discover agent~~ feat: self discover agent Aug 17, 2024

sb-git-cloud mentioned this pull request Aug 17, 2024

Create Self-Discover Prompting Agent #46

Open

enyst reviewed Aug 17, 2024

View reviewed changes

agenthub/self_discover_agent/reasoning_action_parser.py Outdated Show resolved Hide resolved

enyst reviewed Aug 17, 2024

View reviewed changes

agenthub/self_discover_agent/prompt.py Outdated Show resolved Hide resolved

enyst reviewed Aug 17, 2024

View reviewed changes

agenthub/self_discover_agent/prompt.py Outdated Show resolved Hide resolved

enyst reviewed Aug 17, 2024

View reviewed changes

agenthub/self_discover_agent/reasoning_action.py Outdated Show resolved Hide resolved

sb-git-cloud added 21 commits August 18, 2024 11:48

first draft

45d8be7

minor changes. __init__ added

c997267

rename agent

e855f94

first working version with delegating to codeact.

add09b3

first draft

ce0e9e3

minor changes. __init__ added

f766953

rename agent

b62a9df

rename dir

bffe75f

improve prompting

1649e3f

improve select prompt

7f9ce43

first draft

7063b08

minor changes. __init__ added

0124a17

rename agent

444cf1d

first working version with delegating to codeact.

80c60e4

first draft

457dc3c

minor changes. __init__ added

5dc851b

rename agent

0740f0e

rename dir

1b0da31

beta_version

a4df26e

type checking improvement

58ec819

added config to agent init

a8d655d

sb-git-cloud force-pushed the self_discover_agent branch from 500fed6 to a8d655d Compare August 18, 2024 19:24

ci edtis

4dd0052

tobitege reviewed Aug 18, 2024

View reviewed changes

tobitege requested a review from li-boxuan August 18, 2024 20:24

xingyaoww reviewed Aug 19, 2024

View reviewed changes

tobitege previously requested changes Aug 19, 2024

View reviewed changes

yufansong reviewed Aug 19, 2024

View reviewed changes

neubig assigned sb-git-cloud Sep 23, 2024

github-actions bot added the Stale Inactive for 30 days label Oct 24, 2024

github-actions bot closed this Oct 31, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: self discover agent #3437

feat: self discover agent #3437

sb-git-cloud commented Aug 17, 2024

tobitege commented Aug 17, 2024

tobitege Aug 18, 2024

sb-git-cloud Aug 18, 2024

tobitege left a comment

tobitege Aug 18, 2024

xingyaoww Aug 19, 2024

sb-git-cloud Aug 26, 2024 •

edited

Loading

li-boxuan commented Aug 18, 2024

xingyaoww left a comment

xingyaoww Aug 19, 2024

enyst Aug 19, 2024 •

edited

Loading

li-boxuan Aug 20, 2024

sb-git-cloud Aug 26, 2024

xingyaoww Aug 19, 2024

sb-git-cloud Aug 26, 2024

xingyaoww Aug 19, 2024

xingyaoww Aug 19, 2024

tobitege left a comment •

edited

Loading

yufansong Aug 19, 2024

enyst Aug 19, 2024

sb-git-cloud Aug 26, 2024

sb-git-cloud commented Aug 26, 2024

mamoodi commented Sep 7, 2024

sb-git-cloud commented Sep 7, 2024

neubig commented Sep 23, 2024

github-actions bot commented Oct 24, 2024

github-actions bot commented Oct 31, 2024

feat: self discover agent #3437

feat: self discover agent #3437

Conversation

sb-git-cloud commented Aug 17, 2024

tobitege commented Aug 17, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

tobitege left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

sb-git-cloud Aug 26, 2024 • edited Loading

Choose a reason for hiding this comment

li-boxuan commented Aug 18, 2024

xingyaoww left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

enyst Aug 19, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

tobitege left a comment • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

sb-git-cloud commented Aug 26, 2024

mamoodi commented Sep 7, 2024

sb-git-cloud commented Sep 7, 2024

neubig commented Sep 23, 2024

github-actions bot commented Oct 24, 2024

github-actions bot commented Oct 31, 2024

sb-git-cloud Aug 26, 2024 •

edited

Loading

enyst Aug 19, 2024 •

edited

Loading

tobitege left a comment •

edited

Loading