Skip to content
This repository has been archived by the owner on May 28, 2024. It is now read-only.

Merge from master #25

Merged
merged 23 commits into from
Jul 21, 2023
Merged

Merge from master #25

merged 23 commits into from
Jul 21, 2023

Conversation

akshay-anyscale
Copy link
Collaborator

No description provided.

Yard1 and others added 23 commits July 10, 2023 17:54
Signed-off-by: Antoni Baum <[email protected]>
Signed-off-by: Max Pumperla <[email protected]>
Signed-off-by: Antoni Baum <[email protected]>
Aviary currently combines all `LLMDeployments` and the `Router`
deployment into a single Serve app. When a request arrives, the `Router`
determines which `LLMDeployment` should process it and it sends the
request to that deployment over a Serve handle. However, due to a
limitation in Ray and Ray Serve, deployments called from other
deployments using a Serve Handle cannot stream their responses. This
means the current single-app architecture cannot support streaming.

This change enables streaming by splitting the `Router` and each
`LLMDeployment` into its own Serve app. To work around the limitation,
whenever the `Router` app receives an HTTP request, it creates a new
HTTP request over `localhost` to a streaming `LLMDeployment`, receives
the response stream, and sends a `StreamingResponse` containing that
stream back to the original client. This workaround is meant to be
temporary and should be replaced with a longer-term solution once the
Ray Serve limitation is fixed.

---------

Signed-off-by: Max Pumperla <[email protected]>
Signed-off-by: Shreyas Krishnaswamy <[email protected]>
Signed-off-by: Antoni Baum <[email protected]>
Co-authored-by: Max Pumperla <[email protected]>
Co-authored-by: Kyle Huang <[email protected]>
Co-authored-by: M Waleed Kadous <[email protected]>
Co-authored-by: sijie <[email protected]>
Co-authored-by: Antoni Baum <[email protected]>
Signed-off-by: Antoni Baum <[email protected]>
Signed-off-by: Antoni Baum <[email protected]>
Signed-off-by: Antoni Baum <[email protected]>
This change adds `locustfiles` that can load test Aviary.

---------

Signed-off-by: Antoni Baum <[email protected]>
Signed-off-by: Shreyas Krishnaswamy <[email protected]>
Co-authored-by: Antoni Baum <[email protected]>
Signed-off-by: Antoni Baum <[email protected]>
Signed-off-by: Antoni Baum <[email protected]>
Signed-off-by: Antoni Baum <[email protected]>
Signed-off-by: Antoni Baum <[email protected]>
Signed-off-by: Antoni Baum <[email protected]>
Signed-off-by: Antoni Baum <[email protected]>
@akshay-anyscale akshay-anyscale merged commit 71072cd into aviary-template Jul 21, 2023
avnishn pushed a commit that referenced this pull request Aug 4, 2023
Signed-off-by: Antoni Baum <[email protected]>
pcmoritz pushed a commit that referenced this pull request Aug 4, 2023
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

6 participants