Add more details about prompt format in the docs #126

alanwguo · 2024-01-24T01:10:12Z

Trying to make it easier for users to self-service add custom models to use with ray-llm.

Signed-off-by: Alan Guo <[email protected]>

alanwguo · 2024-01-24T01:10:42Z

models/README.md

@@ -69,6 +69,24 @@ RayLLM supports continuous batching, meaning incoming requests are processed as
 You can follow the TensorRT-LLM example to generate the model.(https://github.com/NVIDIA/TensorRT-LLM/tree/v0.6.1/examples/llama). After generating the model, you can upload the model artifact to S3 and use the `s3_mirror_config` to load the model from S3. You can also place the model artifacts in a local directory and use the `model_local_path` to load the model from the local directory. See the [llama example](continuous_batching/trtllm-meta-llama--Llama-2-7b-chat-hf.yaml) for more details.


+#### Prompt Format
+A prompt format is used to convert a chat completions API input into a prompt to feed into the LLM engine. The format is a dictionary where the key refers to one of the chat actors and the value is a string template for which to convert the text of the actor into a string to add to the overall prompt. The template is used to generate a portion of the prompt and each portion is assembled together to form the final prompt.


I wasn't sure if this prompt_format is only used in the ChatCompletions API.

I assume it is

Signed-off-by: Alan Guo <[email protected]>

tchordia

lgtm! I think it would be helpful to add examples. Let's add example prompts formats and the final formatted string?

alanwguo · 2024-01-25T21:24:41Z

There's an existing example in the docs below. I'll add a reference to it.

Signed-off-by: Alan Guo <[email protected]>

Signed-off-by: Max Pumperla <[email protected]>

Trying to make it easier for users to self-service add custom models to use with ray-llm. --------- Signed-off-by: Alan Guo <[email protected]>

Trying to make it easier for users to self-service add custom models to use with ray-llm. Cherry-pick of #126 --------- Signed-off-by: Alan Guo <[email protected]>

Add more details about prompt format in the docs

f97d8e5

Signed-off-by: Alan Guo <[email protected]>

alanwguo commented Jan 24, 2024

View reviewed changes

adjust copy;

a83ac2d

Signed-off-by: Alan Guo <[email protected]>

alanwguo requested review from Yard1, sihanwang41, tchordia and avnishn January 24, 2024 23:23

tchordia approved these changes Jan 25, 2024

View reviewed changes

fixup

4613e16

Signed-off-by: Alan Guo <[email protected]>

alanwguo merged commit f6926b7 into master Jan 25, 2024
2 checks passed

alanwguo deleted the improve-prompt-format-docs branch January 25, 2024 22:05

alanwguo pushed a commit that referenced this pull request Jan 25, 2024

versioned api endpoints (#126)

87ab81f

Signed-off-by: Max Pumperla <[email protected]>

alanwguo added a commit that referenced this pull request Jan 25, 2024

Add more details about prompt format in the docs (#126)

8ec5f02

Trying to make it easier for users to self-service add custom models to use with ray-llm. --------- Signed-off-by: Alan Guo <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add more details about prompt format in the docs #126

Add more details about prompt format in the docs #126

alanwguo commented Jan 24, 2024

alanwguo Jan 24, 2024

tchordia left a comment

alanwguo commented Jan 25, 2024

Add more details about prompt format in the docs #126

Add more details about prompt format in the docs #126

Conversation

alanwguo commented Jan 24, 2024

alanwguo Jan 24, 2024

Choose a reason for hiding this comment

tchordia left a comment

Choose a reason for hiding this comment

alanwguo commented Jan 25, 2024