Skip to content

Commit

Permalink
Bump images for llama2 and update compute (mosaicml#436)
Browse files Browse the repository at this point in the history
image and compute
  • Loading branch information
margaretqian committed Sep 19, 2023
1 parent 1813eaf commit 7003793
Show file tree
Hide file tree
Showing 2 changed files with 4 additions and 6 deletions.
7 changes: 3 additions & 4 deletions examples/inference-deployments/llama2/llama2_13b.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -5,9 +5,8 @@ command: |- # Note this command is a workaround until we build vllm into the inf
pip uninstall torch -y
pip install torch==2.0.1
compute:
gpus: 1
instance: oci.vm.gpu.a10.1
image: mosaicml/inference:0.1.37
cluster: r7z15
gpus: 2
instance: oci.vm.gpu.a10.2
image: mosaicml/inference:0.1.40
default_model:
model_type: llama2-13b
3 changes: 1 addition & 2 deletions examples/inference-deployments/llama2/llama2_7b_chat.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -7,7 +7,6 @@ command: |- # Note this command is a workaround until we build vllm into the inf
compute:
gpus: 1
instance: oci.vm.gpu.a10.1
image: mosaicml/inference:0.1.37
cluster: r7z15
image: mosaicml/inference:0.1.40
default_model:
model_type: llama2-7b-chat

0 comments on commit 7003793

Please sign in to comment.