Multiple gpus on different nodes #427

bjwswang · 2023-12-22T08:56:53Z

I found a discussion in vllm community about deploy vllm model across multiple nodes in kubernetes
vllm-project/vllm#1363

The text was updated successfully, but these errors were encountered:

bjwswang · 2023-12-22T09:40:29Z

We may try

kuberay -> manger gpu cluster
vllm -> run ray application(llm models)
rdam -> IB(infiniBand)

nkwangleiGIT · 2023-12-30T00:26:58Z

Use ray as distributed runtime to support GPU resource pool across nodes, we can run big LLM models using multiple GPUs now.

nkwangleiGIT · 2024-01-02T02:01:16Z

Add docs here: http://kubeagi.k8s.com.cn/docs/category/distributed-inference

feat: #427 support to run models using ray cluster

nkwangleiGIT · 2024-01-05T06:44:48Z

Fixed by #500

This was referenced Dec 22, 2023

Able to run worker(local model service) with multiple gpus #418

Closed

feat: enable multiple gpus(single node) in runner Fastchat #425

Merged

bjwswang self-assigned this Dec 22, 2023

bjwswang added this to the v0.2.0 milestone Dec 22, 2023

bjwswang added LLM k8s-operator priority-medium labels Dec 22, 2023

bjwswang assigned Lanture1064 and bjwswang and unassigned bjwswang Dec 25, 2023

nkwangleiGIT assigned nkwangleiGIT and unassigned bjwswang and Lanture1064 Dec 30, 2023

nkwangleiGIT mentioned this issue Dec 30, 2023

Support to specify CUDA visible device id for model service #444

Closed

nkwangleiGIT added a commit to nkwangleiGIT/arcadia that referenced this issue Jan 5, 2024

kubeagi#427 support to run models using ray cluster

7e9a5fb

nkwangleiGIT added a commit to nkwangleiGIT/arcadia that referenced this issue Jan 5, 2024

feat: kubeagi#427 support to run models using ray cluster

b83b158

nkwangleiGIT added a commit to nkwangleiGIT/arcadia that referenced this issue Jan 5, 2024

feat: kubeagi#427 support to run models using ray cluster

8d83804

nkwangleiGIT added a commit that referenced this issue Jan 5, 2024

Merge pull request #500 from nkwangleiGIT/main

cf9e6d4

feat: #427 support to run models using ray cluster

nkwangleiGIT closed this as completed Jan 5, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Multiple gpus on different nodes #427

Multiple gpus on different nodes #427

bjwswang commented Dec 22, 2023 •

edited

Loading

bjwswang commented Dec 22, 2023

nkwangleiGIT commented Dec 30, 2023

nkwangleiGIT commented Jan 2, 2024

nkwangleiGIT commented Jan 5, 2024

Multiple gpus on different nodes #427

Multiple gpus on different nodes #427

Comments

bjwswang commented Dec 22, 2023 • edited Loading

bjwswang commented Dec 22, 2023

nkwangleiGIT commented Dec 30, 2023

nkwangleiGIT commented Jan 2, 2024

nkwangleiGIT commented Jan 5, 2024

bjwswang commented Dec 22, 2023 •

edited

Loading