Support for allocating GPUs in Passthrough-Mode #183

varunrsekar · 2024-10-17T09:24:50Z

This PR introduces a new DeviceClass vfiopci.nvidia.com that will allocate a full GPU in PassThrough-mode (PT) by binding the GPU to vfio-pci driver.

The primary usecase for this new DeviceClass are Kata containers and KubeVirt VMs that require the gpu to be in PT-mode and made available to a pod which then would spin up a guest with the gpu.

Note: Regular pod workloads will not benefit from this DeviceClass and shouldn't try to use this.

As part of this change, I've introduced some (but not all) modifications to the kind cluster config that are needed for this DeviceClass to work. Host-level modifications needed:

# Example on Ubuntu:

# Enable IOMMU on the host kernel
if ! grep -q "GRUB_CMDLINE_LINUX_DEFAULT=.*intel_iommu=on" /etc/default/grub; then
   sed -i 's/GRUB_CMDLINE_LINUX_DEFAULT="/GRUB_CMDLINE_LINUX_DEFAULT="intel_iommu=on /g' /etc/default/grub
fi
sudo update-grub

# Disable GDM
sudo systemctl stop gdm && systemctl disable gdm
# Unload nvidia-drm
sudo modprobe -r nvidia-drm
# Reboot the node
sudo reboot

Validated on a kind cluster with a Quattro P2000 GPU:

$ nvidia-smi -L
GPU 0: Quadro P2000 (UUID: GPU-7bea1569-778c-fb4d-7801-df6b6b85ceac)

$ k get resourceclaim -n gpu-test-vfiopci
NAME             STATE                AGE
pod1-gpu-k9w6g   allocated,reserved   21s

$ k get pod -n gpu-test-vfiopci
NAME   READY   STATUS    RESTARTS   AGE
pod1   1/1     Running   0          2m20s

Open items:

how to make sysfs on the kind cluster node be read-write mountable?
Handling kubelet plugin restarts when the GPU is bound to vfio-pci driver as there is no device discovery possible at that time.

Signed-off-by: Varun Ramachandra Sekar <[email protected]>

varunrsekar · 2024-10-17T09:25:42Z

/cc @klueska

Varun Ramachandra Sekar added 11 commits October 15, 2024 11:05

Vfio PCI device API support

0293286

Signed-off-by: Varun Ramachandra Sekar <[email protected]>

report PCI address in Allocatable devices

da73845

Signed-off-by: Varun Ramachandra Sekar <[email protected]>

vfio-pci device prepare/unprepare

d50955b

Signed-off-by: Varun Ramachandra Sekar <[email protected]>

vfio-pci devicelcass

6aa294d

Signed-off-by: Varun Ramachandra Sekar <[email protected]>

api gen

6f8569c

Signed-off-by: Varun Ramachandra Sekar <[email protected]>

fix Dockerfile paths

6bf65cb

Signed-off-by: Varun Ramachandra Sekar <[email protected]>

Make vfio-pci driver part of gpu config

d3deec7

Signed-off-by: Varun Ramachandra Sekar <[email protected]>

bug fixes

c07d786

kind cluster changes for vfio-PT

1cdf03a

code cleanup

fa622b6

GPU passthrough-mode example

0fc7174

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support for allocating GPUs in Passthrough-Mode #183

Support for allocating GPUs in Passthrough-Mode #183

varunrsekar commented Oct 17, 2024

varunrsekar commented Oct 17, 2024

Support for allocating GPUs in Passthrough-Mode #183

Are you sure you want to change the base?

Support for allocating GPUs in Passthrough-Mode #183

Conversation

varunrsekar commented Oct 17, 2024

varunrsekar commented Oct 17, 2024