How to Set Up NVIDIA Container Runtime and Deploy GPUStack with Docker

GPUStack is an open-source GPU cluster manager designed for running large models, including LLMs, embedding models, reranker models, vision language models, image generation models, as well as STT and TTS models. It allows you to create a unified cluster by combining GPUs from diverse platforms, such as Apple Macs, Windows PCs, and Linux servers.

Using Docker to install GPUStack on Linux is the recommended method. It simplifies the installation process by avoiding many dependency and compatibility issues.

Today, we bring you a tutorial on how to set up the NVIDIA container runtime and deploy GPUStack with Docker.

 

Requirements

Verify that you have an NVIDIA GPU:
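On most Linux distributions you can check for an NVIDIA GPU with lspci (this assumes the pciutils package is installed):

```shell
# List PCI devices and filter for NVIDIA hardware
lspci | grep -i nvidia
```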

Verify that the system has gcc installed:
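For example:

```shell
# Print the gcc version; an error here means gcc is not installed
gcc --version
```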

 

Installing the NVIDIA Driver

Refer to: https://developer.nvidia.com/datacenter-driver-downloads

Install the kernel headers and development packages for the currently running kernel:
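On Ubuntu/Debian (an assumption; adjust the package manager for your distribution), this is typically:

```shell
# Install headers and development packages matching the running kernel
sudo apt-get install -y linux-headers-$(uname -r)
```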

Install the cuda-keyring package:
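A sketch for Ubuntu 22.04 on x86_64 (adjust the distribution and architecture segments of the URL for your system):

```shell
# Download and install cuda-keyring, which registers NVIDIA's apt repository and signing key
wget https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64/cuda-keyring_1.1-1_all.deb
sudo dpkg -i cuda-keyring_1.1-1_all.deb
sudo apt-get update
```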

Install NVIDIA driver:
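With the NVIDIA repository configured, the driver can be installed via apt. The cuda-drivers metapackage is the one NVIDIA's datacenter driver guide documents for apt-based systems; package names can vary by driver generation:

```shell
# Install the NVIDIA driver metapackage
sudo apt-get install -y cuda-drivers
```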

Reboot the system:
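```shell
sudo reboot
```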

Log in again and check that the nvidia-smi command is available:
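```shell
# Should print the driver version, CUDA version, and a table of detected GPUs
nvidia-smi
```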


 

Installing Docker Engine

Refer to: https://docs.docker.com/engine/install/ubuntu/

Run the following command to uninstall all conflicting packages:
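From the Docker documentation for Ubuntu, this loops over the distro-provided packages that may conflict with Docker Engine:

```shell
# Remove conflicting container packages (apt-get may report some are not installed; that is fine)
for pkg in docker.io docker-doc docker-compose docker-compose-v2 podman-docker containerd runc; do
  sudo apt-get remove $pkg
done
```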

Set up Docker's apt repository:
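For Ubuntu, per the Docker docs (swap the URL path for other distributions):

```shell
# Add Docker's official GPG key
sudo apt-get update
sudo apt-get install -y ca-certificates curl
sudo install -m 0755 -d /etc/apt/keyrings
sudo curl -fsSL https://download.docker.com/linux/ubuntu/gpg -o /etc/apt/keyrings/docker.asc
sudo chmod a+r /etc/apt/keyrings/docker.asc

# Add the Docker repository to apt sources
echo \
  "deb [arch=$(dpkg --print-architecture) signed-by=/etc/apt/keyrings/docker.asc] https://download.docker.com/linux/ubuntu \
  $(. /etc/os-release && echo "$VERSION_CODENAME") stable" | \
  sudo tee /etc/apt/sources.list.d/docker.list > /dev/null
sudo apt-get update
```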

Install the Docker packages:
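```shell
# Install Docker Engine, the CLI, containerd, and the buildx/compose plugins
sudo apt-get install -y docker-ce docker-ce-cli containerd.io docker-buildx-plugin docker-compose-plugin
```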

Check that Docker is available:
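```shell
# Print the installed version and run the hello-world test image
sudo docker --version
sudo docker run --rm hello-world
```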

 

Installing NVIDIA Container Toolkit

Refer to: https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/latest/install-guide.html

Configure the production repository:
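From the NVIDIA Container Toolkit install guide, for apt-based systems:

```shell
# Add NVIDIA's signing key and repository list for the Container Toolkit
curl -fsSL https://nvidia.github.io/libnvidia-container/gpgkey | \
  sudo gpg --dearmor -o /usr/share/keyrings/nvidia-container-toolkit-keyring.gpg
curl -s -L https://nvidia.github.io/libnvidia-container/stable/deb/nvidia-container-toolkit.list | \
  sed 's#deb https://#deb [signed-by=/usr/share/keyrings/nvidia-container-toolkit-keyring.gpg] https://#g' | \
  sudo tee /etc/apt/sources.list.d/nvidia-container-toolkit.list
```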

Install the NVIDIA Container Toolkit packages:
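```shell
sudo apt-get update
sudo apt-get install -y nvidia-container-toolkit
```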

Configure the container runtime by using the nvidia-ctk command:
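```shell
# Registers the nvidia runtime in /etc/docker/daemon.json
sudo nvidia-ctk runtime configure --runtime=docker
```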

Check the daemon.json file:
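It should now contain an nvidia entry under runtimes:

```shell
cat /etc/docker/daemon.json
```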

Restart the Docker daemon:
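```shell
sudo systemctl restart docker
```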

Verify your installation by running a sample CUDA container:
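Per the NVIDIA install guide, seeing nvidia-smi output from inside a container confirms GPU access works end to end:

```shell
sudo docker run --rm --runtime=nvidia --gpus all ubuntu nvidia-smi
```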


 

Installing GPUStack

Refer to: https://docs.gpustack.ai/latest/installation/docker-installation/

Install GPUStack with Docker:
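A sketch based on the GPUStack Docker installation docs; flags and the data volume path may differ across versions, so check the linked docs for the current command:

```shell
docker run -d --name gpustack \
  --restart=unless-stopped \
  --gpus all \
  --network=host \
  --ipc=host \
  -v gpustack-data:/var/lib/gpustack \
  gpustack/gpustack
```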

To view the login password, run the following command:
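Assuming the container is named gpustack as above and uses the default data path:

```shell
# Print the auto-generated initial admin password
docker exec -it gpustack cat /var/lib/gpustack/initial_admin_password
```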

Access the GPUStack UI (http://YOUR_HOST_IP) in your browser, using admin as the username and the password obtained above. After resetting your password, log in to GPUStack.


To add additional worker nodes and form a GPUStack cluster, please run the following command on each worker node:
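A sketch mirroring the server command above, with the extra --server-url and --token arguments for registering a worker (verify against the GPUStack docs for your version):

```shell
docker run -d --name gpustack \
  --restart=unless-stopped \
  --gpus all \
  --network=host \
  --ipc=host \
  -v gpustack-data:/var/lib/gpustack \
  gpustack/gpustack \
  --server-url http://YOUR_HOST_IP --token YOUR_TOKEN
```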

Replace http://YOUR_HOST_IP with your GPUStack server URL and YOUR_TOKEN with your secret token for adding workers. To retrieve the token from the GPUStack server, use the following command:
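Assuming the default container name and data path:

```shell
# Run on the GPUStack server node to print the worker registration token
docker exec -it gpustack cat /var/lib/gpustack/token
```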

After that, deploy models from Hugging Face:


Experiment with the model in the Playground:


 

Join Our Community

In this tutorial, we introduced how to set up the NVIDIA container runtime and deploy GPUStack with Docker.

If you are interested in GPUStack, please find more information at: https://gpustack.ai.

If you encounter any issues or have suggestions for GPUStack, feel free to join our Community for support from the GPUStack team and to connect with fellow users globally.

 
