Blog
Documentation
Inference Performance Lab
Discord
Contact Us
Author: GPUStack
[Product Updates] GPUStack v0.7: Desktop Installer and Usage Metering (2025-07-29)
[Product Updates] GPUStack v0.6: Deliver the Best Model Inference Experience (2025-04-28)
[Product Updates] GPUStack v0.5: Model Catalog for Simplified Deployment (2025-04-28)
[Product Updates] GPUStack v0.4: Image and Audio Models and Backend Version Management (2024-12-10)
[Tutorials] Set Up NVIDIA Container Runtime and Deploy GPUStack with Docker (2024-11-20)
[Tutorials] Convert and Upload Your GGUF Model to Hugging Face (2024-11-13)
[Tutorials] Building Your Private ChatGPT and Knowledge Base (2024-11-06)
[Tutorials] Running the Full Qwen 2.5 Series: Performance and Resource Allocation Review (2024-10-11)
[Product Updates] GPUStack 0.3: vLLM Support and Multi-Model Comparison View (2024-10-01)
[Product Updates] GPUStack 0.2: Heterogeneous Distributed Inference (2024-09-16)
[Tutorials] Building a Free GitHub Copilot Alternative with Continue + GPUStack (2024-08-23)
[Tutorials] Beginner Tutorial: Using GPUStack to Aggregate GPUs and Run LLMs (2024-07-29)
[Tutorials] GGUF Parser: A Tool for Estimating LLM Resource Requirements (2024-07-24)
[Product Updates] Introducing GPUStack: An Open-Source GPU Cluster Manager for Running LLMs (2024-07-24)