Skip to content
@gpustack

GPUStack

GPU cluster manager for optimized AI model deployment

Pinned Loading

  1. gpustack gpustack Public

    Performance-Optimized AI Inference on Your GPUs. Unlock it by selecting and tuning the optimal inference engine for your model.

    Python 4.4k 443

  2. runner runner Public

    Collection of Dockerfiles to build images for various inference services across different accelerated backends.

    Dockerfile 7 6

  3. runtime runtime Public

    Provides a unified interface to detect GPU resources and manages GPU workloads.

    Python 7 9

  4. gguf-parser-go gguf-parser-go Public

    Review/Check GGUF files and estimate the memory usage and maximum tokens per second.

    Go 227 23

  5. vox-box vox-box Public

    A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.

    Python 190 28

Repositories

Showing 10 of 14 repositories
  • runtime Public

    Provides a unified interface to detect GPU resources and manages GPU workloads.

    gpustack/runtime’s past year of commit activity
    Python 7 Apache-2.0 9 0 1 Updated Jan 14, 2026
  • runner Public

    Collection of Dockerfiles to build images for various inference services across different accelerated backends.

    gpustack/runner’s past year of commit activity
    Dockerfile 7 Apache-2.0 6 0 0 Updated Jan 14, 2026
  • gpustack-ui Public
    gpustack/gpustack-ui’s past year of commit activity
    TypeScript 69 Apache-2.0 54 1 5 Updated Jan 14, 2026
  • gpustack Public

    Performance-Optimized AI Inference on Your GPUs. Unlock it by selecting and tuning the optimal inference engine for your model.

    gpustack/gpustack’s past year of commit activity
    Python 4,373 Apache-2.0 443 384 20 Updated Jan 14, 2026
  • community-inference-backends Public

    Community Inference Backends for GPUStack V2

    gpustack/community-inference-backends’s past year of commit activity
    2 Apache-2.0 3 0 0 Updated Jan 9, 2026
  • gguf-parser-go Public

    Review/Check GGUF files and estimate the memory usage and maximum tokens per second.

    gpustack/gguf-parser-go’s past year of commit activity
    Go 227 MIT 23 2 0 Updated Jan 6, 2026
  • gpustack/gpustack.github.io’s past year of commit activity
    HTML 0 2 0 0 Updated Jan 6, 2026
  • gpustack/gpustack-higress-plugin’s past year of commit activity
    Go 0 2 0 0 Updated Dec 30, 2025
  • .github Public

    Meta-Github repository for all GPUStack repositories.

    gpustack/.github’s past year of commit activity
    0 Apache-2.0 3 0 0 Updated Dec 27, 2025
  • vox-box Public

    A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.

    gpustack/vox-box’s past year of commit activity
    Python 190 Apache-2.0 28 14 0 Updated Dec 23, 2025