Skip to content
@gpustack

GPUStack

GPU cluster manager for optimized AI model deployment

Pinned Loading

  1. gpustack gpustack Public

    A GPU cluster manager that configures and orchestrates inference engines like vLLM and SGLang for high-performance AI model deployment.

    Python 4.7k 483

  2. runner runner Public

    Collection of Dockerfiles to build images for various inference services across different accelerated backends.

    Dockerfile 10 9

  3. runtime runtime Public

    Provides a unified interface to detect GPU resources and manages GPU workloads.

    Python 11 14

  4. gguf-parser-go gguf-parser-go Public

    Review/Check GGUF files and estimate the memory usage and maximum tokens per second.

    Go 250 24

  5. vox-box vox-box Public

    A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.

    Python 201 33

Repositories

Showing 10 of 15 repositories
  • gpustack Public

    A GPU cluster manager that configures and orchestrates inference engines like vLLM and SGLang for high-performance AI model deployment.

    gpustack/gpustack’s past year of commit activity
    Python 4,709 Apache-2.0 483 491 33 Updated Mar 24, 2026
  • gpustack-ui Public
    gpustack/gpustack-ui’s past year of commit activity
    TypeScript 77 Apache-2.0 56 2 6 Updated Mar 24, 2026
  • runtime Public

    Provides a unified interface to detect GPU resources and manages GPU workloads.

    gpustack/runtime’s past year of commit activity
    Python 11 Apache-2.0 14 0 3 Updated Mar 20, 2026
  • gpustack/gpustack-higress-plugin’s past year of commit activity
    Go 1 2 0 0 Updated Mar 20, 2026
  • runner Public

    Collection of Dockerfiles to build images for various inference services across different accelerated backends.

    gpustack/runner’s past year of commit activity
    Dockerfile 10 Apache-2.0 9 0 0 Updated Mar 13, 2026
  • gpustack/gpustack.github.io’s past year of commit activity
    HTML 1 2 0 0 Updated Mar 7, 2026
  • gpustack/benchmark-runner’s past year of commit activity
    Python 2 Apache-2.0 2 1 0 Updated Mar 6, 2026
  • community-inference-backends Public

    Community Inference Backends for GPUStack V2

    gpustack/community-inference-backends’s past year of commit activity
    Python 8 Apache-2.0 7 0 1 Updated Feb 13, 2026
  • gguf-parser-go Public

    Review/Check GGUF files and estimate the memory usage and maximum tokens per second.

    gpustack/gguf-parser-go’s past year of commit activity
    Go 250 MIT 24 1 0 Updated Feb 11, 2026
  • .github Public

    Meta-Github repository for all GPUStack repositories.

    gpustack/.github’s past year of commit activity
    1 Apache-2.0 4 0 0 Updated Feb 4, 2026