Skip to content
Change the repository type filter

All

    Repositories list

    • TypeScript
      Apache License 2.0
      568025Updated Apr 30, 2026Apr 30, 2026
    • gpustack

      Public
      A GPU cluster manager that configures and orchestrates inference engines like vLLM and SGLang for high-performance AI model deployment.
      Python
      Apache License 2.0
      5165k53229Updated Apr 30, 2026Apr 30, 2026
    • core-ui

      Public
      TypeScript
      0000Updated Apr 29, 2026Apr 29, 2026
    • Community Inference Backends for GPUStack V2
      Python
      Apache License 2.0
      91200Updated Apr 28, 2026Apr 28, 2026
    • runtime

      Public
      Provides a unified interface to detect GPU resources and manages GPU workloads.
      Python
      Apache License 2.0
      161403Updated Apr 28, 2026Apr 28, 2026
    • Go
      4200Updated Apr 28, 2026Apr 28, 2026
    • HTML
      2100Updated Apr 24, 2026Apr 24, 2026
    • .github

      Public
      Meta-Github repository for all GPUStack repositories.
      Apache License 2.0
      4100Updated Apr 1, 2026Apr 1, 2026
    • Review/Check GGUF files and estimate the memory usage and maximum tokens per second.
      Go
      MIT License
      2426500Updated Mar 25, 2026Mar 25, 2026
    • runner

      Public
      Collection of Dockerfiles to build images for various inference services across different accelerated backends.
      Dockerfile
      Apache License 2.0
      91300Updated Mar 13, 2026Mar 13, 2026
    • Python
      Apache License 2.0
      2310Updated Mar 6, 2026Mar 6, 2026
    • vox-box

      Public
      A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.
      Python
      Apache License 2.0
      33210162Updated Dec 23, 2025Dec 23, 2025
    • llama-box

      Public archive
      LM inference server implementation based on *.cpp.
      C++
      MIT License
      2829520Updated Nov 24, 2025Nov 24, 2025
    • Python
      Apache License 2.0
      2110Updated Aug 26, 2025Aug 26, 2025
    • fastfetch

      Public
      Like neofetch, but much faster because written mostly in C.
      C
      MIT License
      761200Updated Oct 24, 2024Oct 24, 2024
    • Deliver LLMs of GGUF format via Dockerfile.
      Go
      MIT License
      51500Updated Oct 24, 2024Oct 24, 2024
    ProTip! When viewing an organization's repositories, you can use the props. filter to filter by custom property.