Reduce your AI costs
Manifest is a smart model router for agents and AI applications that redirects each query to the right model, saving up to 70% in AI costs.
- π Routing based on complexity, specificity and custom HTTP headers
- ποΈ Mix your providers: API keys, Subscriptions, Local models, Custom providers
- π Track every single dollar, setup notifications and limits
- π Fallback on different models when queries fails
Go to app.manifest.build and follow the guide.
Manifest ships as a Docker image. One command:
bash <(curl -sSL https://raw.githubusercontent.com/mnfst/manifest/main/docker/install.sh)Open http://localhost:2099 and sign up β the first account you create becomes the admin. Full self-hosting guide: docker/DOCKER_README.md.
The legacy
manifestnpm package is deprecated and no longer published.
Manifest connects to 300+ models across 16 providers plus any custom provider (OpenAI/Anthropic compatible). Bring your own API key, reuse a paid subscription you already have, or run models locally β all routed through
the same /auto endpoint.
| Provider | API key | Subscription | Featured models |
|---|---|---|---|
| OpenAI | β | β ChatGPT Plus / Pro / Team | gpt-5, gpt-5-mini, o4, o4-mini |
| Anthropic | β | β Claude Max / Pro | claude-opus-4-7, claude-sonnet-4-6, claude-haiku-4-5 |
| β | β | gemini-2.5-pro, gemini-2.5-flash, gemini-2.0-flash | |
| xAI | β | β | grok-4, grok-3, grok-code-fast |
| DeepSeek | β | β | deepseek-v3.2, deepseek-r1 |
| Mistral | β | β | mistral-large, codestral, magistral |
| Qwen (Alibaba) | β | β | qwen3-max, qwen3-coder, qwq-32b |
| Moonshot (Kimi) | β | β | kimi-k2, moonshot-v1-128k |
| MiniMax | β | β MiniMax Coding Plan | minimax-m2, abab7-chat-preview |
| Z.ai (Zhipu) | β | β GLM Coding Plan | glm-4.6, glm-4.5-air |
| OpenCode | β | β Go subscription | Routes via OpenCode Go catalog |
| Ollama | π₯οΈ Local | β Ollama Cloud | Any GGUF model, port 11434 |
| LM Studio | π₯οΈ Local | β | Any GGUF model, port 1234 |
| llama.cpp | π₯οΈ Local | β | Any GGUF model, port 8080 |
| OpenRouter | β | β | Routes to 300+ models across labs |
| GitHub Copilot | β | β Copilot subscription | OAuth, no API key needed |
| Custom (OpenAI/Anthropic-compatible) | β | β | Any /v1/chat/completions or /v1/messages endpoint |
