Unified execution runtime for LLM and ML programs.
machine-learning deep-learning transformers pytorch agents execution-engine kv-cache dataflow-graph llm generative-ai ai-runtime workflow-optimization program-optimization prefix-caching agent-runtime llm-runtime compiler-runtime cross-call-caching
-
Updated
May 1, 2026 - C++