Skip to content
View parano's full-sized avatar
🍱
🍱

Organizations

@sysu @CSE512-14W @horseshoe477 @VoteWithYourFeet @atalaya-io @bentoml

Block or report parano

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
.github/profile/README.md

Get your ML models into production today!

BentoML is an open-source platform that streamlines machine-learning model deployment and runs high-performance serving workload at scale.

Join us in our community Slack to receive project updates and get involved with latest development.

BentoML - The Unified Model Serving Framework

🍱 BentoML repo | 🎨 Gallery Projects | πŸ“– Documentation

BentoML provides high-level APIs for packaging ML models and defining ML Services. From the ML service definition, BentoML allows building versioned archive(aka Bento) capturing all required dependencies, automatically generate HTTP Server and create Docker Image for production-grade deployment.

Key Features:

  • Support multiple ML frameworks including PyTorch, TensorFlow, Scikit-Learn, XGBoost, and many more
  • Support Adaptive Batching which dynamically group inference requets into small batches in real-time for better performance
  • Build inference graph composed from multiple models and execute them in parallel
  • Automatic Docker image can be generated for production deployment
  • Automatically generate REST API spec in Swagger/OpenAPI format

Yatai - MLOps on Kubernetes

πŸ¦„οΈ Yatai repo | πŸ‘©β€πŸš€ Administrator's Guide | ⎈ Helm Chart

Yatai helps ML teams to run BentoML at scale on Kubernetes. It makes it easy for teams to manage all their ML assets in one place and operate large scale model serving workloads on Kubernetes.

  • Bento Registry - manage all your team's Bentos and Models, backed by cloud blob storage(S3, MinIO)
  • Deployment Automation - deploy Bentos as auto-scaling API endpoints on Kubernetes and easily rollout new versions
  • Observability - monitoring dashboard helping users to identify model performance issues
  • CI/CD - flexible APIs for integrating with your training and CI pipelines

bentoctl - Fast model deployment with BentoML on cloud platforms

πŸš€ bentoctl source

bentoctl is a CLI tool for deploying your BentoML packaged ML models as API endpoint on popular cloud platforms. It automates Bento docker image build, interactes with cloud platform APIs, and allow users to easily manage their deployment.

Supported platforms:

Custom deploy target is also supported by building your own bentoctl plugin from the deployment operator template.

Pinned Loading

  1. bentoml/BentoML bentoml/BentoML Public

    The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!

    Python 8.6k 950

  2. bentoml/OpenLLM bentoml/OpenLLM Public

    Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.

    Python 12.3k 805