Skip to content
#

ai-ops

Here are 28 public repositories matching this topic...

Production-grade multi-agent AI system for infrastructure reliability analysis and advisory intelligence, with governed execution available in Enterprise deployments.

  • Updated Jan 10, 2026
  • Python

tpu-doc is a zero-dependency diagnostic binary for Google Cloud TPU environments that instantly validates hardware health, discovers software stack configurations, and provides AI-powered log analysis to eliminate expensive debugging downtime.

  • Updated Jan 6, 2026
  • Rust

It is an AI-powered DevOps tool that analyzes Linux server logs to detect anomalies and predict failures. It integrates ML models, automated fixes via Ansible, containerization with Docker, and orchestration using Kubernetes—providing a full-stack solution for predictive maintenance.

  • Updated Aug 21, 2025
  • Python

Improve this page

Add a description, image, and links to the ai-ops topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the ai-ops topic, visit your repo's landing page and select "manage topics."

Learn more