This repository is the official community hub for Azure SRE Agent. Here you'll find:
- Report Issues: File bugs, feature requests, and feedback via GitHub Issues
- Resources: Curated links to docs, videos, blogs, and community content for Azure SRE Agent
- Labs: Hands-on labs and sample environments to deploy, break, and fix apps with Azure SRE Agent (see the labs/ folder)
| Resource | Link |
|---|---|
| Product Home Page | https://www.azure.com/sreagent |
| Portal (Create & Manage Agents) | https://aka.ms/sreagent |
| Documentation | https://aka.ms/sreagent/newdocs |
| Pricing & Billing | https://aka.ms/sreagent/pricing |
| All Blogs | https://aka.ms/sreagent/blog |
| YouTube Channel | https://aka.ms/sreagent/youtube |
| GitHub: Azure SRE Agent (Report Issues, Official Labs & Resources) | https://aka.ms/sreagent/github |
| Hands-on Lab | https://aka.ms/sreagent/lab |
| Request a New Region | https://aka.ms/sreagent/region |
| GitHub: Official Plugins | https://github.com/Azure/sre-agent-plugins |
| Tech Community Discussions | https://aka.ms/sreagent/discussions |
| Agentic DevOps Live | https://aka.ms/agenticdevopslive |
| X (Twitter) | https://x.com/azuresreagent |
A comprehensive look at Azure SRE Agent as an end-to-end agentic operations platform, covering how it tackles every kind of operational toil and scales to meet enterprise needs. https://www.youtube.com/watch?v=06j-d0gsREw
The official Microsoft Azure product overview: a concise explainer of what Azure SRE Agent is, how it works, and the problems it solves. https://www.youtube.com/watch?v=6vDrThUjDOc · 6,156 views · 158 likes
Satya Nadella highlights Azure SRE Agent as a key example of AI-driven operations transforming how engineering teams manage reliability at scale. https://www.youtube.com/watch?v=3hPeKDtLvPg · 2,548 views · 26 likes
Scott Hanselman walks through Azure SRE Agent on Azure Friday, showing how it reduces operational toil and lets teams focus on innovation. https://www.youtube.com/watch?v=5c9pl8_DI3w · 4,264 views · 75 likes
The GA launch video demonstrating Azure SRE Agent performing root cause analysis with full code context through deep GitHub integration. https://www.youtube.com/watch?v=1vKoxPeep_M · 582 views · 25 likes
Deep-dive Build session covering end-to-end SRE Agent capabilities: automated investigation, remediation, proactive monitoring, and custom hooks. https://www.youtube.com/watch?v=bK3SIQoE_Nc · 12,294 views · 129 likes
- Fix It Before They Feel It: Proactive .NET Reliability with Azure SRE Agent · dotnet · 1,466 views
- Azure SRE Agent - Incident Management with PagerDuty · Azure SRE Agent (official) · 547 views
- Azure SRE Agent - Your 24/7 Automated Response Team · Mariusz Ferdyn · 313 views
- Azure's New SRE Agent Is INSANE - Here's Why you Should Pay Attention · TechTalks with Gil · 249 views
- SRE Agent Series: What Is Azure SRE Agent and How to Create One Step by Step · JBSWiki · 204 views
- Azure SRE Agent Explained · Cloud Talk with Jonnychipz · 160 views
- SRE Agent Series: I Let an Azure SRE Agent Manage My Subscription - Here's What Happened · JBSWiki · 143 views
- Agentic DevOps: Azure SRE Agent with GitHub Copilot Coding Agent demo · Jorge Balderas · new
- Event-Driven IaC Operations: Terraform Drift Detection via HTTP Triggers · Vineela Suri · 10 min read. End-to-end pipeline: a Terraform Cloud webhook triggers SRE Agent to classify drift as benign/risky/critical, correlate it with incidents, and ship a fix, including a "DO NOT revert" recommendation that prevents turning a mitigated incident into an outage.
- Managing Multi-Tenant Azure Resources with SRE Agent and Lighthouse · Pranab Mandal · 6 min read. Step-by-step guide to configuring Azure Lighthouse delegation so a single SRE Agent can monitor and manage resources across multiple tenants, covering ARM templates, RBAC roles, and managed identity setup.
- New in Azure SRE Agent: Log Analytics and Application Insights Connectors · Dalibor Kovacevic · 3 min read. Native MCP-backed connectors for Log Analytics and App Insights: connect a workspace, auto-grant RBAC, and the agent queries ContainerLog, Syslog, exceptions, and traces directly during investigations.
- Azure Monitor in Azure SRE Agent: Autonomous Alert Investigation and Intelligent Merging · Vineela Suri · 9 min read. Full walkthrough of Azure Monitor integration: Incident Response Plans, alert merging (7 firings → 1 thread), auto-resolve trade-offs, and a live AKS + Redis scenario where the agent fixes a bad credential autonomously.
- 3 Ways to Get More from Azure SRE Agent · dchelupati · 4 min read. Practical cost and value tips: start narrow with incident routing, replace high-frequency polling with push/batch patterns, and keep scheduled task threads fresh with "new chat thread for each run."
- How We Build and Use Azure SRE Agent with Agentic Workflows · Shamir AbdulAziz · 6 min read. Customer Zero blog on how Microsoft embedded agents across the SDLC to build SRE Agent: 35K+ incidents handled, 50K+ developer hours saved, App Service time-to-mitigation down from 40.5 hours to 3 minutes.
- An Update to the Active Flow Billing Model · Mayunk Jain · 3 min read. Active flow billing moves from time-based to token-based usage, with per-model-provider AAU rates. Always-on pricing is unchanged at 4 AAUs per agent-hour.
- Announcing General Availability for the Azure SRE Agent · Mayunk Jain · 4 min read. GA announcement: 1,300+ agents deployed internally at Microsoft, 35K+ incidents mitigated, 20K+ engineering hours saved. Covers deep context, built-in computation, memory and learning, and the Ecolab customer story.
- What's New in Azure SRE Agent in the GA Release · dchelupati · 2 min read. Companion to the GA announcement: redesigned onboarding, deep context, code interpreter, memory, skills, subagents, Python tools, agent hooks, and MCP connectors.
- The Agent That Investigates Itself (SRE4SRE) · Sanchit Mehta · 11 min read. Deep technical post in which the SRE Agent investigates its own KV cache regression, demonstrating how the team uses the product to maintain the product.
- Azure SRE Agent Now Builds Expertise Like Your Best Engineer (Deep Context) · dchelupati · 6 min read. How the agent operates with continuous access to source code, persistent memory across investigations, and background intelligence that runs when nobody is asking questions.
- What It Takes to Give SRE Agent a Useful Starting Point (Onboarding) · Dalibor Kovacevic · 10 min read. Designing the guided onboarding flow: connecting code, logs, incidents, Azure resources, and knowledge files so a new agent becomes useful on day one.
- Agent Hooks: Production-Grade Governance for Azure SRE Agent · Vineela Suri · 9 min read. Governance primitives for controlling agent behavior: stop hooks, PostToolUse hooks, and global hooks that enforce approval gates and safety boundaries.
- An AI-Led SDLC: Building an End-to-End Agentic Software Development Lifecycle with Azure and GitHub · owaino · 16 min read. Full agentic SDLC walkthrough: Spec-Kit → GitHub Coding Agent → Code Quality → CI/CD → SRE Agent, with the SRE Agent closing the loop by opening GitHub issues for the coding agent to fix.
- Context Engineering: Lessons from Building Azure SRE Agent · Sanchit Mehta · 8 min read. Engineering lessons: the team started with 100+ tools and 50+ specialized agents and ended with 5 core tools and generalist agents, showing why less is more in agent design.
| Repo | Stars | Description |
|---|---|---|
| microsoft/sre-agent | 83 | Official hands-on lab: sample environments, walkthroughs, and prompt guides |
| matthansen0/azure-sre-agent-sandbox | 52 | Fully automated sandbox deployment with AKS break-fix scenarios |
| paulasilvatech/Agentic-Ops-Dev | 23 | Agentic Operations & Observability Workshop |