This repository was archived by the owner on Apr 3, 2026. It is now read-only.
Commit b34a910
committed
feat(monitoring): add health checks and alert rules for observability stack
- Add comprehensive health checks to all monitoring services (Prometheus, Grafana, Loki, Tempo, Promtail, node-exporter, celery-exporter)
- Create Grafana alert rule configuration for CPU usage monitoring
- Implement logging configuration with rotation limits for Loki, Tempo, and Promtail
- Document post-upgrade fixes and health status in detailed reports1 parent 2ee1fb7 commit b34a910
4 files changed
Lines changed: 849 additions & 0 deletions
File tree
- monitoring/grafana/provisioning/alerting
0 commit comments