A self-hosted operational layer for engineering teams managing multi-tenant cloud environments. It is built to close real Day-2 operational gaps in Platform9 / OpenStack environments at MSP scale.
https://github.com/erezrozenbaum/pf9-mngt
- 450+ commits
- 170+ API endpoints
- 40+ database tables
- 10+ containerized services
- Full multi-service architecture
Short walkthrough showing:
- Inventory visibility
- Snapshot automation
- VM restore workflows
- Migration planning
I design and build operational platforms for real production environments.
Focused on:
- Multi-tenant cloud operations (MSP scale)
- Day-2 automation and governance
- Migration from VMware to open platforms
- Turning operational chaos into structured systems
Built for:
- MSP engineering teams
- Cloud platform teams
- Enterprises operating Platform9 / OpenStack at scale
Built from real production gaps:
- No persistent infrastructure inventory
- No automated restore workflows
- No snapshot SLA visibility
- No structured onboarding for tenants
- No structured migration planning from VMware environments
- Limited cross-project operational visibility
What it provides:
- Single control plane across environments
- Cross-region visibility and management
- Designed for MSP-scale operations
- Full infrastructure inventory (servers, volumes, networks, etc.)
- Historical tracking with change detection
- PostgreSQL-backed with JSONB + delta logic
- Policy-driven snapshot scheduling
- Metadata-based automation (daily / monthly / retention)
- Compliance reporting per tenant / volume
- Structured restore workflows
- Not just snapshot creation, but actual recovery operations
- RVTools-based ingestion
- Capacity planning (CPU, RAM, storage)
- Migration wave design + downtime estimation
- Target readiness validation (flavors, networks, tenants)
- Full audit logs
- RBAC with LDAP integration
- Change tracking across infrastructure
- Tier1 / Tier2 operational workflows
- Structured execution with escalation paths
- Designed for real support teams
- Usage visibility
- Foundation for chargeback / showback
- Operational alerts and event-driven workflows
Tech stack:
- Backend: FastAPI
- Frontend: React + TypeScript (Vite, Tailwind)
- Database: PostgreSQL
- Auth: LDAP + internal RBAC
- Deployment: Docker & Kubernetes (both supported)
- CI/CD: GitHub Actions (container build pipeline)
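
The inventory bullets above mention historical tracking with change detection and "JSONB + delta logic". As a rough illustration only, here is a minimal Python sketch of what such delta logic could look like when comparing two JSON-like snapshots of the same resource. The function names (`compute_delta`, `has_changes`) and the sample fields are hypothetical and are not taken from the pf9-mngt codebase; the actual implementation may differ.

```python
from typing import Any


def compute_delta(previous: dict[str, Any], current: dict[str, Any]) -> dict[str, dict[str, Any]]:
    """Compare two snapshots of one resource at the top level.

    Hypothetical helper: returns fields that were added, removed,
    or changed between the previous and current snapshot.
    """
    added = {key: current[key] for key in current.keys() - previous.keys()}
    removed = {key: previous[key] for key in previous.keys() - current.keys()}
    changed = {
        key: {"old": previous[key], "new": current[key]}
        for key in previous.keys() & current.keys()
        if previous[key] != current[key]
    }
    return {"added": added, "removed": removed, "changed": changed}


def has_changes(delta: dict[str, dict[str, Any]]) -> bool:
    """True if any section of the delta is non-empty."""
    return any(delta[section] for section in ("added", "removed", "changed"))


# Example: a VM record as it might appear in two consecutive inventory runs
# (field names are illustrative assumptions).
old_record = {"id": "vm-1", "status": "ACTIVE", "flavor": "m1.small"}
new_record = {"id": "vm-1", "status": "SHUTOFF", "flavor": "m1.small", "host": "node-2"}

delta = compute_delta(old_record, new_record)
if has_changes(delta):
    print(delta["changed"])
```

One plausible design, stated here as an assumption, is to write a new history row only when `has_changes` returns true and to store the delta dictionary in a JSONB column, so that history grows with the number of changes rather than with the number of collection runs.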
This is not a replacement for Platform9.
It is built to extend it with the operational capabilities engineering teams actually need in production:
- Visibility
- Control
- Automation
- Governance
- Migration readiness
This project reflects a broader shift:
From infrastructure management
→ to operational systems engineering
Because at scale, infrastructure is the easy part.
28+ years in IT, Engineering Manager, building and operating:
- Multi-tenant cloud environments
- MSP platforms
- Infrastructure automation systems
Focused on turning operational complexity into structured systems.
- GitHub: @erezrozenbaum
- Email: erez.rozenbaum@gmail.com
- Location: Israel

