### P0 - [ ] **[Loadgen]** Multi-model or LoRA support and traffic splitting - [ ] **[Datagen]** Multi-modal support (vision language model) ### P1 - [ ] **[Datagen]** Support different input / output distribution for different stages (helpful for autoscaling) ### P2 - [ ] **[Metrics]** SLO support and conformance of specific latency SLOs - [ ] **[Metrics]** GPU utilization / other hardware metrics from Prometheus
P0
P1
P2