Agent
azure-template-generator — source: .github/agents/azure-template-generator.agent.md
Scope
Author the eval suite at .github/evals/agents/azure-template-generator/:
- eval.yaml: suite config (executor, model, graders)
- tasks/positive-*.yaml
- tasks/negative-*.yaml
- .github/evals/manifest.yaml entry at tier: expanded
Notes
Positive tasks that generate ARM templates via create should also require the agent to paste the rendered JSON inline in a fenced code block in its chat response. output_contains graders check only chat output, not files created via create, so without an inline echo the grader can't audit the template.
Procedure
- /agent-bench azure-template-generator drafts the suite from the live .agent.md.
- Run waza run .github/evals/agents/azure-template-generator/eval.yaml -v locally.
- /agent-improve azure-template-generator to iterate on graders.
- Open PR.
- Mock CI runs automatically. A maintainer will dispatch a real-model run before merge.
Acceptance
- mock executor.
- manifest.yaml entry added; PR description includes the real-model run summary.
Conventions to follow
- Persona lock: refusal graders should accept the agent's own scope language.
- Prompt graders need continue_session: true in their grader config.
Related
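To illustrate the inline-echo requirement from Notes, here is a minimal Python sketch of what a chat-only check can and cannot see. It is not waza's grader implementation: the function names are hypothetical, and the choice of $schema, contentVersion, and resources as the keys to look for is an assumption based on the required top-level elements of an ARM template.

```python
import json
import re

# Hypothetical stand-in for an output_contains-style check: it receives only
# the chat response text, never files the agent wrote with "create".
FENCE = "`" * 3  # built at runtime so this example can itself sit in a fenced block
FENCED_BLOCK = re.compile(FENCE + r"(?:json)?\s*\n(.*?)" + FENCE, re.DOTALL)

# Assumption: these top-level keys are enough to recognise an ARM template.
REQUIRED_KEYS = ("$schema", "contentVersion", "resources")


def extract_inline_templates(chat_response: str) -> list[dict]:
    """Return every fenced block in the response that parses as a JSON object."""
    templates = []
    for body in FENCED_BLOCK.findall(chat_response):
        try:
            parsed = json.loads(body)
        except json.JSONDecodeError:
            continue  # not JSON, e.g. a shell snippet
        if isinstance(parsed, dict):
            templates.append(parsed)
    return templates


def response_echoes_arm_template(chat_response: str) -> bool:
    """True if some inline JSON block carries all the required top-level keys."""
    return any(
        all(key in template for key in REQUIRED_KEYS)
        for template in extract_inline_templates(chat_response)
    )
```

A response that only reports that a file was created gives such a check nothing to inspect, which is why positive tasks should ask for the rendered JSON in the chat output as well.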