Benchmark guide for comparing edit format performance and render behavior.
Use this benchmark to compare edit-format throughput and output quality for Codex-focused editing workloads.
npm run bench:edit-formatsSmoke run:
npm run bench:edit-formats:smokeRender dashboard output:
npm run bench:edit-formats:render| Output | Location |
|---|---|
| Benchmark report JSON | .tmp/edit-format-benchmark-*.json |
| Render preview artifacts | .tmp/edit-format-benchmark-render-*.txt |
(Temporary benchmark artifacts are not source files.)
| Preset | Goal |
|---|---|
codex-core |
Baseline Codex-oriented evaluation |
smoke |
Fast sanity check for CI/local validation |
- Compare latency per format.
- Measure token/size overhead.
- Review success/error rates.
- Validate output consistency.
- Confirm no regressions in editing fidelity.
Bash:
rm -rf .tmpPowerShell:
Remove-Item ".tmp" -Recurse -Force -ErrorAction SilentlyContinue