test: add agent and stress validation#20

Merged
iFurySt merged 1 commit into iFurySt:main from JohanSardar:agent-stress-validation
Jun 7, 2026

@JohanSardar
Contributor

Summary

  • add a repeatable stress runner for fixture-backed native smoke loops plus cursor-idle smoke
  • add a live-agent smoke runner for Claude Code, Codex CLI, and Hermes across list-apps, fixture, and fixture-full scenarios
  • document validation commands and record the validation history

Validation

  • node --check scripts/run-agent-smoke-tests.mjs
  • make check-docs
  • swift test
  • make agent-smoke
  • OPEN_COMPUTER_USE_STRESS_LOOPS=20 OPEN_COMPUTER_USE_STRESS_CONFIGURATION=release ./scripts/run-tool-stress-tests.sh
  • node ./scripts/run-agent-smoke-tests.mjs --scenario=fixture-full --agents=codex --command=open-computer-use --json --timeout-ms=180000
  • node ./scripts/run-agent-smoke-tests.mjs --scenario=fixture-full --agents=claude --command=open-computer-use --json --timeout-ms=180000 --claude-budget-usd=3.00
  • node ./scripts/run-agent-smoke-tests.mjs --scenario=fixture-full --agents=hermes --command=open-computer-use --hermes-provider=anthropic --hermes-model=claude-opus-4-20250514 --json --timeout-ms=180000
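
The three fixture-full invocations above differ only in their agent-specific flags, so they can be generated from one place. The following is a minimal sketch, not part of this PR: `agent_smoke_cmd` is a hypothetical helper name, and it only prints each command rather than running it. Every flag and value is copied verbatim from the list above; nothing else about the runner's interface is assumed.

```shell
# Hypothetical helper (not in the PR): print the fixture-full command for one agent.
# Flags and values are copied verbatim from the Validation list; nothing is executed.
agent_smoke_cmd() {
  base="node ./scripts/run-agent-smoke-tests.mjs --scenario=fixture-full --agents=$1 --command=open-computer-use"
  common="--json --timeout-ms=180000"
  case "$1" in
    codex)
      printf '%s %s\n' "$base" "$common" ;;
    claude)
      # Claude run carried an explicit budget cap in the PR.
      printf '%s %s %s\n' "$base" "$common" "--claude-budget-usd=3.00" ;;
    hermes)
      # Hermes run needed explicit provider/model routing on the author's machine.
      printf '%s %s %s\n' "$base" "--hermes-provider=anthropic --hermes-model=claude-opus-4-20250514" "$common" ;;
    *)
      echo "unknown agent: $1" >&2
      return 1 ;;
  esac
}

# Print all three commands for review before running them by hand.
for agent in codex claude hermes; do
  agent_smoke_cmd "$agent"
done
```

Printing instead of executing keeps the sketch safe to paste: the live-agent runs can spend API budget, so reviewing the exact command line first seems prudent.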

Notes: the 20-loop stress run completed 200 fixture operations plus cursor-idle smoke in 76 seconds. Hermes needed explicit provider/model routing on this machine because the local default model alias returned HTTP 404 before tool use.
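
For a rough sense of scale, the stress figures in the note can be turned into per-loop numbers. This is a back-of-the-envelope sketch only: `stress_summary` is a hypothetical helper, it assumes the 200 fixture operations are spread evenly across the 20 loops, and the 76 seconds also covers the cursor-idle smoke, so the per-loop time is an upper bound rather than a measured value.

```shell
# Hypothetical helper: derive rough rates from (loops, fixture operations, wall-clock seconds).
# Assumes operations are evenly distributed across loops.
stress_summary() {
  awk -v loops="$1" -v ops="$2" -v secs="$3" 'BEGIN {
    printf "ops_per_loop=%d secs_per_loop=%.2f ops_per_sec=%.2f\n",
           ops / loops, secs / loops, ops / secs
  }'
}

# Figures reported in this PR: 20 loops, 200 fixture operations, 76 seconds.
stress_summary 20 200 76
# prints: ops_per_loop=10 secs_per_loop=3.80 ops_per_sec=2.63
```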

Owner

@iFurySt iFurySt left a comment

LGTM! thanks

@iFurySt iFurySt merged commit 9d9d5d9 into iFurySt:main Jun 7, 2026

2 participants