codex-review

Based on hamelsmu/claude-review-loop. Rewritten with parallel Codex reviews, self-review, file-scoped reviews, knowledge compounding, and AI anti-pattern detection.

A Claude Code plugin that adds automated review loops to your workflow. Every task gets independent reviews from multiple parallel Codex agents, and every review compounds knowledge back into the codebase.

Why this fork

The original plugin reviews all uncommitted changes as a single diff. That works well for solo work on small repos, but breaks down in a large monorepo where multiple agents work in the same branch simultaneously — each agent's review picks up noise from everyone else's changes, and the reviewer has no module-level context.

This fork scopes every review to the files that a particular agent actually touched. A PostToolUse hook tracks each Edit/Write operation per session, so when the stop hook fires, Codex only sees that agent's changes — not the entire diff. On top of that, we auto-derive a dependency map for the touched modules (via codebase-map) and inject it into the review prompt, giving each reviewer structural context for the code it's looking at.

The result: several agents can work in parallel on different parts of a monorepo, each getting a focused, context-rich review of only their own work.

What it does

Always-on (every session)

PostToolUse checks: code→comment replacement detection, _param lazy refactoring detection
Self-review: randomized questions from 4 focus areas on every stop (implementation completeness, code quality, integration, codebase consistency)

On-demand (`/review-parallel`)

Run N parallel Codex reviews on uncommitted changes — no review loop needed. 4 specialized agents (diff, holistic, security, tests) review independently and produce a combined report.

Pre-implementation (`/review-plan`)

Run N parallel Codex agents in read-only mode against the existing codebase, given a plan/spec as input. 4 agents assess risks, challenge complexity, and find corner cases before any code is written.

/review-plan docs/tasks/my-feature.md

Full loop (`/review-loop`)

Three-phase lifecycle:

Task phase: Claude implements your task
Review phase: Stop hook runs N parallel Codex reviews (+ lint/typecheck), Claude addresses feedback
Compound phase: Claude extracts reusable knowledge, updates nearest AGENTS.md, logs learnings

Fork additions

Parallel Codex reviews (N separate processes)

Each review category runs as an independent codex exec review process. Dramatically more thorough than single-agent review — tested at 330KB/21 findings across 4 agents vs 2.2KB from single agent.

Fallback: REVIEW_LOOP_SINGLE_AGENT=true for single process.

Self-review hook (from ClaudeKit)

Always-on Stop hook that forces Claude to self-review before stopping. Randomized questions from 4 focus areas avoid pattern fatigue. Skips when codex-review loop is active (Codex handles it instead).

File-scoped reviews (parallel agent safe)

A PostToolUse hook tracks every file modified by Edit/Write tools during the session. When the Stop hook fires, Codex only reviews files THIS agent changed — not the entire repo.

Multiple agents can work in parallel on different modules, each getting a review scoped to its own changes.

Project convention injection

Reads AGENTS.md or CLAUDE.md from repo root and injects project conventions into each Codex review prompt.

Auto-scoped dependency map

If codebase-map is installed, auto-derives impacted modules from changed files and injects a focused dependency graph. Monorepo-agnostic — detects apps/, services/, packages/ boundaries.

Real-time anti-pattern checks

PostToolUse hooks fire on every Edit:

Code→comment replacement — blocks replacing code with // removed comments
Unused parameter prefixing — blocks param → _param lazy refactoring

AI anti-pattern detection (in Codex review)

The diff review agent checks for: mocks/stubs to pass tests, hardcoded values that should use constants, code bolted on without integrating, over-engineered error handling, duplicate utility functions, unnecessary type assertions, feature flags where direct replacement fits.

Parallel quality checks

Lint and typecheck run in parallel with Codex review — zero wasted time. Auto-detects: biome/eslint for JS/TS, ruff for Python, tsc for typechecking.

Knowledge compounding

After addressing findings, Claude extracts reusable knowledge: routes lore to nearest AGENTS.md (Least Common Ancestor), logs session learnings to progress.txt.

Review coverage

N parallel Codex processes, one per review category:

Agent	Always runs?	Focus
Diff Review	Yes	Line-by-line code quality, test coverage, security, AI anti-patterns
Holistic Review	Yes	Architecture, module structure, documentation, agent readiness
Security Review	Yes	Auth, injection, data protection, rate limiting, OWASP
Test Coverage	Yes	Missing tests, test quality, anti-patterns, integration gaps
Next.js Review	If `next.config.*` or `"next"` in `package.json`	App Router, RSC, caching, Server Actions, React performance

Each agent gets file scope + project conventions + category-specific criteria. Outputs merged into reviews/review-<id>.md.

Requirements

Heads up: Each review loop spawns 4-5 parallel Codex processes. This burns through your OpenAI Codex token budget fast — a single /review-loop cycle can use 10-50x more tokens than a normal Codex session. Monitor your usage.

Claude Code (CLI)
jq — brew install jq (macOS) / apt install jq (Linux)
Codex CLI — npm install -g @openai/codex

Optional

codebase-map — npm install -g codebase-map (dependency maps)
ruff — pip install ruff (Python linting, auto-detected)

Installation

From the CLI:

claude plugin marketplace add dkorobtsov/codex-review
claude plugin install codex-review@dkorobtsov-review

Or from within a Claude Code session:

/plugin marketplace add dkorobtsov/codex-review
/plugin install codex-review@dkorobtsov-review

Usage

Start a review loop

/review-loop Add user authentication with JWT tokens and test coverage

Claude implements the task. When it finishes, the stop hook:

Collects files this agent modified (PostToolUse tracking)
Launches N parallel Codex reviews scoped to those files (+ lint/typecheck)
Injects AGENTS.md conventions + dependency map into each agent's prompt
Merges findings into reviews/review-<id>.md
Claude addresses findings
Claude extracts reusable knowledge → updates AGENTS.md + progress.txt

On-demand parallel review

/review-parallel

Runs 4 parallel Codex reviews on uncommitted changes. No review loop session needed.

Pre-implementation risk assessment

/review-plan path/to/plan.md

Takes a plan/spec/PRD markdown file and runs 4 parallel Codex agents in read-only sandbox mode:

Agent	Focus
Dependencies	API contracts, schema conflicts, import graph, config gaps
Architecture	Pattern fit, simplification, existing code reuse, refactor-first alternatives
Security	Attack surface, auth gaps, input validation, secrets, existing vulnerabilities
Testing	Coverage gaps, corner cases, failure modes, race conditions, rollback, scale

Each agent explores the codebase (read-only) and maps risks against the plan. Output: /tmp/codex-plan-*/risk-report.md.

Cancel a review loop

/cancel-review

How it works

Hooks

Hook	Event	Always-on?	Purpose
`track-modified.sh`	PostToolUse (Edit/Write)	Yes	Accumulate changed files per session
`check-comment-replacement.sh`	PostToolUse (Edit)	Yes	Block code→comment replacement
`check-unused-parameters.sh`	PostToolUse (Edit)	Yes	Block `_param` lazy refactoring
`self-review.sh`	Stop	Yes	Randomized self-review (4 focus areas)
`stop-hook.sh`	Stop	Loop only	N parallel Codex reviews + compounding

Three-phase Stop hook

Phase 1 (task → addressing):
  ├─ N parallel Codex reviews (scoped to agent's files)
  │   Context: AGENTS.md conventions + dependency map
  └─ Parallel: lint + typecheck

Phase 2 (addressing → compound):
  └─ Claude fixes review findings

Phase 3 (compound → done):
  ├─ Classify findings: reusable vs task-specific
  ├─ Update nearest AGENTS.md (Least Common Ancestor)
  └─ Append to progress.txt

File structure

plugins/codex-review/
├── .claude-plugin/
│   └── plugin.json                  # Plugin manifest
├── commands/
│   ├── review-loop.md               # /review-loop slash command
│   ├── review-parallel.md           # /review-parallel (on-demand, N agents)
│   ├── review-plan.md               # /review-plan (pre-implementation risk assessment)
│   └── cancel-review.md             # /cancel-review
├── hooks/
│   ├── hooks.json                   # Hook registration
│   ├── track-modified.sh            # PostToolUse: file tracking
│   ├── check-comment-replacement.sh # PostToolUse: code→comment check
│   ├── check-unused-parameters.sh   # PostToolUse: _param check
│   ├── self-review.sh               # Stop: always-on self-review
│   └── stop-hook.sh                 # Stop: review + compound lifecycle
├── scripts/
│   └── setup-review-loop.sh         # State file creation
└── AGENTS.md                        # Agent guidelines

Configuration

Environment variables

Variable	Default	Description
`REVIEW_LOOP_CODEX_FLAGS`	`--dangerously-bypass-approvals-and-sandbox`	Flags passed to `codex exec` (review)
`REVIEW_LOOP_PLAN_FLAGS`	`--full-auto`	Flags passed to `codex exec` (plan)
`REVIEW_LOOP_OUTPUT_DIR`	auto-resolved	Override output dir for learnings
`REVIEW_LOOP_SINGLE_AGENT`	`false`	Disable parallel (single codex process)
`REVIEW_LOOP_SKIP_COMPOUND`	`false`	Skip knowledge extraction phase
`REVIEW_LOOP_SKIP_QUALITY_CHECKS`	`false`	Skip parallel lint/typecheck
`REVIEW_LOOP_SKIP_MAP`	`false`	Skip codebase-map injection
`REVIEW_LOOP_SKIP_SELF_REVIEW`	`false`	Disable self-review hook
`REVIEW_LOOP_MAP_FORMAT`	`graph`	codebase-map format

Output directory

Learnings and progress stored in configurable directory:

Priority	Source	Example
1	`REVIEW_LOOP_OUTPUT_DIR` env	`/path/to/learnings`
2	`compound.config.json` → `outputDir`	`./scripts/compound`
3	Default	`.claude/learnings/`

Timeouts

Stop hook: 1800s (30 min) for parallel Codex reviews. Self-review: 30s. PostToolUse hooks: 5s. Adjust in hooks.json.

Telemetry

Execution logs: .claude/codex-review.log (timestamps, codex exit codes, elapsed times). Gitignored.

Credits

Original plugin by Hamel Husain. Compound engineering approach inspired by Ryan Carson's compound engineering loop. Knowledge compounding via Self-review ported from ClaudeKit by Carl Rannaberg.

Name		Name	Last commit message	Last commit date
Latest commit History 64 Commits
.claude-plugin		.claude-plugin
plugins/codex-review		plugins/codex-review
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md

Folders and files

Latest commit

History

Repository files navigation

codex-review

Why this fork

What it does

Always-on (every session)

On-demand (/review-parallel)

Pre-implementation (/review-plan)

Full loop (/review-loop)

Fork additions

Parallel Codex reviews (N separate processes)

Self-review hook (from ClaudeKit)

File-scoped reviews (parallel agent safe)

Project convention injection

Auto-scoped dependency map

Real-time anti-pattern checks

AI anti-pattern detection (in Codex review)

Parallel quality checks

Knowledge compounding

Review coverage

Requirements

Optional

Installation

Usage

Start a review loop

On-demand parallel review

Pre-implementation risk assessment

Cancel a review loop

How it works

Hooks

Three-phase Stop hook

File structure

Configuration

Environment variables

Output directory

Timeouts

Telemetry

Credits

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

On-demand (`/review-parallel`)

Pre-implementation (`/review-plan`)

Full loop (`/review-loop`)

Packages