MAP Framework

A structured AI development workflow for engineers who need control, not just generated code.

MAP turns Claude Code and Codex CLI from ad-hoc coding assistants into a repeatable engineering loop with explicit artifacts, review points, and project memory.

Without MAP:

idea -> prompt -> code -> hope

With MAP:

SPEC -> PLAN -> TEST -> CODE -> REVIEW -> LEARN

AI already writes code quickly. MAP helps you keep the architecture, scope, tests, and review process under control while it does.

Why MAP Exists

Ad-hoc prompting feels fast on simple tasks. On complex systems, it often creates a different problem: code appears quickly, but the engineering process disappears.

Common failure modes:

AI silently makes architecture decisions you did not approve.
One prompt produces a large diff that is hard to review.
Tests are written around the generated implementation, including its mistakes.
The output compiles, but you cannot explain why the design is correct.
The next session forgets the gotchas you already paid to discover.

MAP is a lightweight workflow for moving engineering judgment earlier: write down the behavior, split the work into small contracts, verify each stage, review against the spec, and save lessons for the next run.

When To Use MAP

Good fits	Poor fits
Complex backend features	Typos and tiny edits
Kubernetes controllers and operators	Small one-off scripts
Internal platform tooling	Product ideas where the desired behavior is still unknown
API, CRD, or domain-model changes with invariants	Broad rewrites without clear boundaries
Refactoring with a meaningful test harness	Tasks cheaper to do directly than to plan

Quick Start

1. Install

uv tool install mapify-cli

# or with pip
pip install mapify-cli

2. Initialize your project

Claude Code is the default provider:

cd your-project
mapify init
claude

Codex CLI is also supported:

cd your-project
mapify init . --provider codex
codex

Then enable the Codex hook manually: run /hooks, select PreToolUse, press t to toggle it on, then press Esc.

If your Codex version does not support the hooks feature key yet, either start it with codex --enable codex_hooks or upgrade Codex first. Upgrading is recommended.

Pick a context-compression policy if you want non-default behaviour (auto is the default; see docs/USAGE.md#context-budget-policy):

mapify init . --compression never                 # quality > cost
mapify init . --compression aggressive            # cost > quality
mapify init . --compression-threshold 250000      # Opus 1M project

Generated /map-efficient Actor context blocks and /map-review reviewer fan-out prompts are also bounded before they enter model calls. Override the default 4,000 Actor-context cap with MAP_CONTEXT_BLOCK_BUDGET_TOKENS, or the default 12,000 per-reviewer prompt cap with MAP_REVIEW_PROMPT_BUDGET_TOKENS, only when a large workflow genuinely needs more context. Values below the minimum fall back to the default so required wrappers and truncation notes stay present. When either active prompt path runs, MAP records the decision in .map/<branch>/token_budget.json so you can see which sections were clipped before deciding to raise a budget or split the workflow.

3. Use the golden path for serious work

When a task has unclear behavior, multiple files, or real review risk, run the full loop:

/map-plan define the behavior and split the task
/map-efficient implement the approved plan
/map-check
/map-review
/map-learn

Direct /map-efficient is for already-scoped tasks. If you are unsure, start with /map-plan.

Codex users should invoke MAP skills with $: type $map-plan, $map-fast, or $map-check instead of /map-plan, /map-fast, or /map-check. See the Usage Guide for provider details.

What Success Looks Like

After a good first workflow, you should see:

a written plan or spec before implementation starts;
small implementation contracts instead of one giant AI diff;
verification and review artifacts under .map/<branch>/;
review comments focused on correctness and semantics, not formatting noise;
/map-learn preserving project rules, gotchas, and handoffs for future sessions.

MAP review is useful, but it is not a replacement for engineering judgment. Serious changes still need human review. The goal is to make that review smaller, earlier, and better grounded.

Why Engineers Stick With It

Daily-driver speed - optimized for repeated use, not occasional demo workflows.
Low overhead - structured enough to prevent chaos, lightweight enough to keep token and time cost under control.
Compact high-traffic playbooks - /map-plan, /map-efficient, /map-check, and /map-review keep their active SKILL.md bodies focused on the next workflow action, with detailed examples and troubleshooting in bundled supporting files that load only when needed.
Calibrated workflow effort - each shipped slash skill declares a thinking_policy and parallel_tool_policy, so lightweight commands stay direct while planning, review, and release workflows reserve deeper reasoning and parallel fan-out for the stages that benefit from it. Non-release prompts use targeted guardrails instead of blanket all-caps prohibition blocks, reducing over-triggered agents and tool calls while keeping true hard stops explicit.
Mutation boundary constraints - write-capable Claude and Codex provider surfaces tell agents not to edit unrelated files, add or upgrade dependencies, or refactor neighboring code unless the current task/subtask explicitly requires it. If broader scope is necessary, agents should report it as a blocker or tradeoff instead of silently expanding the diff.
Context-first prompt envelopes - high-context /map-plan, /map-efficient, /map-debug, and /map-review agent prompts wrap branch artifacts in XML-style <documents>, then state the <task> and <expected_output>, so specs, review bundles, diffs, logs, and schemas stay visually and machine-semantically separated.
Skill IR audit - release checks can lower shipped Claude and Codex SKILL.md files into a typed SkillIR, verify content hashes, catch unsupported frontmatter, reject missing supporting-file links, and block injection-like instructions before mapify init copies provider surfaces into user repos.
Reviewable diffs - planning and task contracts keep changes small enough to inspect.
Contract-sized subtasks - /map-plan and /map-efficient require per-subtask expected_diff_size, concern_type, one_logical_step, hard_constraints, soft_constraints, and coverage_map metadata, then validate blueprint.json before implementation so oversized, mixed-concern, untraceable, or hard-constraint-dropping plans fail early instead of surprising reviewers later. Soft constraints can be consciously traded off only with explicit rationale.
Useful quality gates - /map-check and /map-review validate against the plan instead of just asking whether code "looks fine".
Review bundle - /map-review auto-generates .map/<branch>/review-bundle.json and .map/<branch>/review-bundle.md before launching reviewer agents, bundling spec, plan, tests, verification, latest code review, coverage_map acceptance-tag evidence, and prior-stage consumption status into a single durable input contract. Reviewers read the bundle first; raw diff is secondary and is clipped first when build_review_prompts has to keep each reviewer prompt under MAP_REVIEW_PROMPT_BUDGET_TOKENS. Use /map-review --detached to open an isolated read-only git worktree at .map/<branch>/detached-review/ for clean-room inspection without touching the source branch. Bundle generation records the review stage in .map/<branch>/artifact_manifest.json; python3 .map/scripts/map_step_runner.py validate_prior_stage_consumption review can fail missing spec/blueprint/test/diff inputs before review. Section-order flags reduce anchoring bias: --reverse-sections (invert order), --shuffle-sections [--seed N] (seeded random order), --compare-orderings (run default + reverse, aggregate via strict-wins, surface drift).
Token budget report - Actor context and review fan-out prompt builders append active-path budget decisions to .map/<branch>/token_budget.json, including before/after estimated tokens, clipped sections, source artifact references, and the budget environment variable. This is the operator breadcrumb for diagnosing missing context: inspect the report, then either raise MAP_CONTEXT_BLOCK_BUDGET_TOKENS / MAP_REVIEW_PROMPT_BUDGET_TOKENS or split the workflow before rerunning.
Clean retry quarantine - after repeated Monitor rejection, write-capable workflows switch the next Actor attempt into clean-retry mode and use .map/<branch>/retry_quarantine.json instead of raw failed-session context. The artifact preserves constraints, required evidence, and do-not-repeat feedback so retries can change approach without dropping the contract.
Run health report - /map-efficient, /map-debug, /map-check, and /map-review write .map/<branch>/run_health_report.json during closeout via .map/scripts/map_step_runner.py write_run_health_report. The report is a machine-readable snapshot of terminal status, step progress, retry counters, artifact presence, and latest hook-injection status, including explicit skipped reasons for malformed hook input or insignificant Bash commands when branch state can be safely updated. CI or operators can fail inconsistent closeouts with python3 .map/scripts/map_step_runner.py validate_run_health_report.
Compact recovery surface - /map-resume keeps the active recovery flow in a short SKILL.md and moves low-frequency examples, state-file notes, token-budget notes, and troubleshooting into resume-reference.md, so users recovering after /clear or context exhaustion get the next checkpoint action without loading the whole reference appendix.
Project memory - /map-learn turns hard-won fixes and gotchas into reusable context for the next session.

Core Commands

Command	Use For
`/map-plan`	Start here for non-trivial work; clarify behavior and decompose tasks
`/map-efficient`	Implement an approved plan or already-scoped task
`/map-fast`	Small, low-risk changes where full planning would be overhead
`/map-check`	Quality gates, verification, and artifact checks
`/map-review`	Pre-commit semantic review against the plan, tests, and diff
`/map-learn`	Capture project memory and reusable lessons
`/map-debug`	Bug fixes and debugging
`/map-task`	Execute a single subtask from an existing plan
`/map-tdd`	Test-first implementation workflow
`/map-release`	Package release workflow
`/map-resume`	Resume interrupted workflows

Detailed usage and options ->

Canonical MAP flows:

Standard: /map-plan -> /map-efficient -> /map-check -> /map-review -> /map-learn
Full TDD: /map-plan -> /map-tdd -> /map-check -> /map-review -> /map-learn
Targeted subtask TDD: /map-plan -> /map-tdd ST-001 -> /map-task ST-001 -> ... -> /map-check -> /map-review -> /map-learn

/map-plan records a workflow-fit decision first, so trivial work can exit early with a direct edit or /map-fast recommendation instead of forcing full MAP planning.

These workflows maintain branch-scoped artifacts like blueprint.json with subtask size/concern contracts, code-review-001.md, qa-001.md, verification-summary.md with acceptance coverage and prior-stage consumption, pr-draft.md, artifact_manifest.json, run_health_report.json diagnostics, and run dossiers under .map/<branch>/. Targeted TDD flows also persist test_contract_ST-00N.md and test_handoff_ST-00N.json so /map-task can resume implementation from a clean red-phase handoff.

LEARN is the memory step of the MAP cycle. After implementation, checks, or review, MAP writes a learning handoff under .map/<branch>/ so /map-learn can run later without reconstructing context. Pass /map-learn [workflow-summary] when you want to provide an explicit summary.

Case Study

The DevOpsConf 2026 case study applies this process to a production Kubernetes Project Operator, not a toy CRUD app:

human estimate: 90 days;
MAP-style delivery: 7 days;
workflow: SPEC -> PLAN -> TEST -> CODE -> REVIEW -> LEARN;
small reviewable PRs instead of one giant generated diff;
tests before implementation for critical pieces;
semantic bugs caught in review before merge.

DevOpsConf 2026 case study ->

How It Works

MAP orchestrates specialized roles through slash commands and skills:

TaskDecomposer -> breaks goals into subtasks
Actor          -> implements scoped tasks
Monitor        -> validates quality and blocks invalid output
Predictor      -> analyzes impact for risky changes
Learner        -> captures reusable project memory

For Claude Code, MAP slash surfaces live in .claude/skills/map-*/SKILL.md files created by mapify init. The .claude/commands/ directory is reserved for user-custom commands and a README that points back to the skill-backed MAP surfaces. The shipped skill catalog classifies skills with skillClass: MAP slash workflows are task skills, while map-state is a hybrid skill because it combines planning guidance with hooks/scripts that manage .map/<branch>/ focus artifacts.

For Codex CLI, mapify init . --provider codex creates .codex/skills/, .codex/agents/, .codex/config.toml, hooks, and shared .map/scripts/.

Maintainers can audit both provider skill trees with python -m mapify_cli.skill_ir src/mapify_cli/templates/skills src/mapify_cli/templates/codex/skills. The command exits non-zero if a shipped skill has unsupported frontmatter, unresolved bundled references, or hidden prompt-injection-like text.

MAP is inspired by MAP cognitive architecture (Nature Communications, 2025), which reported a 74% improvement in planning tasks. The CLI turns that idea into a practical software development workflow.

Architecture deep-dive ->

Documentation

Guide	Description
Installation	All install methods, PATH setup, troubleshooting
Usage Guide	Workflows, examples, cost optimization, playbook
Architecture	Agents, MCP integration, customization
Platform Spec	Platform refactor roadmap, codebase analysis

Trouble?

Command not found -> Run mapify init in your project first.
Agent errors -> Check .claude/agents/ has all shipped agent .md files, or run mapify doctor.
Poor output on a complex task -> Start with /map-plan and feed /map-efficient the approved plan instead of asking it to infer the architecture.
More help ->

Contributing

Improvements welcome: prompts for specific languages, new agents, provider integrations, and CI/CD workflow support.

License

MIT

Start with /map-plan. Keep the model inside your engineering process, not the other way around.

Name		Name	Last commit message	Last commit date
Latest commit History 730 Commits
.claude-plugin		.claude-plugin
.claude		.claude
.codex		.codex
.github		.github
.map		.map
.review_output		.review_output
.vscode		.vscode
docs		docs
scripts		scripts
src/mapify_cli		src/mapify_cli
tests		tests
.gitignore		.gitignore
.mcp.json.example		.mcp.json.example
AGENTS.md		AGENTS.md
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
IMPLEMENTATION_SUMMARY.md		IMPLEMENTATION_SUMMARY.md
MANIFEST.in		MANIFEST.in
Makefile		Makefile
README.md		README.md
RELEASING.md		RELEASING.md
pyproject.toml		pyproject.toml
pytest.ini		pytest.ini
requirements-test.txt		requirements-test.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MAP Framework

Why MAP Exists

When To Use MAP

Quick Start

What Success Looks Like

Why Engineers Stick With It

Core Commands

Case Study

How It Works

Documentation

Trouble?

Contributing

License

About

Uh oh!

Releases 33

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

MAP Framework

Why MAP Exists

When To Use MAP

Quick Start

What Success Looks Like

Why Engineers Stick With It

Core Commands

Case Study

How It Works

Documentation

Trouble?

Contributing

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases 33

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages