cladding

Unified Governance for AI-Coupled Engineering.
AI-generated code, held to the same bar as human code.

Reference implementation of the Ironclad standard. 33 detectors and a 13-stage gate verify, on every commit, that the code your AI assistant wrote still matches the spec.

Setup	Traps caught	Rate
Vanilla AI coding	2/8	25%
cladding	8/8	100%

Same spec · same model · event-sourcing store benchmark.

Quick start

npm install -g cladding   # global CLI
clad setup                # global wiring — Claude / Codex / Gemini / Cursor
# then inside your project, from your AI tool:
/cladding:init "your project intent"

Full install options, marketplace route, and host channel table ↓

Why

The why fades after 3 months

The reason an AI assistant wrote code a certain way doesn't survive in the code alone.

→ spec/features/*.yaml becomes the permanent record of why.

✓ AI context survives time — six months later, the AI reconstructs intent straight from the spec (new hires get the same entry point).

AI gives a different answer each time

The same spec produces code with inconsistent patterns and structure.

→ The spec becomes the fixed reference against which every commit is checked.

✓ Enterprise-ready consistency — code style and patterns stay aligned across teams and PRs.

AI hallucination

Generated code calls APIs, functions, or options that don't exist.

→ 33 detectors and a 13-stage gate block hallucinated code on every commit.

✓ Production incidents prevented up front — CI auto-rejects hallucinated code before it merges.

What you get

How a vanilla AI coding environment and a cladding environment behave when the same situation comes up.

Situation	Vanilla AI coding	cladding
Code drifts from spec	fixed if a reviewer notices	auto-blocked on every commit
Two devs build the same feature in parallel	merge conflicts	hash-based IDs route to separate files → 0 conflicts
Who verifies AI-written code?	the AI that wrote it (risky)	a separate reviewer agent — duties split
Switching AI tools (Claude → Cursor)	reconfigure per tool	one spec → mirrored across 4 hosts
Spec authority	the AI reinterprets it each time	the sealed spec is the single source of truth

The 8/8 vs 2/8 result at the top of this page comes from an early benchmark (details); larger-scale measurements are in progress.

How it works

Spec → Code → Tests runs as a single cycle — the spec captures the why, Iron Law verifies the implementation, and Drift Detection blocks anything that no longer matches.

Spec → Code → Tests as a single cycle — one feature's lifecycle

1. Spec — SSoT, single source of intent

The spec is where the why (what we're building and why) lives. A 4-tier (A/B/C/D) Single Source of Truth — intent on top, implementation below.

Tier	Role	Who edits	Authority
A — Spec	intent (what to build)	humans only	sealed · LLMs cannot edit
B — Design	design (how to build it)	humans freely	checked against A
C — Derived	implementation (code · tests)	LLMs and humans	regenerated by reading the code
D — Audit	audit log (what actually happened)	append-only	immutable

A outranks B — if code and spec disagree, the code is wrong. The spec is sealed because changing the why shakes everything downstream, so LLMs are kept out.
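
The tier table above can be read as a small permission rule. The sketch below is only an illustration of that reading, not cladding's implementation: the `Tier`, `Actor`, and `Operation` types and the `isAllowed` function are hypothetical names, and treating tier B as human-only is an assumption based on the table listing only humans as its editors.

```typescript
type Tier = "A" | "B" | "C" | "D";
type Actor = "human" | "llm";
type Operation = "edit" | "append";

// Rules transcribed from the tier table; names and shape are hypothetical.
function isAllowed(tier: Tier, actor: Actor, op: Operation): boolean {
  switch (tier) {
    case "A":
      return actor === "human"; // sealed: LLMs cannot edit the spec
    case "B":
      return actor === "human"; // assumption: the table lists only humans
    case "C":
      return true; // derived code and tests: LLMs and humans
    case "D":
      return op === "append"; // audit log: append-only, never rewritten
  }
}

console.log(isAllowed("A", "llm", "edit"));
console.log(isAllowed("C", "llm", "edit"));
console.log(isAllowed("D", "human", "edit"));
```

Keeping the rule as one pure function makes the "A outranks B" ordering easy to audit: a rejected tier-A edit by an LLM is a policy decision, not a side effect of file permissions.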

Sharded · multi-dev safe — spec/features/<slug>-<hash6>.yaml puts each feature in its own file with a 6-char hash ID (e.g. F-5f6b45). Two devs creating new features at the same time land in different files with different IDs — zero merge conflicts. Details: Hash-based feature IDs.
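
To make the sharding idea concrete, here is a minimal sketch of how a `<slug>-<hash6>.yaml` path and an `F-<hash6>` ID could be derived. The hash input is an assumption (SHA-256 over the slug plus a random nonce, truncated to 6 hex characters); cladding's real derivation may differ, and `slugify` and `newFeature` are hypothetical helper names.

```typescript
import { createHash, randomBytes } from "node:crypto";

// Hypothetical helper: turn a feature title into a filesystem-safe slug.
function slugify(title: string): string {
  return title
    .toLowerCase()
    .replace(/[^a-z0-9]+/g, "-")
    .replace(/^-+|-+$/g, "");
}

// Assumption: hash6 = first 6 hex chars of SHA-256(slug + random nonce).
function newFeature(
  title: string,
  nonce: Uint8Array = randomBytes(16),
): { id: string; path: string } {
  const slug = slugify(title);
  const hash6 = createHash("sha256")
    .update(slug)
    .update(nonce)
    .digest("hex")
    .slice(0, 6);
  return { id: `F-${hash6}`, path: `spec/features/${slug}-${hash6}.yaml` };
}

// Two developers adding a feature with the same title get independent files.
const fromAlice = newFeature("Refund flow");
const fromBob = newFeature("Refund flow");
console.log(fromAlice.path);
console.log(fromBob.path);
```

Because each new feature lands in its own file, two branches never touch the same lines. Under this sketch's assumption, six hex characters give 16^6 (about 16.7 million) possible IDs, so a clash is unlikely but not impossible.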

4-tier SSoT — A(Spec) → B(Design) → C(Derived) → D(Audit), A outranks B

2. Code — Iron Law (required) gate

Every change has to clear all 13 stages — typically called from CI, a git pre-push hook, or manual clad check. Each stage ships with its own unit tests.

13-stage Iron Law gate — every change must clear static(6) · test(2) · e2e(3) · evidence(2) wherever clad check runs (CI / git hook / manual)

Stage	What it checks
1.1 Type · 1.2 Lint	type errors · code style
1.3 Drift	spec ↔ code mismatches across 33 detectors
1.4 Commit · 1.5 Arch · 1.6 Secret	clean working tree · architecture invariants (forbidden imports, etc.) · leaked API keys
2.1 Unit · 2.2 Cov	unit tests pass · project coverage threshold
3.1 Smoke · 3.2 Perf · 3.3 Visual	end-to-end critical paths · performance budgets · visual regression
4.1 Audit · 4.2 UAT	every AC (acceptance criterion) has at least one piece of evidence · every `status=done` feature has at least one piece of evidence
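
As a rough mental model of the gate, the sketch below runs the 13 stages from the table and passes only when every one of them is green. The stage IDs and names come from the table; the `Stage` type, the `runGate` function, and the stubbed checks are hypothetical and stand in for whatever `clad check` actually does.

```typescript
type Stage = { id: string; name: string; run: () => boolean };
type StageResult = { id: string; name: string; ok: boolean };

// Stage IDs and names as listed in the table above.
const stageNames: ReadonlyArray<readonly [string, string]> = [
  ["1.1", "Type"], ["1.2", "Lint"], ["1.3", "Drift"],
  ["1.4", "Commit"], ["1.5", "Arch"], ["1.6", "Secret"],
  ["2.1", "Unit"], ["2.2", "Cov"],
  ["3.1", "Smoke"], ["3.2", "Perf"], ["3.3", "Visual"],
  ["4.1", "Audit"], ["4.2", "UAT"],
];

// Hypothetical runner: a change clears the gate only if all stages pass.
function runGate(stages: Stage[]): { passed: boolean; results: StageResult[] } {
  const results = stages.map((s) => ({ id: s.id, name: s.name, ok: s.run() }));
  return { passed: results.every((r) => r.ok), results };
}

// Stubbed checks for illustration: everything passes except 1.3 Drift.
const demoStages: Stage[] = stageNames.map(([id, name]) => ({
  id,
  name,
  run: () => id !== "1.3",
}));

const gate = runGate(demoStages);
const failing = gate.results.filter((r) => !r.ok).map((r) => `${r.id} ${r.name}`);
console.log(gate.passed ? "gate passed" : `gate blocked by: ${failing.join(", ")}`);
```

This sketch runs every stage and reports all failures together. Whether the real gate stops at the first failure is not stated here, so that choice is an assumption.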

3. Tests — 33 drift detectors

Seven categories of mismatch across spec · code · test, all caught automatically. Full catalog: src/stages/detectors/README.md.

Category	What it catches	Count	Representative detectors
spec ↔ code drift	something in the spec missing from code, or in code with nothing in the spec	7	`UNMAPPED_ARTIFACT`, `MISSING_IMPLEMENTATION`, `AC_DRIFT`, `PLANNED_BACKLOG`
code ↔ test	code without tests · coverage falling below threshold	6	`MISSING_TESTS`, `COVERAGE_DROP`, `HARDCODED_SECRET`
spec ↔ test	an AC in the spec that no test actually verifies	5	`UNTESTED_AC`, `STATUS_DRIFT`, `SCENARIO_COVERAGE`
spec maintenance	spec hygiene — slug collisions, ID duplicates, dependency cycles	6	`SLUG_CONFLICT`, `ID_COLLISION`, `INVENTORY_DRIFT`, `DEPENDENCY_CYCLE`
environment integrity	build environment and meta-file integrity	3	`HARNESS_INTEGRITY`, `META_INTEGRITY`
architecture · capability	code that breaks the architecture or capability shape declared in the spec	2	`ARCHITECTURE_FROM_SPEC`, `CAPABILITIES_FEATURE_MAPPING`
governance · policy	code that breaks an `ai_hints` policy, or a hollow / unrefined governance tier	4	`AI_HINTS_FORBIDDEN_PATTERN`, `HOLLOW_GOVERNANCE`, `PROJECT_CONTEXT_DRIFT`
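
To show what a spec ↔ test detector might look like, here is a minimal sketch in the spirit of `UNTESTED_AC`: it reports every acceptance criterion that no test claims to cover. The data shapes (`Feature`, `TestCase`, `Finding`) and the `covers` field are assumptions made for illustration; the actual detector in src/stages/detectors may gather and match this information differently.

```typescript
// Hypothetical shapes: the real spec and test metadata may look different.
type Feature = { id: string; acceptanceCriteria: string[] };
type TestCase = { name: string; covers: string[] };
type Finding = { detector: "UNTESTED_AC"; featureId: string; ac: string };

function detectUntestedAc(features: Feature[], tests: TestCase[]): Finding[] {
  // Collect every AC id that at least one test claims to verify.
  const covered = new Set<string>();
  for (const t of tests) {
    for (const ac of t.covers) covered.add(ac);
  }

  // Any AC in the spec that is not in that set is a finding.
  const findings: Finding[] = [];
  for (const f of features) {
    for (const ac of f.acceptanceCriteria) {
      if (!covered.has(ac)) {
        findings.push({ detector: "UNTESTED_AC", featureId: f.id, ac });
      }
    }
  }
  return findings;
}

const demoFeatures: Feature[] = [
  { id: "F-5f6b45", acceptanceCriteria: ["AC-1", "AC-2"] },
];
const demoTests: TestCase[] = [{ name: "refund succeeds", covers: ["AC-1"] }];

const untested = detectUntestedAc(demoFeatures, demoTests);
console.log(untested);
```

A detector written as a pure function from spec and test metadata to a list of findings is easy to unit test, and an empty list is what "drift = 0" would mean for this category.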

4. Cycle — one feature's lifecycle

The 4 steps that wrap Spec → Code → Test into a single cycle. Merge if drift is 0, block otherwise.

One feature's lifecycle — Define → Sync → Implement → Verify, merge if drift=0 / block otherwise
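
The lifecycle can be sketched as a four-step sequence followed by a single decision. The step names are taken from the diagram caption above; `nextStep` and `decide` are hypothetical names, and the drift count is assumed to be the number of findings reported by the detectors.

```typescript
const steps = ["Define", "Sync", "Implement", "Verify"] as const;
type Step = (typeof steps)[number];

// Advance through the cycle; after Verify there is nothing left to do.
function nextStep(step: Step): Step | "done" {
  const following = steps[steps.indexOf(step) + 1];
  return following ?? "done";
}

// The rule stated above: merge if drift is 0, block otherwise.
function decide(driftCount: number): "merge" | "block" {
  return driftCount === 0 ? "merge" : "block";
}

console.log(nextStep("Define"), nextStep("Verify"));
console.log(decide(0), decide(2));
```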

Multi-Agent Workflow

cladding is a 5-agent system working in concert. The agents that build are kept separate from the agents that verify — so no agent ever signs off on its own work. That split maps cleanly to compliance regimes (EU AI Act · K-AI Framework · SOX).

5 personas with CQS — orchestrator dispatches, librarian/specialist/reviewer act, observability watches metrics
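
The separation-of-duties rule can be expressed as a simple check on who authored and who approved a change. The persona names come from the diagram caption above. The `Change` record and `checkSeparation` function are hypothetical, and the second rule (only the reviewer approves) is an assumption about how the split is enforced.

```typescript
type Persona =
  | "orchestrator"
  | "librarian"
  | "specialist"
  | "reviewer"
  | "observability";

// Hypothetical record of one change passing through the agents.
type Change = { featureId: string; authoredBy: Persona; approvedBy: Persona };

function checkSeparation(change: Change): string[] {
  const problems: string[] = [];
  // Core rule from the text: no agent signs off on its own work.
  if (change.authoredBy === change.approvedBy) {
    problems.push(`${change.featureId}: ${change.authoredBy} approved its own work`);
  }
  // Assumption: sign-off is reserved for the reviewer persona.
  if (change.approvedBy !== "reviewer") {
    problems.push(`${change.featureId}: approved by ${change.approvedBy}, not reviewer`);
  }
  return problems;
}

const okChange: Change = { featureId: "F-5f6b45", authoredBy: "specialist", approvedBy: "reviewer" };
const selfApproved: Change = { featureId: "F-5f6b45", authoredBy: "specialist", approvedBy: "specialist" };

console.log(checkSeparation(okChange));
console.log(checkSeparation(selfApproved));
```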

Ecosystem

cladding sits at the intersection of four existing categories.

How cladding differs from the neighbors

Spec Kit · OpenSpec · Tessl · Kiro help you write a good spec. cladding goes further — it verifies on every commit that the code still matches that spec.
BMAD · ChatDev · Claude Code Agent Teams are about splitting work across multiple AI agents. cladding's 5 agents take that further by tying spec, code, and audit log into the same loop.
tdd-guard forces test-first development. That's roughly what the Unit · Coverage stages do inside cladding's 13-stage gate.
OpenHands · Cline · Aider · Goose are runners — they tell the AI to write code. cladding is the governance layer that verifies and controls what those runners produce.

cladding's edge is the combination — it folds the strongest parts of all four categories into one verification loop.

Install

Two steps: install the infrastructure, then create the project spec.

Step 1 — Install the infrastructure

Pick the route that fits how you work — both land in the same place:

(a) npm — for terminal / CI users

npm install -g cladding   # install the cladding CLI (global)
clad setup                # connect your AI tools (global — Claude / Codex / Gemini / Cursor)
cd <project>              # for the next step (clad setup itself is project-agnostic)

(b) Marketplace — for AI-tool plugin users

1. Open the plugin marketplace inside your AI tool (Claude Code · Codex CLI · Gemini CLI).
2. Search for cladding and install it.
3. No clad setup needed: the plugin manifest wires everything.

Where clad setup connects (5 host channels)

Host (when detected)	Wired location	Auto-activation
Claude Code (`~/.claude/`)	`~/.claude/plugins/cladding`	`claude plugin marketplace add` + `claude plugin install claude-code@cladding`
Codex CLI skills (`~/.agents/`)	`~/.agents/skills/cladding-*`	(auto on Codex restart)
Codex CLI MCP server (`~/.codex/`)	`[mcp_servers.cladding]` in `~/.codex/config.toml`	(TOML entry itself)
Gemini CLI (`~/.gemini/`)	`~/.gemini/extensions/cladding`	`gemini extensions link`
Cursor (`~/.cursor/`)	`mcpServers.cladding` in `~/.cursor/mcp.json`	(JSON entry itself)

clad setup invokes the per-host activation commands automatically when claude / gemini binaries are on PATH. Safe to re-run after a cladding upgrade or after installing another AI tool.

About the MCP server. Every host gets cladding wired as an MCP server; only the wire location differs. Claude Code and Gemini CLI auto-start it through the plugin/extension manifest's mcpServers field; Codex through ~/.codex/config.toml [mcp_servers.cladding]; Cursor through ~/.cursor/mcp.json. You never invoke MCP directly: there is no /mcp slash command and no manual server-connect step. The AI in each host calls cladding's tools (clad_create_feature, etc.) in response to natural-language requests; you keep typing /cladding:init plus normal chat.
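
If you want to confirm the Cursor wiring by hand, a small check like the one below is enough. It only tests for the `mcpServers.cladding` key named in the table. The `hasCladdingEntry` function is a hypothetical helper, and the empty object in the sample stands in for whatever fields clad setup actually writes, which are not shown here.

```typescript
// Hypothetical helper: does a parsed mcp.json contain mcpServers.cladding?
function hasCladdingEntry(config: unknown): boolean {
  if (typeof config !== "object" || config === null) return false;
  const servers = (config as Record<string, unknown>)["mcpServers"];
  return typeof servers === "object" && servers !== null && "cladding" in servers;
}

// Placeholder content: the real entry written by clad setup is not reproduced.
const wired: unknown = JSON.parse('{"mcpServers": {"cladding": {}}}');
const notWired: unknown = JSON.parse('{"mcpServers": {}}');

console.log(hasCladdingEntry(wired));
console.log(hasCladdingEntry(notWired));
```

In practice you would read `~/.cursor/mcp.json` from disk and pass the parsed result to the same function.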

Benchmark. v0.4.0 measurements show ~60% consistency improvement and ~50% LOC reduction vs unguided AI coding on a fixed task, with 100% drift detection across a 5-iteration dev cycle. Full methodology and honest caveats (some of the consistency gain is the "more-specific-prompt" effect, not exclusively cladding) in docs/benchmarks/v0.4.0-consistency-bench.md.

Step 2 — Init (create the project spec)

Inside your project, run it once from your AI tool:

[inside your AI tool] /cladding:init "B2B payment SaaS"

This creates your project's spec.yaml and its supporting docs — one time per project.

Three init scenarios

/cladding:init takes a natural-language intent and picks the right path on its own. Same command, three starting points.

Starting point	Command	What happens
An idea, nothing else	`/cladding:init "I want to build a B2B payment SaaS"`	LLM infers the domain → spec · docs · policies generated, with 2–3 follow-up questions printed
A planning doc	`/cladding:init docs/plan.md`	cladding detects the file path, loads its contents, and uses them as the intent (absolute and relative paths both work)
Adopting into an existing project	`/cladding:init "apply cladding to this project"`	scans the existing code (≥3 source files trigger it) → observed patterns are merged with the intent
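
One way to picture how a single command covers all three starting points is a small classifier over the argument and the project state. This is a sketch under stated assumptions, not cladding's logic: `classifyInit`, the `InitEnv` shape, and the precedence (file path first, then existing code, then plain idea) are all hypothetical. Only the ≥3 source files threshold comes from the table.

```typescript
type Scenario = "idea" | "planning-doc" | "existing-project";

// Hypothetical view of the project, injected so the function stays pure.
type InitEnv = { isFile: (path: string) => boolean; sourceFileCount: number };

function classifyInit(arg: string, env: InitEnv): Scenario {
  // A path to an existing file is treated as a planning doc.
  if (env.isFile(arg)) return "planning-doc";
  // Threshold from the table: at least 3 source files trigger a code scan.
  if (env.sourceFileCount >= 3) return "existing-project";
  return "idea";
}

const emptyRepo: InitEnv = { isFile: (p) => p === "docs/plan.md", sourceFileCount: 0 };
const legacyRepo: InitEnv = { isFile: () => false, sourceFileCount: 42 };

console.log(classifyInit("I want to build a B2B payment SaaS", emptyRepo));
console.log(classifyInit("docs/plan.md", emptyRepo));
console.log(classifyInit("apply cladding to this project", legacyRepo));
```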

Init once, then carry on

Run init once and you're set — after that, just keep coding. cladding works in the background to keep your code and spec in sync, so there are no extra commands to remember.

Upgrading

npm update -g cladding     # 1. install the new cladding (marketplace: also `claude plugin update`)
cd <your project>          # 2. once per project
clad update                # 3. bring this project in line with the new version

After upgrading, run clad update once in each project. It never changes your code, spec.yaml, or docs, so it's always safe — and if the newer version is stricter, it just points that out (it won't block or fix anything).

Status

Metric	Value	Note
version	v0.5.0	2026-06
conformance	L4	top tier · self-declared
tests	1105/1105	all pass
coverage	93.89%+	enforced
features	148	spec'd

112 test files · installable from the Claude Code · OpenAI Codex · Gemini CLI marketplaces.

Road to Ironclad 1.0 — 1.0 locks when two independent implementations pass the L4 conformance fixtures (GOVERNANCE § 1). cladding is the first one.

Docs

License

MIT. LICENSE · Related: Ironclad (the standard cladding implements) · harness-boot (the seed project).
