Commit d095845

chore(dev): implement spec annotator pipeline
1 parent 838d6f6 commit d095845

16 files changed

Lines changed: 2566 additions & 0 deletions

File tree

.gitignore

Lines changed: 2 additions & 0 deletions
```diff
@@ -1,4 +1,6 @@
 .claude/
+.reviews/
+__pycache__/
 node_modules/
 .DS_Store
 .idea/
```

plugins/mcp-spec/README.md

Lines changed: 87 additions & 0 deletions
@@ -34,3 +34,90 @@ Search across MCP GitHub discussions, issues, and pull requests to find relevant

**Note:** The skill searches both open AND closed issues/PRs, which is important for understanding past decisions and historical context.

### `/spec-annotate <sep_number> [mode] [commit_range]`

Orchestrates the full SEP annotation pipeline: reads the SEP, fetches the PR diff, extracts requirements, annotates hunks against requirements, and renders a self-contained HTML report.

| Argument       | Required | Default  | Description                                                |
| -------------- | -------- | -------- | ---------------------------------------------------------- |
| `sep_number`   | Yes      | —        | SEP number (e.g., 1686)                                    |
| `mode`         | No       | `review` | `review` = fresh extraction; `validator` = reuse meta-spec |
| `commit_range` | No       | —        | Local git range (e.g., `abc..def`). Omit for PR mode.      |

**Output:** `.reviews/SEP-{number}/annotated-diff.html` (plus `meta-spec.json` and `annotations.json`)

**Example:**

```
/spec-annotate 1686
/spec-annotate 1686 validator
/spec-annotate 1686 review abc123..def456
```

### `/spec-update <sep_number> <action> <details>`

Updates an existing meta-spec by adding, removing, modifying, or recategorizing requirements. Preserves existing requirements and offers to re-annotate after changes.

| Argument     | Required | Description                                  |
| ------------ | -------- | -------------------------------------------- |
| `sep_number` | Yes      | SEP number                                   |
| `action`     | Yes      | `add`, `remove`, `modify`, or `recategorize` |
| `details`    | Yes      | Natural language description of the change   |

**Example:**

```
/spec-update 1686 add "Servers MUST send progress notifications for long-running tasks"
/spec-update 1686 recategorize "R005 from must-change to may-change"
```

### `/spec-orchestrate <sep_number> [max_iterations]`

Iteratively runs spec review and implementation in a feedback loop until all requirements are satisfied or conflicts are escalated to the user.

| Argument         | Required | Default | Description                     |
| ---------------- | -------- | ------- | ------------------------------- |
| `sep_number`     | Yes      | —       | SEP number                      |
| `max_iterations` | No       | 3       | Maximum review-implement cycles |

**Example:**

```
/spec-orchestrate 1686
/spec-orchestrate 1686 5
```

## Agents

### `spec-reviewer`

Runs the full annotation pipeline (extract/reuse meta-spec, annotate diff, render HTML). Launched by `/spec-annotate` and `/spec-orchestrate`.

### `spec-qa`

Quality gate agent that audits annotation artifacts against a 17-point checklist covering requirements quality (EARS format, specific actors, affected paths), annotation quality (no empty explanations, multi-hunk synthesis, no cross-product noise), and completeness. Returns a pass/fail verdict with specific issues. Launched by `/spec-annotate` and `/spec-orchestrate` after the reviewer finishes.

### `spec-implementer`

Reads the meta-spec and annotations, then edits schema and documentation files to satisfy unaddressed or violated requirements. Launched by `/spec-orchestrate`.

## Internal Skills (not user-invocable)

These skills provide instructions followed inline by the orchestrator:

- **`spec-extract`** — Extracts structured requirements from SEP markdown
- **`spec-diff`** — Annotates diff hunks against meta-spec requirements
- **`spec-render`** — Populates the HTML template with annotation data

## Annotation Output

All artifacts are written to `.reviews/SEP-{number}/` (gitignored by default):

| File                  | Description                                    |
| --------------------- | ---------------------------------------------- |
| `meta-spec.json`      | Structured requirements extracted from the SEP |
| `annotations.json`    | Per-hunk annotations with coverage status      |
| `annotated-diff.html` | Self-contained HTML report for sharing         |

The HTML artifact uses a three-column layout (annotations | diff | issues) with GitHub dark theme colors, and can be published to a GitHub Gist for sharing with other reviewers.
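As a rough sketch of how these artifacts can be inspected programmatically (field names here are assumptions drawn from the QA checklist, not a documented schema):

```python
import json
from pathlib import Path


def coverage_summary(review_dir: str) -> dict:
    """Summarize annotation coverage for a .reviews/SEP-{n}/ directory.

    Assumes annotations.json carries a top-level `summary` dict with
    per-status counts, as described by the spec-qa checklist; adjust
    if the real schema differs.
    """
    annotations = json.loads(Path(review_dir, "annotations.json").read_text())
    summary = annotations.get("summary", {})
    statuses = ("satisfied", "violated", "unclear", "not_addressed")
    total = sum(summary.get(status, 0) for status in statuses)
    return {"total": total, **summary}
```

For example, `coverage_summary(".reviews/SEP-1686")` would return the per-status counts plus their total.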
plugins/mcp-spec/agents/spec-implementer.md

Lines changed: 40 additions & 0 deletions

@@ -0,0 +1,40 @@

---
name: spec-implementer
model: sonnet
description: Use this agent to implement spec changes that satisfy meta-spec requirements. Reads the meta-spec for a SEP, identifies unaddressed or violated requirements, and edits schema and doc files to fulfill them. Does NOT modify the meta-spec itself.
---

You are a Spec Implementation Agent. Your job is to make edits to the MCP specification files so that unaddressed or violated requirements from a SEP's meta-spec are satisfied.

**REQUIRED SKILLS:** Load these skills before starting work:

1. `spec-extract` — understand the meta-spec format and requirement categories
2. `spec-diff` — understand annotation statuses and what "satisfied" means for each requirement
3. `search-mcp-github` — search for prior PRs and discussions that may inform implementation decisions

## Input

You will receive a SEP number. Read the following files from `.reviews/SEP-{n}/`:

- `meta-spec.json` — the extracted requirements
- `annotations.json` — current annotation status

## Workflow

1. Read both files and identify requirements with status `not_addressed` or `violated`
2. For each such requirement, read its `affected_paths` to understand which files need changes
3. Read the current content of those files
4. Make the edits needed to satisfy the requirement, following the patterns and conventions already present in the file
5. After all edits, run `npm run generate:schema` to regenerate derived files
6. Run `npm run check:schema` to validate the changes

## Constraints

- Edit only files listed in `affected_paths` for the requirements you are addressing, plus any files that `npm run generate:schema` would regenerate
- Do NOT modify `meta-spec.json` or `annotations.json` — those belong to the reviewer
- Follow existing code style and patterns in each file you edit
- If a requirement cannot be satisfied without violating another requirement, report the conflict in your response rather than making a compromised edit

## Output

Return a summary of what you changed: which requirements you addressed, which files you edited, and any conflicts you encountered.
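The triage in step 1 of the workflow above could be sketched as follows (the exact shape of `annotations.json` is an assumption: a top-level `annotations` dict mapping requirement ID to an object with a `status` field):

```python
import json
from pathlib import Path


def pending_requirements(review_dir: str) -> list[str]:
    """Return IDs of requirements whose annotation status is
    `not_addressed` or `violated` — i.e., the ones the implementer
    needs to act on. Schema assumptions as noted above.
    """
    data = json.loads(Path(review_dir, "annotations.json").read_text())
    return [
        req_id
        for req_id, annotation in data.get("annotations", {}).items()
        if annotation.get("status") in ("not_addressed", "violated")
    ]
```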

plugins/mcp-spec/agents/spec-qa.md

Lines changed: 81 additions & 0 deletions
@@ -0,0 +1,81 @@

---
name: spec-qa
model: sonnet
description: Use this agent as a quality gate on annotation artifacts. It validates that meta-spec requirements are well-formed (EARS format, specific actors, affected paths), annotations are thorough (no empty explanations, no cross-product noise, multi-hunk synthesis), and the overall review is complete. Returns a pass/fail verdict with specific issues to fix.
---

You are a QA Agent for SEP annotation artifacts. Your job is to audit the quality of `meta-spec.json` and `annotations.json` and return a structured verdict.

## Input

You will receive a SEP number. Read these files from `.reviews/SEP-{n}/`:

- `meta-spec.json` — extracted requirements
- `annotations.json` — annotation data
- The original SEP from `seps/{n}-*.md`

## Checklist

Run through every check below. For each failure, record the requirement ID and a specific description of the problem.

### Requirements Quality (meta-spec.json)

1. **EARS format**: Every requirement's `summary` follows an EARS pattern (When/While/If/Where/The [actor] shall [action]). Flag summaries that are vague noun phrases ("Task ID handling") or missing an actor.
2. **Specific actors**: The actor in each summary is a concrete party (receiver, requestor, server, client) — not "the system," "implementations," or passive voice.
3. **Affected paths present**: Every requirement has at least one entry in `affected_paths`. Empty arrays are failures.
4. **Source quotes present**: Every requirement has a non-empty `source.quote`. The quote should be verbatim from the SEP (spot-check a few against the actual SEP text).
5. **Group coherence**: Requirements within the same `group` are genuinely related. Flag requirements that seem miscategorized.
6. **Keyword count match**: The total requirement count should approximately match the number of bolded RFC 2119 keywords in the SEP's specification sections (check the `extraction_log` if present).

### Annotation Quality (annotations.json)

7. **No empty explanations**: Every annotation (including `not_addressed`) has a non-empty `explanation` field.
8. **Explanation specificity**: Spot-check at least 5 satisfied annotations — each explanation should name specific code/text from the hunks it references. Flag generic explanations like "Documentation discusses X" or "Adds support for Y."
9. **Multi-hunk synthesis**: For annotations with 3+ hunks, the explanation should reference what each hunk contributes. Flag annotations whose explanation doesn't mention the multiple locations.
10. **No cross-product noise**: No requirement should be annotated on more than 8 hunks. Flag any that exceed this — it likely means the agent matched too broadly.
11. **Reasonable annotation density**: Total annotations across all hunks should be roughly 1-3x the requirement count. If total annotations exceed 5x requirements, the matching was too aggressive.
12. **Not-addressed explanations**: Every `not_addressed` annotation explains _why_ — was the feature removed? Is it a behavioral guideline? Deferred? Flag empty or unexplained not-addressed items.
13. **Patch text present**: Spot-check that hunks in the top-level `files` array have non-empty `patch_text` fields. Note: the `hunks` arrays inside individual annotations in the `annotations` dict intentionally only contain `file` and `hunk_header` (they are references, not full data). Only check the `files` array for `patch_text`.

### Completeness

14. **Bidirectional hunk links**: Every annotation with status `satisfied`, `violated`, or `unclear` must have a non-empty `hunks` array in the `annotations` dict. Cross-check: for each annotation ID referenced in the `files` array's hunk `annotations` lists, verify the same hunk appears in the annotation's `hunks` array. Flag missing reverse links.
15. **All requirements covered**: Every requirement ID from meta-spec.json appears as a key in `annotations`. Flag missing IDs.
16. **Summary counts match**: The `summary` counts (satisfied + violated + unclear + not_addressed) equal the total number of annotations.
17. **Generated files skipped**: `schema/draft/schema.json` and generated `schema.mdx` should not be major annotation sources — most annotations should reference `.ts` and `.mdx` source files.

## Output

Return a JSON object in your response. Issues are split into two categories so the caller knows which agent to dispatch for fixes:

```json
{
  "verdict": "pass" | "fail",
  "score": "14/17",
  "meta_spec_issues": [
    {
      "check": 1,
      "severity": "error" | "warning",
      "description": "5 requirements have vague summaries not in EARS format",
      "affected": ["CAP-001", "LIF-002", "..."],
      "fix_hint": "Rewrite summaries using When/While/If/Where/The [actor] shall [action] patterns"
    }
  ],
  "annotation_issues": [
    {
      "check": 7,
      "severity": "error" | "warning",
      "description": "12 not_addressed annotations have empty explanations",
      "affected": ["TAD-001", "TAD-002", "AUA-001", "..."],
      "fix_hint": "Add explanations stating why each requirement is not covered (removed feature, behavioral guideline, deferred, etc.)"
    }
  ]
}
```

- **verdict**: `pass` if no errors (warnings are okay), `fail` if any errors exist
- **severity**: `error` = must fix before the review is usable, `warning` = should fix but doesn't block
- **meta_spec_issues**: Problems with `meta-spec.json` (checks 1-6) — these need the meta-spec to be updated before re-annotating
- **annotation_issues**: Problems with `annotations.json` (checks 7-17) — these can be fixed by resuming the reviewer
- **fix_hint**: Actionable instruction the fixing agent can follow
- Only include checks that found issues — omit passing checks
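Checks 15 and 16 are mechanical enough to script. A minimal sketch, under the same schema assumptions the checklist describes (`requirements` list in the meta-spec, `annotations` dict and `summary` counts in the annotation file):

```python
def completeness_issues(meta_spec: dict, annotations: dict) -> list[str]:
    """Flag requirements missing from the annotations dict (check 15)
    and summary counts that disagree with the annotation total
    (check 16). Field names are assumptions, not a documented schema.
    """
    issues = []
    annotated = annotations.get("annotations", {})

    # Check 15: every requirement ID must appear as an annotation key.
    for requirement in meta_spec.get("requirements", []):
        if requirement["id"] not in annotated:
            issues.append(f"check 15: {requirement['id']} has no annotation")

    # Check 16: summary counts must add up to the number of annotations.
    summary = annotations.get("summary", {})
    counted = sum(summary.get(status, 0) for status in
                  ("satisfied", "violated", "unclear", "not_addressed"))
    if counted != len(annotated):
        issues.append(
            f"check 16: summary totals {counted}, "
            f"but {len(annotated)} annotations exist")
    return issues
```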
plugins/mcp-spec/agents/spec-reviewer.md

Lines changed: 42 additions & 0 deletions

@@ -0,0 +1,42 @@

---
name: spec-reviewer
model: sonnet
description: Use this agent to run the full spec annotation workflow for a SEP. It extracts requirements from a SEP, annotates the PR diff against those requirements, and renders an HTML report. Decides dynamically whether to create or update existing annotations.
---

You are a SEP Annotation Agent. Your job is to produce a complete annotated diff artifact for a given SEP number.

**REQUIRED SKILLS:** Load and follow these skills in order:

1. `spec-annotation-workflow` — the end-to-end pipeline (diff resolution, extraction, annotation, rendering)
2. `spec-extract` — requirement extraction format and rules
3. `spec-diff` — per-hunk annotation rules, hunk splitting, and explanation quality
4. `spec-render` — how to invoke the render script
5. `search-mcp-github` — GitHub search patterns, useful when resolving PR metadata

## Behavior

1. You will receive a SEP number (and optionally a mode and commit range)
2. Follow the `spec-annotation-workflow` skill end-to-end
3. If `.reviews/SEP-{n}/meta-spec.json` already exists and mode is not explicitly `review`:
   - Compare its content against the current SEP file
   - If the SEP has changed (different content), re-extract the meta-spec
   - If the SEP is unchanged, reuse the existing meta-spec
4. Always re-annotate the diff (requirements may be the same but the diff may have changed)
5. Always re-render the HTML via the render script

## Being Resumed with QA Issues

You may be resumed by the orchestrator with a list of annotation issues from the `spec-qa` agent. When this happens:

1. Read the issues — each has a `check` number, `description`, `affected` requirement IDs, and a `fix_hint`
2. Load the existing `annotations.json`
3. For each issue, apply the fix described in `fix_hint` to the affected annotations
4. Re-render the HTML via the render script
5. Return a summary of what you fixed

Do not re-run the full pipeline — only fix the specific issues identified.

## Output

Return a summary of the annotation results: counts of satisfied/violated/unclear/not_addressed requirements and the path to the HTML artifact.
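The staleness check in behavior step 3 could be sketched as a content-hash comparison. This is purely illustrative: nothing in the pipeline is documented to store a hash, so `recorded_hash` is a hypothetical stand-in for whatever record of the previous SEP content the meta-spec keeps:

```python
import hashlib
from pathlib import Path


def sep_changed(sep_path: str, recorded_hash: str) -> bool:
    """True if the SEP file's current content differs from a
    previously recorded SHA-256 digest (hypothetical bookkeeping)."""
    current = hashlib.sha256(Path(sep_path).read_bytes()).hexdigest()
    return current != recorded_hash
```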
Lines changed: 80 additions & 0 deletions

@@ -0,0 +1,80 @@

---
name: spec-annotate
description: Orchestrates the full SEP annotation pipeline — extracts requirements, annotates the diff, and renders an HTML artifact
user_invocable: true
arguments:
  - name: sep_number
    description: The SEP number to annotate (e.g., 1686)
    required: true
  - name: mode
    description: '"review" (default) creates fresh annotations; "validator" reuses existing meta-spec if available'
    required: false
  - name: commit_range
    description: "Git commit range for local diff (e.g., abc123..def456). If omitted, fetches the PR diff from GitHub."
    required: false
---

# Annotating a SEP

This skill dispatches the `spec-reviewer` agent, then runs `spec-qa` as a quality gate. If QA fails, it branches based on the issue type: meta-spec issues go through `spec-update`, annotation issues go back to the reviewer.

## Workflow

### Step 1: Review

Launch the `spec-reviewer` agent:

```
Annotate SEP-{sep_number}. Mode: {mode}. {commit_range if provided, else "PR mode."}
```

Save the reviewer's agent ID.

### Step 2: Quality Gate

Launch the `spec-qa` agent:

```
Audit the annotation artifacts for SEP-{sep_number}.
```

If `verdict` is `pass`, skip to Step 5.

### Step 3: Fix meta-spec issues (if any)

If `meta_spec_issues` contains errors:

1. Read the current `.reviews/SEP-{sep_number}/meta-spec.json`
2. For each issue, apply the fix described in `fix_hint` directly to the meta-spec JSON — rewrite summaries to EARS format, fill in missing affected_paths, fix source quotes, etc.
3. Write the updated meta-spec back
4. Since the meta-spec changed, the annotations are now stale — launch a **new** `spec-reviewer` agent in `validator` mode to re-annotate against the fixed meta-spec:

```
Re-annotate SEP-{sep_number}. Mode: validator. {commit_range if provided, else "PR mode."}
The meta-spec was updated to fix QA issues. Re-annotate the diff against it and re-render.
```

Save this new reviewer's agent ID (replacing the old one).

### Step 4: Fix annotation issues (if any)

If `annotation_issues` contains errors (either from the original QA or from a re-run after Step 3):

Resume the `spec-reviewer` agent (using its agent ID) with the issues:

```
The QA agent found these annotation issues. Fix them in annotations.json and re-render:

{paste annotation_issues JSON here}
```

After the reviewer finishes, re-run `spec-qa` to verify. Allow up to 2 total QA rounds — if still failing after 2 fix attempts, report remaining issues to the user rather than looping further.

### Step 5: Report

Once QA passes (or max iterations reached), relay to the user:

- The satisfaction counts
- The artifact path
- The QA score (e.g., "QA: 15/17, 1 warning")
- Any remaining warnings
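The Step 1-4 control flow above amounts to a bounded retry loop. A sketch, where `review`, `qa`, and `fix_meta_spec` are stand-ins for the agent dispatches (not real APIs):

```python
def run_with_qa_gate(review, qa, fix_meta_spec, max_rounds: int = 2) -> dict:
    """Review once, then allow up to `max_rounds` QA-and-fix cycles
    before surfacing the remaining issues to the user.

    `review()` runs or resumes the spec-reviewer; `qa()` returns the
    spec-qa verdict dict; `fix_meta_spec(issues)` patches the meta-spec.
    """
    review()
    verdict = qa()
    rounds = 0
    while verdict["verdict"] != "pass" and rounds < max_rounds:
        if verdict.get("meta_spec_issues"):
            fix_meta_spec(verdict["meta_spec_issues"])
        review()  # re-annotate, or resume the reviewer with the issues
        verdict = qa()
        rounds += 1
    return verdict  # either a pass, or issues to report to the user
```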