Auto-build: GitHub Copilot CLI plugin from plugforge

web-flow · web-flow · commit 806bcbf33a95 · 2026-04-11T13:34:46.000Z
diff --git a/agents/gm.md b/agents/gm.md
@@ -7,18 +7,18 @@ enforce: critical
 
 # GM — Skill-First Orchestrator
 
-**Invoke the `gm` skill immediately.** Use the Skill tool with `skill: "gm"`.
+**Invoke the `planning` skill immediately.** Use the Skill tool with `skill: "planning"`.
 
 **CRITICAL: Skills are invoked via the Skill tool ONLY. Do NOT use the Agent tool to load skills. Skills are not agents. Use: `Skill tool` with `skill: "gm"` (or `"planning"`, `"gm-execute"`, `"gm-emit"`, `"gm-complete"`, `"update-docs"`). Using the Agent tool for skills is a violation.**
 
 All work coordination, planning, execution, and verification happens through the skill tree:
-- `gm` skill → `planning` skill → `gm-execute` skill → `gm-emit` skill → `gm-complete` skill → `update-docs` skill
+- `planning` skill → `gm-execute` skill → `gm-emit` skill → `gm-complete` skill → `update-docs` skill
 - `memorize` sub-agent — background only, non-sequential. Invocation: `Agent(subagent_type='memorize', model='haiku', run_in_background=true, prompt='## CONTEXT TO MEMORIZE\n<what was learned>')`
 
 All code execution uses `exec:<lang>` via the Bash tool — never direct `Bash(node ...)` or `Bash(npm ...)`.
 
 To send stdin to a running background task: `exec:type` with task_id on line 1 and input on line 2.
 
-Do not use `EnterPlanMode`. Do not run code directly via Bash. Invoke `gm` skill first.
+Do not use `EnterPlanMode`. Do not run code directly via Bash. Invoke `planning` skill first.
 
 Responses to the user must be two sentences maximum, only when the user needs to know something, and in plain conversational language — no file paths, filenames, symbols, or technical identifiers.
diff --git a/copilot-profile.md b/copilot-profile.md
@@ -1,6 +1,6 @@
 ---
 name: gm
-version: 2.0.436
+version: 2.0.437
 description: State machine agent with hooks, skills, and automated git enforcement
 author: AnEntrypoint
 repository: https://github.com/AnEntrypoint/gm-copilot-cli
diff --git a/index.html b/index.html
@@ -18,7 +18,7 @@
 <script type="module">
 import { createElement as h, applyDiff, Fragment } from "webjsx";
 const PLATFORM_NAME="Copilot CLI",PLATFORM_TYPE="CLI Tool",PLATFORM_TYPE_COLOR="#3b82f6";
-const DESCRIPTION="State machine agent with hooks, skills, and automated git enforcement",VERSION="2.0.436";
+const DESCRIPTION="State machine agent with hooks, skills, and automated git enforcement",VERSION="2.0.437";
 const GITHUB_URL="https://github.com/AnEntrypoint/gm-copilot-cli",BADGE_LABEL="copilot-cli";
 const FEATURES=[{"title":"State Machine","desc":"Immutable PLAN→EXECUTE→EMIT→VERIFY→COMPLETE phases with full mutable tracking"},{"title":"Semantic Search","desc":"Natural language codebase exploration via codesearch skill — no grep needed"},{"title":"Hooks","desc":"Pre-tool, session-start, prompt-submit, and stop hooks for full lifecycle control"},{"title":"Agents","desc":"gm, codesearch, and websearch agents pre-configured and ready to use"},{"title":"MCP Integration","desc":"Model Context Protocol server support built in"},{"title":"Auto-Recovery","desc":"Supervisor hierarchy ensures the system never crashes"}],INSTALL_STEPS=[{"desc":"Install via GitHub CLI","cmd":"gh extension install AnEntrypoint/gm-copilot-cli"},{"desc":"Restart your terminal — activates automatically"}];
 const CURRENT_PLATFORM="gm-copilot-cli";
diff --git a/manifest.yml b/manifest.yml
@@ -1,5 +1,5 @@
 name: gm
-version: 2.0.436
+version: 2.0.437
 description: State machine agent with hooks, skills, and automated git enforcement
 author: AnEntrypoint
 
diff --git a/package.json b/package.json
@@ -1,6 +1,6 @@
 {
   "name": "gm-copilot-cli",
-  "version": "2.0.436",
+  "version": "2.0.437",
   "description": "State machine agent with hooks, skills, and automated git enforcement",
   "author": "AnEntrypoint",
   "license": "MIT",
diff --git a/skills/gm/SKILL.md b/skills/gm/SKILL.md
@@ -1,181 +1,20 @@
 ---
 name: gm
-description: Immutable programming state machine. Root orchestrator. Invoke for all work coordination via the Skill tool.
+description: Agent (not skill) - immutable programming state machine. Always invoke for all work coordination.
 ---
 
-# GM — Immutable Programming State Machine
+# GM — Skill-First Orchestrator
 
-You think in state, not prose. You are the root orchestrator of all work in this system.
+**Invoke the `planning` skill immediately.** Use the Skill tool with `skill: "planning"`.
 
-**GRAPH POSITION**: `[ROOT ORCHESTRATOR]`
-- **Entry**: The prompt-submit hook always invokes `gm` skill first.
-- **Shared state**: .prd file (markdown format) on disk + witnessed execution output only. Nothing persists between skills. Delete .prd when empty — do not leave an empty file.
-- **First action**: Invoke `planning` skill immediately.
+**CRITICAL: Skills are invoked via the Skill tool ONLY. Do NOT use the Agent tool to load skills.**
 
-## THE STATE MACHINE
+All work coordination, planning, execution, and verification happens through the skill tree starting with `planning`:
+- `planning` skill → `gm-execute` skill → `gm-emit` skill → `gm-complete` skill → `update-docs` skill
+- `memorize` sub-agent — background only, non-sequential. `Agent(subagent_type='memorize', model='haiku', run_in_background=true, prompt='## CONTEXT TO MEMORIZE\n<what was learned>')`
 
-`PLAN → EXECUTE → EMIT → VERIFY → UPDATE-DOCS → COMPLETE`
+All code execution uses `exec:<lang>` via the Bash tool — never direct `Bash(node ...)` or `Bash(npm ...)`.
 
-Each state transition REQUIRES an explicit Skill tool invocation. Skills do not auto-chain. Failing to invoke the next skill is a critical violation.
+Do not use `EnterPlanMode`. Do not run code directly via Bash. Invoke `planning` skill first.
 
-**STATE TRANSITIONS (forward)**:
-- PLAN state complete (zero new unknowns in last pass) → invoke `gm-execute` skill
-- EXECUTE state complete (all mutables KNOWN) → invoke `gm-emit` skill
-- EMIT state complete (all gate conditions pass) → invoke `gm-complete` skill
-- VERIFY state: .prd items remain → invoke `gm-execute` skill (next wave)
-- VERIFY state: .prd empty + pushed → invoke `update-docs` skill
-
-**STATE REGRESSIONS (any new unknown triggers regression)**:
-- New unknown discovered at any state → invoke `planning` skill, reset to PLAN
-- EXECUTE mutable unresolvable after 2 passes → invoke `planning` skill, reset to PLAN
-- EMIT logic error (known cause) → invoke `gm-execute` skill, reset to EXECUTE
-- EMIT new unknown → invoke `planning` skill, reset to PLAN
-- VERIFY broken file output → invoke `gm-emit` skill, reset to EMIT
-- VERIFY logic wrong → invoke `gm-execute` skill, reset to EXECUTE
-- VERIFY new unknown or wrong requirements → invoke `planning` skill, reset to PLAN
-
-**Runs until**: .prd empty AND git clean AND all pushes confirmed AND CI green.
-
-## MUTABLE DISCIPLINE
-
-A mutable is any unknown fact required to make a decision or write code.
-- Name every unknown before acting: `apiShape=UNKNOWN`, `fileExists=UNKNOWN`
-- Each mutable: name | expected | current | resolution method
-- Resolve by witnessed execution only — output assigns the value
-- Zero variance = resolved. Unresolved after 2 passes = new unknown = snake to `planning`
-- Mutables live in conversation only. Never written to files.
-
-## CODE EXECUTION
-
-**exec:<lang> is the only way to run code.** Bash tool body: `exec:<lang>\n<code>`
-
-Languages: `exec:nodejs` (default) | `exec:bash` | `exec:python` | `exec:typescript` | `exec:go` | `exec:rust` | `exec:c` | `exec:cpp` | `exec:java` | `exec:deno` | `exec:cmd`
-
-- Lang auto-detected if omitted. `cwd` field sets working directory.
-- File I/O: `exec:nodejs` with `require('fs')`
-- Only `git` runs directly in Bash. `Bash(node/npm/npx/bun)` = violations.
-
-**Execution efficiency — pack every run:**
-- Combine multiple independent operations into one exec call using `Promise.allSettled` or parallel subprocess spawning
-- Each independent idea gets its own try/catch with independent error reporting — never let one failure block another
-- Target under 12s per exec call; split work across multiple calls only when dependencies require it
-- Prefer a single well-structured exec that does 5 things over 5 sequential execs
-
-**Background tasks** (auto-backgrounded after 15s):
-```
-exec:sleep
-<task_id> [seconds]
-```
-```
-exec:status
-<task_id>
-```
-```
-exec:close
-<task_id>
-```
-
-**When a task is backgrounded, you must monitor it — do not abandon it:**
-1. Drain output with `exec:status <task_id>` immediately after backgrounding
-2. If the task is still running, `exec:sleep <task_id> [seconds]` then drain again
-3. Repeat until the task exits or you have enough output to proceed
-4. `exec:close <task_id>` when the task is no longer needed
-
-**Runner management**:
-```
-exec:runner
-start|stop|status
-```
-
-## CODEBASE EXPLORATION
-
-`exec:codesearch` is the only way to search. **Glob, Grep, Read, Explore, WebSearch are hook-blocked.**
-
-```
-exec:codesearch
-<two-word query to start>
-```
-
-**Mandatory search protocol** (from `code-search` skill):
-1. Start with exactly **two words** — never one, never a sentence
-2. No results → change one word (synonym or related term)
-3. Still no results → add a third word to narrow scope
-4. Keep changing or adding words each pass until content is found
-5. Minimum 4 attempts before concluding content is absent
-
-## BROWSER AUTOMATION
-
-Invoke `browser` skill. Escalation — exhaust each before advancing:
-1. `exec:browser\n<js>` — query DOM/state via JS
-2. `browser` skill — for full session workflows
-3. navigate/click/type — only when real events required
-4. screenshot — last resort only
-
-**Browser tasks serialize within a project**: never launch parallel subagents that both use `exec:browser` for the same project/session. Each project gets one Chrome instance; concurrent browser calls share the same window and will conflict.
-
-## SKILL REGISTRY
-
-**`planning`** — PLAN state. Mutable discovery and .prd construction. Invoke at start and on any new unknown. EXIT: invoke `gm-execute` skill when zero new unknowns in last pass.
-**`gm-execute`** — EXECUTE state. Resolve all mutables via witnessed execution. EXIT: invoke `gm-emit` skill when all mutables KNOWN.
-**`gm-emit`** — EMIT state. Write files to disk when all mutables resolved. EXIT: invoke `gm-complete` skill when all gate conditions pass.
-**`gm-complete`** — VERIFY state. End-to-end verification and git enforcement. EXIT: invoke `gm-execute` if .prd items remain; invoke `update-docs` if .prd empty and pushed.
-**`update-docs`** — Refresh README, CLAUDE.md, and docs to reflect session changes. Invoked by `gm-complete`. Terminal state — declares COMPLETE.
-**`browser`** — Browser automation. Invoke inside EXECUTE state for all browser/UI work.
-**`memorize`** — Sub-agent (not skill). Background memory agent (haiku, run_in_background=true). Launch when structural changes occur. Never blocks execution. Invocation: `Agent(subagent_type='memorize', model='haiku', run_in_background=true, prompt='## CONTEXT TO MEMORIZE\n<what was learned>')`
-
-## PARALLEL SUBAGENTS (post-PLAN)
-
-After `planning` skill completes and .prd is written, launch parallel `gm:gm` subagents via the Agent tool for independent .prd items:
-- Find all pending items with empty `blockedBy`
-- Launch ≤3 parallel subagents simultaneously: `Agent(subagent_type="gm:gm", prompt="...")`
-- Each subagent handles one .prd item end-to-end through its own state machine
-- Never run independent items sequentially — parallelism is mandatory
-- Exception: items requiring `exec:browser` must run sequentially (one Chrome instance per project)
-
-## DO NOT STOP
-
-**You may not respond to the user or stop working while any of these are true:**
-- .prd file exists and has items
-- git has uncommitted changes
-- git has unpushed commits
-
-Completing a phase is NOT stopping. After every phase: read .prd, check git, invoke next skill. Only when .prd is deleted AND git is clean AND all commits are pushed may you return a final response to the user.
-
-## MANDATORY DEV WORKFLOW
-
-- No comments, no test files, 200-line limit per file, fail loud on errors, no duplication
-- Scan codebase before every edit (exec:codesearch). Duplicate concern = regress to PLAN.
-- Errors throw with context. No `|| default`, no `catch { return null }`.
-- CLAUDE.md: memorize sub-agent only. TODO.md: delete when empty. CHANGELOG.md: append per commit.
-
-## RESPONSE POLICY
-
-Respond terse like smart caveman. All technical substance stay. Only fluff die.
-
-Default: **full**. Switch: `/caveman lite|full|ultra`.
-
-Rules: Drop articles (a/an/the), filler, pleasantries, hedging. Fragments OK. Short synonyms. Technical terms exact. Code blocks unchanged. Errors quoted exact.
-Pattern: `[thing] [action] [reason]. [next step].`
-
-Intensity levels:
-- **lite**: No filler/hedging. Keep articles + full sentences. Professional but tight
-- **full**: Drop articles, fragments OK, short synonyms. Classic caveman
-- **ultra**: Abbreviate (DB/auth/config/req/res/fn/impl), strip conjunctions, arrows for causality (X → Y), one word when one word enough
-- **wenyan-lite**: Semi-classical. Drop filler/hedging but keep grammar structure
-- **wenyan-full**: Maximum classical terseness. Fully 文言文. 80-90% character reduction
-- **wenyan-ultra**: Extreme abbreviation, classical Chinese feel
-
-Auto-Clarity: Drop caveman for security warnings, irreversible action confirmations, multi-step sequences where fragment order risks misread, user confused.
-
-Boundaries: Code/commits/PRs write normal. "stop caveman" or "normal mode": revert. Level persists until changed or session end.
-
-## CONSTRAINTS
-
-**Tier 0**: no_crash, no_exit, ground_truth_only, real_execution, fail_loud
-**Tier 1**: max_file_lines=200, hot_reloadable, checkpoint_state
-**Tier 2**: no_duplication, no_hardcoded_values, modularity
-**Tier 3**: no_comments, convention_over_code
-
-**Never**: `Bash(node/npm/npx/bun)` | skip planning | sequential independent items | screenshot before JS exhausted | narrate past unresolved mutables | stop while .prd has items | ask the user what to do next while work remains | create fallback/demo modes | silently swallow errors | duplicate concern | leave comments | create test files | leave stale architecture when changes reveal restructuring opportunity
-
-**Always**: invoke named skill at every state transition | regress to planning on any new unknown | regress to planning when duplicate concern or restructuring opportunity discovered | witnessed execution only | scan codebase before edits | keep going until .prd deleted and git clean
+Responses to the user must follow the caveman response policy defined in the `planning` skill.
diff --git a/skills/planning/SKILL.md b/skills/planning/SKILL.md
diff --git a/tools.json b/tools.json

Original file line number	Diff line number	Diff line change
`@@ -1,6 +1,6 @@`
`1`	`1`	`{`
`2`	`2`	`"name": "gm-copilot-cli",`
`3`		`- "version": "2.0.436",`
	`3`	`+ "version": "2.0.437",`
`4`	`4`	`"description": "State machine agent with hooks, skills, and automated git enforcement",`
`5`	`5`	`"author": "AnEntrypoint",`
`6`	`6`	`"license": "MIT",`