IronAdamant
diff --git a/‎CLAUDE.md‎
Lines changed: 12 additions & 6 deletions b/‎CLAUDE.md‎
Lines changed: 12 additions & 6 deletions
diff --git a/‎chisel/cli.py‎
Lines changed: 38 additions & 0 deletions b/‎chisel/cli.py‎
Lines changed: 38 additions & 0 deletions
diff --git a/‎chisel/engine.py‎
Lines changed: 25 additions & 0 deletions b/‎chisel/engine.py‎
Lines changed: 25 additions & 0 deletions
diff --git a/‎chisel/impact.py‎
Lines changed: 11 additions & 0 deletions b/‎chisel/impact.py‎
Lines changed: 11 additions & 0 deletions
diff --git a/‎chisel/mcp_server.py‎
Lines changed: 8 additions & 3 deletions b/‎chisel/mcp_server.py‎
Lines changed: 8 additions & 3 deletions
diff --git a/‎chisel/mcp_stdio.py‎
Lines changed: 5 additions & 2 deletions b/‎chisel/mcp_stdio.py‎
Lines changed: 5 additions & 2 deletions
diff --git a/‎chisel/next_steps.py‎
Lines changed: 169 additions & 0 deletions b/‎chisel/next_steps.py‎
Lines changed: 169 additions & 0 deletions
@@ -14,10 +14,11 @@ chisel/
   metrics.py        — Pure computation: churn scoring, ownership aggregation, co-change detection. _parse_iso_date shared utility.
   test_mapper.py    — Test file discovery, framework detection, dependency extraction, edge building.
   impact.py         — Impact analysis, risk scoring, stale test detection, reviewer suggestions. Caches failure rates.
-  cli.py            — argparse CLI (17 subcommands). _run_tool() shared handler. Entry point: chisel.cli:main
-  schemas.py        — JSON Schema definitions for all 15 tools + dispatch table. Shared by HTTP and stdio servers.
+  cli.py            — argparse CLI (18 subcommands). _run_tool() shared handler. Entry point: chisel.cli:main
+  schemas.py        — JSON Schema definitions for all 16 tools + dispatch table. Shared by HTTP and stdio servers.
   mcp_server.py     — HTTP MCP server (GET /tools, /health, POST /call). ThreadedHTTPServer. dispatch_tool() shared by both servers.
   mcp_stdio.py      — stdio MCP server (requires optional 'mcp' package). _configure_server() for engine lifecycle mgmt.
+  next_steps.py     — Contextual next-step suggestions for MCP tool responses. compute_next_steps() dispatched per tool.
   rwlock.py         — Read-write lock for in-process concurrent access.
 ```
 
@@ -44,7 +45,10 @@ chisel/
 - **Unit-churn scaling**: `_UNIT_CHURN_FILE_LIMIT = 2000` in `engine.py`. Repos with more than 2000 code files skip per-function `git log -L` churn (each function spawns a subprocess). File-level churn is always computed. Validated on Grafana (21k files, 62k units in ~3 min).
 - **Numstat validation**: `_parse_log_output` in `git_analyzer.py` validates tab-separated fields are digits or `-` before treating them as numstat. Diff lines with tabs were being misidentified as numstat entries in `git log -L` output.
 - **Encoding safety**: All `subprocess.run()` calls use `encoding="utf-8", errors="replace"`. Git history may contain non-UTF-8 bytes (Latin-1 commit messages, binary diff fragments); these are replaced with `�` instead of crashing. File reads in `engine.py` and `test_mapper.py` already used `errors="replace"`.
-- **Empty-state detection**: All 11 query tools return `{"status": "no_data", "message": "...", "hint": "chisel analyze"}` when the DB has no analysis data, instead of `[]`. `_check_analysis_data()` in `engine.py` calls `storage.has_analysis_data()` (`SELECT 1 FROM code_units LIMIT 1`). Write tools (`analyze`, `update`, `record_result`) and `stats` are unaffected. `stats` adds a `hint` key when all counts are zero. CLI detects this via `_is_no_data()` in `cli.py`.
+- **Empty-state detection**: All 12 query tools return `{"status": "no_data", "message": "...", "hint": "chisel analyze"}` when the DB has no analysis data, instead of `[]`. `_check_analysis_data()` in `engine.py` calls `storage.has_analysis_data()` (`SELECT 1 FROM code_units LIMIT 1`). Write tools (`analyze`, `update`, `record_result`) and `stats` are unaffected. `stats` adds a `hint` key when all counts are zero. CLI detects this via `_is_no_data()` in `cli.py`.
+- **Next-step suggestions**: `next_steps.py` provides `compute_next_steps(tool_name, result)` which returns contextual follow-up suggestions per tool. Integrated at the dispatch level in `mcp_server.py` — HTTP responses include `"next_steps": [...]` as a sibling to `"result"`, stdio wraps both in a `{"result": ..., "next_steps": [...]}` envelope. CLI is unaffected. Only tools with registered hint functions get suggestions; others return empty.
+- **Inline coupling partners**: `risk_map` includes `"coupling_partners"` (top 3 by co-commit count) in each file entry alongside the breakdown. Data is already fetched in the batch query — no extra DB calls.
+- **Triage tool**: Composite `triage` runs `risk_map` (top-N) + `test_gaps` (filtered to top-N files) + `stale_tests` in a single read lock. Returns a dict, not a list, so `limit` is not injected.
 
 ## Dev Commands
 
@@ -66,13 +70,14 @@ impact.py → metrics.py
 metrics.py → (no internal deps)
 cli.py → engine.py, mcp_server.py, mcp_stdio.py
 schemas.py → (no internal deps)
-mcp_server.py → engine.py, schemas.py
+mcp_server.py → engine.py, next_steps.py, schemas.py
 mcp_stdio.py → engine.py, mcp_server.py, schemas.py
+next_steps.py → (no internal deps)
 ```
 
-## 15 MCP Tools
+## 16 MCP Tools
 
-`analyze`, `impact`, `suggest_tests`, `churn`, `ownership`, `coupling`, `risk_map`, `stale_tests`, `history`, `who_reviews`, `diff_impact`, `update`, `test_gaps`, `record_result`, `stats`
+`analyze`, `impact`, `suggest_tests`, `churn`, `ownership`, `coupling`, `risk_map`, `stale_tests`, `history`, `who_reviews`, `diff_impact`, `update`, `test_gaps`, `record_result`, `stats`, `triage`
 
 Each wired through: engine.tool_*() → CLI subcommand, HTTP POST /call, stdio MCP.
 
@@ -81,5 +86,6 @@ Each wired through: engine.tool_*() → CLI subcommand, HTTP POST /call, stdio M
 - **`test_gaps`**: Finds code units with zero test coverage, prioritized by churn risk. Excludes test files by default.
 - **`record_result`**: Records test pass/fail outcomes. Feeds into `suggest_tests` (failure rate boost) and `risk_map` (test instability component).
 - **`stats`**: Returns summary counts for all database tables (code units, tests, edges, commits, etc.).
+- **`triage`**: Combined risk_map + test_gaps + stale_tests for top-N riskiest files. Single command for pre-audit/refactor prioritization. Returns `{top_risk_files, test_gaps, stale_tests, summary}`.
 - **`limit` parameter**: All list-returning tools accept `limit` to cap result size.
 - **Adaptive coupling threshold**: `max(3, total_commits // 4)` — scales with project maturity.
@@ -133,6 +133,14 @@ def create_parser():
     sub.add_parser("stats", parents=[shared],
                    help="Show database summary counts")
 
+    # triage
+    p_triage = sub.add_parser("triage", parents=[shared],
+                               help="Combined risk + gap + stale triage")
+    p_triage.add_argument("directory", nargs="?", default=None,
+                           help="Directory to scope (default: all)")
+    p_triage.add_argument("--top-n", type=int, default=10,
+                           help="Number of top-risk files (default: 10)")
+
     # serve
     p_serve = sub.add_parser("serve", parents=[shared],
                              help="Start HTTP server")
@@ -332,6 +340,35 @@ def fmt(_result, args):
                      fmt, use_limit=False)
 
 
+def cmd_triage(args):
+    def fmt(result, _args):
+        summary = result["summary"]
+        print(f"Triage ({summary['files_triaged']} files):")
+        print("\nTop risk files:")
+        for r in result["top_risk_files"]:
+            partners = ""
+            cp = r.get("coupling_partners", [])
+            if cp:
+                names = [p["file"] for p in cp[:2]]
+                partners = f"  coupled: {', '.join(names)}"
+            print(f"  {r['file_path']}: {r['risk_score']}{partners}")
+        if result["test_gaps"]:
+            print(f"\nTest gaps ({summary['total_test_gaps']}):")
+            for g in result["test_gaps"]:
+                print(f"  {g['file_path']}:{g['name']} ({g['unit_type']})")
+        else:
+            print("\nNo test gaps in triaged files.")
+        if result["stale_tests"]:
+            print(f"\nStale tests ({summary['total_stale_tests']}):")
+            for s in result["stale_tests"]:
+                print(f"  {s['test_id']}  ({s['edge_type']})")
+        else:
+            print("\nNo stale tests found.")
+    return _run_tool(args, "tool_triage",
+                     {"directory": args.directory, "top_n": args.top_n},
+                     fmt, use_limit=False)
+
+
 def cmd_stats(args):
     return _run_tool(args, "tool_stats", {},
                      _fmt_kv("Chisel database stats:"), use_limit=False)
@@ -381,6 +418,7 @@ def cmd_serve_mcp(args):
     "update": cmd_update,
     "test-gaps": cmd_test_gaps,
     "record-result": cmd_record_result,
+    "triage": cmd_triage,
     "stats": cmd_stats,
     "serve": cmd_serve,
     "serve-mcp": cmd_serve_mcp,
 
@@ -296,6 +296,31 @@ def tool_record_result(self, test_id, passed, duration_ms=None):
                 self.storage.record_test_result(test_id, passed, duration_ms)
                 return {"test_id": test_id, "passed": passed, "recorded": True}
 
+    def tool_triage(self, directory=None, top_n=10):
+        """MCP tool: combined risk_map + test_gaps + stale_tests triage."""
+        with self._process_lock.shared():
+            with self.lock.read_lock():
+                empty = self._check_analysis_data()
+                if empty is not None:
+                    return empty
+                risk_map = self.impact.get_risk_map(directory)[:top_n]
+                test_gaps = self.impact.get_test_gaps(directory=directory)
+                stale = self.impact.detect_stale_tests()
+
+                top_files = {r["file_path"] for r in risk_map}
+                relevant_gaps = [g for g in test_gaps if g["file_path"] in top_files]
+
+                return {
+                    "top_risk_files": risk_map,
+                    "test_gaps": relevant_gaps,
+                    "stale_tests": stale,
+                    "summary": {
+                        "files_triaged": len(risk_map),
+                        "total_test_gaps": len(relevant_gaps),
+                        "total_stale_tests": len(stale),
+                    },
+                }
+
     def tool_stats(self):
         """MCP tool: get summary counts for the Chisel database."""
         with self._process_lock.shared():
 
@@ -267,6 +267,16 @@ def get_risk_map(self, directory=None):
             co_changes = co_changes_batch.get(fp, [])
             coupling_norm = min(len(co_changes) / 10.0, 1.0)
 
+            # Top 3 coupling partners (by co-commit count, desc)
+            sorted_cc = sorted(co_changes, key=lambda c: c["co_commit_count"], reverse=True)[:3]
+            coupling_partners = [
+                {
+                    "file": cc["file_b"] if cc["file_a"] == fp else cc["file_a"],
+                    "co_commits": cc["co_commit_count"],
+                }
+                for cc in sorted_cc
+            ]
+
             code_units = code_units_batch.get(fp, [])
             tested_count = 0
             covering_test_ids = set()
@@ -295,6 +305,7 @@ def get_risk_map(self, directory=None):
                 "file_path": fp,
                 "unit_name": None,
                 "risk_score": round(risk, 4),
+                "coupling_partners": coupling_partners,
                 "breakdown": {
                     "churn": round(churn_norm, 4),
                     "coupling": round(coupling_norm, 4),
 
@@ -14,6 +14,7 @@
 from socketserver import ThreadingMixIn
 
 from chisel.engine import ChiselEngine
+from chisel.next_steps import compute_next_steps
 from chisel.schemas import _TOOL_DISPATCH, _TOOL_SCHEMAS
 
 logger = logging.getLogger(__name__)
@@ -36,7 +37,8 @@ def dispatch_tool(engine, tool_name, arguments):
     result = getattr(engine, method_name)(**kwargs)
     if limit is not None and isinstance(result, list):
         result = result[:int(limit)]
-    return result
+    next_steps = compute_next_steps(tool_name, result)
+    return result, next_steps
 
 
 # ------------------------------------------------------------------ #
@@ -127,8 +129,11 @@ def _handle_call(self):
             return
 
         try:
-            result = dispatch_tool(self.server.engine, tool_name, arguments)
-            self._send_json({"result": result})
+            result, next_steps = dispatch_tool(self.server.engine, tool_name, arguments)
+            response = {"result": result}
+            if next_steps:
+                response["next_steps"] = next_steps
+            self._send_json(response)
         except ValueError as exc:
             self._send_error_json(404, str(exc))
         except TypeError as exc:
 
@@ -59,14 +59,17 @@ async def call_tool(name: str, arguments: dict):
         """Dispatch an MCP tool call to the appropriate engine method."""
         try:
             loop = asyncio.get_running_loop()
-            result = await loop.run_in_executor(
+            result, next_steps = await loop.run_in_executor(
                 None, lambda: dispatch_tool(engine, name, arguments),
             )
         except Exception as exc:
             logger.exception("Error executing tool %s", name)
             return [TextContent(type="text", text=f"Error: {exc}")]
 
-        text = json.dumps(result, indent=2, default=str)
+        payload = {"result": result}
+        if next_steps:
+            payload["next_steps"] = next_steps
+        text = json.dumps(payload, indent=2, default=str)
         return [TextContent(type="text", text=text)]
 
     return server
 
@@ -0,0 +1,169 @@
+"""Contextual next-step suggestions for MCP tool responses.
+
+Computes follow-up tool suggestions based on what a tool returned,
+so LLM agents know what to invoke next. Only used by MCP servers
+(HTTP and stdio), not the CLI.
+"""
+
+
+def compute_next_steps(tool_name, result):
+    """Return a list of next-step suggestion strings for a tool result.
+
+    Args:
+        tool_name: Name of the tool that produced the result.
+        result: The tool's return value (dict or list).
+
+    Returns:
+        List of strings, each a brief actionable suggestion. Empty list
+        if no suggestions apply.
+    """
+    fn = _TOOL_HINTS.get(tool_name)
+    if fn is None:
+        return []
+    return fn(result)
+
+
+# ------------------------------------------------------------------ #
+# Per-tool hint functions
+# ------------------------------------------------------------------ #
+
+def _hints_analyze(result):
+    if isinstance(result, dict) and "code_files_scanned" in result:
+        return [
+            "Run 'risk_map' to identify high-risk files.",
+            "Run 'test_gaps' to find untested code.",
+            "Run 'triage' for a combined risk + gap + stale overview.",
+        ]
+    return []
+
+
+def _hints_update(result):
+    if isinstance(result, dict) and result.get("files_updated", 0) > 0:
+        return [
+            "Run 'diff_impact' to see which tests are affected by the changes.",
+            "Run 'risk_map' to check updated risk scores.",
+        ]
+    return []
+
+
+def _hints_risk_map(result):
+    if isinstance(result, list) and result:
+        top = result[:3]
+        files = [r["file_path"] for r in top]
+        steps = [
+            "Run 'test_gaps' to find missing test coverage for high-risk files.",
+        ]
+        # Suggest coupling drilldown for files with high coupling scores
+        high_coupling = [
+            r["file_path"] for r in top
+            if r.get("breakdown", {}).get("coupling", 0) > 0.3
+        ]
+        if high_coupling:
+            steps.append(
+                f"Run 'coupling {high_coupling[0]}' to see co-change partners."
+            )
+        # Suggest churn drilldown for high-churn files
+        high_churn = [
+            r["file_path"] for r in top
+            if r.get("breakdown", {}).get("churn", 0) > 0.5
+        ]
+        if high_churn:
+            steps.append(
+                f"Run 'churn {high_churn[0]}' for detailed change history."
+            )
+        steps.append(
+            f"Run 'suggest_tests {files[0]}' for test recommendations on the riskiest file."
+        )
+        return steps
+    if isinstance(result, list):
+        return ["Run 'analyze' to populate risk data."]
+    return []
+
+
+def _hints_diff_impact(result):
+    if isinstance(result, list) and result:
+        return [
+            "Run the listed tests to verify your changes.",
+            "Use 'record_result' to log outcomes for future prioritization.",
+            "Run 'coupling' on changed files to check for hidden dependents.",
+        ]
+    if isinstance(result, list):
+        return [
+            "Run 'test_gaps' to check if new code needs tests.",
+            "Run 'update' if you've made changes since last analysis.",
+        ]
+    return []
+
+
+def _hints_test_gaps(result):
+    if isinstance(result, list) and result:
+        top_file = result[0]["file_path"]
+        return [
+            "Write tests for the highest-churn untested units first.",
+            f"Run 'churn {top_file}' to see change frequency.",
+            f"Run 'ownership {top_file}' to find who can help write tests.",
+        ]
+    if isinstance(result, list):
+        return ["All code units have test coverage."]
+    return []
+
+
+def _hints_stale_tests(result):
+    if isinstance(result, list) and result:
+        return [
+            "Update or remove the stale tests listed above.",
+            "Run 'update' to re-analyze after fixing test files.",
+        ]
+    if isinstance(result, list):
+        return ["All tests reference current code."]
+    return []
+
+
+def _hints_impact(result):
+    if isinstance(result, list) and result:
+        return [
+            "Run the impacted tests to verify correctness.",
+            "Use 'record_result' to log outcomes for future prioritization.",
+        ]
+    return []
+
+
+def _hints_suggest_tests(result):
+    if isinstance(result, list) and result:
+        return [
+            "Run the suggested tests in order of relevance.",
+            "Use 'record_result' to log outcomes for future prioritization.",
+        ]
+    return []
+
+
+def _hints_triage(result):
+    if isinstance(result, dict) and "summary" in result:
+        steps = []
+        if result["summary"].get("total_test_gaps", 0) > 0:
+            steps.append(
+                "Focus on files appearing in both risk and gap sections."
+            )
+        if result["top_risk_files"]:
+            top = result["top_risk_files"][0]["file_path"]
+            steps.append(f"Run 'suggest_tests {top}' on the highest-risk file.")
+            steps.append(f"Run 'ownership {top}' to find who owns the riskiest code.")
+        return steps
+    return []
+
+
+# ------------------------------------------------------------------ #
+# Dispatch table
+# ------------------------------------------------------------------ #
+
+_TOOL_HINTS = {
+    "analyze": _hints_analyze,
+    "update": _hints_update,
+    "risk_map": _hints_risk_map,
+    "diff_impact": _hints_diff_impact,
+    "test_gaps": _hints_test_gaps,
+    "stale_tests": _hints_stale_tests,
+    "impact": _hints_impact,
+    "suggest_tests": _hints_suggest_tests,
+    "triage": _hints_triage,
+}