docs/home/prompting-guide.md (3 additions, 3 deletions)
@@ -654,14 +654,14 @@ powerful optimizers find solutions that technically satisfy the
 objective but are practically useless.
 
 **CLI commands as prompts** ("*Run `ctx status`*") interleave
-*reasoning with acting* — the model thinks, acts on external tools,
+*reasoning with acting*: The model thinks, acts on external tools,
 observes results, then thinks again. Grounding reasoning in real
 tool output reduces hallucination because the model can't ignore
 evidence it just retrieved.
 <br>Yao et al., [ReAct: Synergizing Reasoning and Acting in Language Models](https://arxiv.org/abs/2210.03629) (2022).
 
 **Task decomposition** ("*Prompts by Task Type*") applies
-*least-to-most prompting* — breaking a complex problem into
+*least-to-most prompting*: Breaking a complex problem into
 subproblems and solving them sequentially, each building on the last.
 This is the research version of "plan, then implement one slice."
 <br>Zhou et al., [Least-to-Most Prompting Enables Complex Reasoning in Large Language Models](https://arxiv.org/abs/2205.10625) (2022).
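The ReAct pattern touched by the hunk above (think, act on a tool, observe, think again) can be sketched in a few lines. This is a minimal illustrative loop, not the paper's implementation: `call_model` and `run_tool` are stand-ins for a real LLM call and a real CLI invocation, and the canned trajectory is an assumption for demonstration.

```python
def call_model(transcript):
    # Placeholder for an LLM call: returns the model's next step.
    # A short canned trajectory stands in for real model output.
    canned = [
        ("think", "I should check the repo state first."),
        ("act", "ctx status"),
        ("think", "The status output looks clean; I can finish."),
        ("finish", "No pending changes."),
    ]
    # Count only the model's own turns, not injected observations.
    turn = sum(1 for kind, _ in transcript if kind != "observe")
    return canned[turn]

def run_tool(command):
    # Placeholder for executing a CLI tool such as `ctx status`.
    return f"(output of `{command}`)"

def react_loop(max_steps=8):
    # Interleave reasoning ("think") with tool use ("act"), feeding each
    # tool observation back into the transcript the model sees next.
    transcript = []
    for _ in range(max_steps):
        kind, content = call_model(transcript)
        transcript.append((kind, content))
        if kind == "act":
            # Ground the next reasoning step in real tool output.
            transcript.append(("observe", run_tool(content)))
        elif kind == "finish":
            return content, transcript
    return None, transcript

answer, trace = react_loop()
print(answer)  # prints "No pending changes."
```

The key design point is that observations enter the transcript verbatim, so the model's next reasoning step cannot ignore evidence it just retrieved.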
@@ -674,7 +674,7 @@ understanding the problem.
 <br>Wang et al., [Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language Models](https://arxiv.org/abs/2305.04091) (2023).
 
 **Session reflection** ("*What did we learn?*", `/ctx-reflect`) is
-a form of *verbal reinforcement learning* — improving future
+a form of *verbal reinforcement learning*: Improving future
 performance by persisting linguistic feedback as memory rather than
 updating weights. This is exactly what `LEARNINGS.md` and
 `DECISIONS.md` provide: a durable feedback signal across sessions.
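The "verbal reinforcement learning" idea in the hunk above can be sketched as persisting feedback as text rather than as weight updates. This is a hypothetical sketch: the `reflect` and `build_prompt` helpers and the prompt shape are assumptions, with only the `LEARNINGS.md` file name taken from the doc.

```python
from pathlib import Path

LEARNINGS = Path("LEARNINGS.md")  # durable store, mirroring the doc's LEARNINGS.md

def reflect(session_note):
    # Distill the session's feedback into a line of text and persist it,
    # instead of updating any model weights.
    lesson = f"- {session_note}"
    with LEARNINGS.open("a", encoding="utf-8") as f:
        f.write(lesson + "\n")
    return lesson

def build_prompt(task):
    # A later session reads the accumulated lessons back into context,
    # giving a feedback signal that survives across sessions.
    lessons = LEARNINGS.read_text(encoding="utf-8") if LEARNINGS.exists() else ""
    return f"Lessons from earlier sessions:\n{lessons}\nTask: {task}"

reflect("Run the test suite before claiming a fix works.")
print(build_prompt("fix the flaky CI job"))
```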
site/home/prompting-guide/index.html (3 additions, 3 deletions)
@@ -4570,13 +4570,13 @@ <h2 id="why-do-these-approaches-work">Why Do These Approaches Work?<a class="hea
 powerful optimizers find solutions that technically satisfy the
 objective but are practically useless.</p>
 <p><strong>CLI commands as prompts</strong> ("<em>Run <code>ctx status</code></em>") interleave
-<em>reasoning with acting</em> — the model thinks, acts on external tools,
+<em>reasoning with acting</em>: The model thinks, acts on external tools,
 observes results, then thinks again. Grounding reasoning in real
 tool output reduces hallucination because the model can't ignore
 evidence it just retrieved.
 <br>Yao et al., <a href="https://arxiv.org/abs/2210.03629">ReAct: Synergizing Reasoning and Acting in Language Models</a> (2022).</p>
 <p><strong>Task decomposition</strong> ("<em>Prompts by Task Type</em>") applies
-<em>least-to-most prompting</em> — breaking a complex problem into
+<em>least-to-most prompting</em>: Breaking a complex problem into
 subproblems and solving them sequentially, each building on the last.
 This is the research version of "plan, then implement one slice."
 <br>Zhou et al., <a href="https://arxiv.org/abs/2205.10625">Least-to-Most Prompting Enables Complex Reasoning in Large Language Models</a> (2022).</p>
@@ -4587,7 +4587,7 @@ <h2 id="why-do-these-approaches-work">Why Do These Approaches Work?<a class="hea
 understanding the problem.
 <br>Wang et al., <a href="https://arxiv.org/abs/2305.04091">Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language Models</a> (2023).</p>
 <p><strong>Session reflection</strong> ("<em>What did we learn?</em>", <code>/ctx-reflect</code>) is
-a form of <em>verbal reinforcement learning</em> — improving future
+a form of <em>verbal reinforcement learning</em>: Improving future
 performance by persisting linguistic feedback as memory rather than
 updating weights. This is exactly what <code>LEARNINGS.md</code> and
 <code>DECISIONS.md</code> provide: a durable feedback signal across sessions.
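The least-to-most prompting described in the task-decomposition paragraphs above can also be sketched: decompose a problem into subproblems and solve them sequentially, each with all earlier answers in context. This is a minimal illustration under stated assumptions: `ask_model` and `decompose` are stand-ins (in the paper, the model itself proposes the decomposition), not a real API.

```python
def ask_model(prompt):
    # Placeholder LLM call; echoes the most recent subproblem so the
    # sequential structure of the loop is visible in the output.
    return f"answer({prompt.splitlines()[-1]})"

def decompose(problem):
    # Stand-in decomposition; in practice the model proposes these.
    return [f"step {i} of {problem}" for i in (1, 2, 3)]

def least_to_most(problem):
    context = f"Problem: {problem}"
    answers = []
    for sub in decompose(problem):
        # Each subproblem is asked with every earlier answer in context,
        # so later steps build on earlier ones.
        context += f"\n{sub}"
        answer = ask_model(context)
        answers.append(answer)
        context += f"\n{answer}"
    return answers[-1]  # the final subproblem's answer solves the task

print(least_to_most("migrate the config parser"))
# prints "answer(step 3 of migrate the config parser)"
```

This mirrors "plan, then implement one slice": the plan is the decomposition, and each slice is solved with the previous slices already in context.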