Feat/english thinking when hidden by cmyyy · Pull Request #1843 · Hmbown/CodeWhale

cmyyy · 2026-05-20T16:18:08Z

Problem

When show_thinking is disabled, thinking blocks are hidden from the
UI (HistoryCell::Thinking is filtered out in history.rs), but the
API still generates reasoning_content (controlled separately by
reasoning_effort). The ## Language rule in base.md forces the
thinking chain to match the user's input language — so a Chinese user
pays for Chinese thinking they never see.

The token-count difference between Chinese and English reasoning is
modest (DeepSeek's tokenizer handles CJK efficiently), but there is
still waste: the surrounding system prompt is English, mixed-language
reasoning fragments the token stream, and invisible content has no
reason to be localized at all.

Solution

Inject a ## Thinking Language override into the system prompt when
show_thinking is false. The model is told:

The user has disabled thinking display in settings — they will never
see your reasoning_content. Therefore, your internal thinking MUST
be in English regardless of the user's language. Your final reply
must STILL match the user's language.

Benefits:

Prompt-cache locality — English reasoning sits better inside an
English system prompt, improving prefix-cache hit rates.
Mixed-language overhead — reasoning often interleaves code
identifiers, file paths, and API names with natural language.
Switching between CJK and ASCII mid-stream creates unnecessary
token-boundary breaks.
Aligns cost with intent — when the user hides thinking, there is
no user-facing reason to localize it.

Changes (6 files, +53 lines)

File	Change
`crates/tui/src/prompts.rs`	Add `show_thinking` to `PromptSessionContext`; inject `## Thinking Language` override

|
| crates/tui/src/core/ops.rs | Add show_thinking to Op::SendMessage |
| crates/tui/src/core/engine.rs | Add to EngineConfig; thread through handle_send_message and
refresh_system_prompt |
| crates/tui/src/tui/ui.rs | Wire app.show_thinking through build_engine_config and prompt construction |
| crates/tui/src/main.rs | Pass show_thinking in CLI exec path |
| crates/tui/src/runtime_threads.rs | show_thinking: false for background agent threads |

Data flow: Settings::show_thinking → App::show_thinking →
Op::SendMessage → EngineConfig::show_thinking →
PromptSessionContext::show_thinking → override injection.

Behavior

`show_thinking`	Reasoning language	Reply language
`true` (default)	Matches user input	Matches user input
`false`	English (forced)	Matches user input

Testing

cargo test --all-features — 3120 passed; 6 pre-existing
failures unrelated to this change (API key / temp directory /
AGENTS.md path)
cargo fmt --all -- --check — clean
cargo clippy --all-targets --all-features — clean (2
pre-existing needless_return warnings unrelated to this change)

Checklist

Updated docs or comments as needed
Added or updated tests where relevant (existing prompt tests
updated for the new PromptSessionContext field)
Verified TUI behavior manually if UI changes (no UI logic
change; the entire path is a backend-only data flow)

When `show_thinking` is disabled in settings, thinking blocks are hidden from the UI but the API still generates `reasoning_content`. Because of the `## Language` rule in the system prompt, the thinking chain follows the user's input language — if the user writes in Chinese, the model thinks in Chinese for content they never see. The tokenizer-level savings are modest (DeepSeek's vocab handles Chinese efficiently), but the real benefit is keeping invisible reasoning in English for better prompt-cache locality and fewer token-boundary breaks in mixed-language (code + natural language) contexts. When thinking is hidden, there is no reason not to use the most cache-friendly language. Changes: - Add `show_thinking: bool` to `PromptSessionContext`, `EngineConfig`, and `Op::SendMessage` - Inject a `## Thinking Language` override when `show_thinking` is false, redirecting `reasoning_content` to English while the final reply still matches the user's language - Wire the field through the engine and TUI layers

Replace "To save tokens" with a rationale based on the user's intent: when thinking is hidden, there is no reason to localize it.

gemini-code-assist

Code Review

This pull request introduces a show_thinking configuration option across the TUI engine, prompts, and UI components. When disabled, the system prompt is updated to instruct the model to perform its internal reasoning in English to save tokens, while still responding in the user's preferred language. Feedback was provided regarding the formatting of the multi-line system prompt string in crates/tui/src/prompts.rs, where the use of line continuation characters might introduce unintended whitespace.

gemini-code-assist · 2026-05-20T16:22:30Z

+        full_prompt.push_str(
+            "\n\n## Thinking Language\n\n\
+             The user has disabled thinking display in settings — they will \
+             never see your `reasoning_content`. Therefore, your internal \
+             thinking MUST be in English regardless of the user's language. \
+             This directive overrides the `## Language` section above for \
+             reasoning_content only. Your final reply must STILL match the \
+             user's language.",
+        );


The use of \ for line continuation in this string literal will include all the leading whitespace from the subsequent lines. This results in a single long line with multiple spaces between words, which is likely not the intended formatting for the prompt. Using string literal concatenation is a cleaner way to format this multi-line string while maintaining readability in the code.

full_prompt.push_str( "\n\n## Thinking Language\n\n" "The user has disabled thinking display in settings — they will " "never see your `reasoning_content`. Therefore, your internal " "thinking MUST be in English regardless of the user's language. " "This directive overrides the `## Language` section above for " "reasoning_content only. Your final reply must STILL match the " "user's language." );

…ale (Hmbown#1842/Hmbown#1843) - Add show_thinking flag to PromptSessionContext - When show_thinking=false, emit hidden-thinking English instruction - Omit locale-reinforcement bookends when user can't see thinking blocks - Keep final-visible-reply language rule unchanged - Add test for hidden-thinking language directive

Hmbown · 2026-05-27T13:25:20Z

Independent review:

PR threads a new show_thinking: bool from settings through EngineConfig → PromptSessionContext → the system prompt builder, and emits a ## Thinking Language block when !show_thinking instructing the model to keep reasoning_content in English while replying in the user's locale. Wiring is correct and the test coverage is reasonable.

This is fully superseded by v0.8.48 (pr-2256):

crates/tui/src/prompts.rs already declares pub show_thinking: bool on PromptSessionContext (line 40) with default true.
v0.8.48 emits the override under the header ## Hidden Thinking Language (line 118-121 of prompts.rs) with essentially the same semantics: "keep that hidden internal thinking in English regardless of..." — the test even asserts text.contains("## Hidden Thinking Language") and text.contains("reasoning_content") && text.contains("English").
runtime_threads.rs in v0.8.48 already reads Settings::load().show_thinking (line 1614) and propagates it.
All ~12 show_thinking: ... initializers across prompts.rs tests in v0.8.48 confirm the field is fully integrated.

Merge sim vs main: conflicts in core/engine.rs, main.rs, prompts.rs, runtime_threads.rs, tui/ui.rs. Vs pr-2256: same five files conflict — v0.8.48 already owns the surface this PR wants to add.

Header naming differs slightly (## Thinking Language here vs ## Hidden Thinking Language in v0.8.48); v0.8.48's is clearer.

v0.8.48 (#2256) compatibility: superseded — v0.8.48 implements the same feature with the same show_thinking field name and equivalent prompt-injection semantics. Recommend close as superseded by #2256.

Hmbown · 2026-05-27T13:40:37Z

@cmyyy — thank you for this. The "English thinking when hidden" feature you proposed is now fully present in v0.8.48 (#2256) — same show_thinking: bool field on PromptSessionContext, the same ## Hidden Thinking Language system-prompt block, and the same Settings::load().show_thinking wiring through runtime_threads.rs. Your design landed.

If there's a specific test scenario or edge case from your branch you'd like included in #2256 before it merges — particularly around the locale interaction (zh/ja preserving final-reply language while reasoning stays English) — flag it and I'll help carry it forward. Otherwise this gets credited in the v0.8.48 release notes.

Genuinely good idea — thanks for proposing it.

…ale (Hmbown#1842/Hmbown#1843) - Add show_thinking flag to PromptSessionContext - When show_thinking=false, emit hidden-thinking English instruction - Omit locale-reinforcement bookends when user can't see thinking blocks - Keep final-visible-reply language rule unchanged - Add test for hidden-thinking language directive

cmyyy added 2 commits May 21, 2026 00:03

fixup: refine thinking language override wording

7ab0c4d

Replace "To save tokens" with a rationale based on the user's intent: when thinking is hidden, there is no reason to localize it.

cmyyy mentioned this pull request May 20, 2026

show_thinking=false still wastes tokens on non-English reasoning_content #1842

Open

gemini-code-assist Bot reviewed May 20, 2026

View reviewed changes

Hmbown mentioned this pull request May 21, 2026

v0.8.42 tracker: inbox-zero triage and backlog close-out #1876

Closed

Hmbown added this to the v0.8.47 milestone May 21, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feat/english thinking when hidden#1843

Feat/english thinking when hidden#1843
cmyyy wants to merge 2 commits into
Hmbown:mainfrom
cmyyy:feat/english-thinking-when-hidden

cmyyy commented May 20, 2026 •

edited

Loading

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot May 20, 2026

Uh oh!

Hmbown commented May 27, 2026

Uh oh!

Hmbown commented May 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

cmyyy commented May 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Problem

Solution

Changes (6 files, +53 lines)

Behavior

Testing

Checklist

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot May 20, 2026

Choose a reason for hiding this comment

Uh oh!

Hmbown commented May 27, 2026

Uh oh!

Hmbown commented May 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

cmyyy commented May 20, 2026 •

edited

Loading