Cli install sdk evals by viadezo1er · Pull Request #85 · braintrustdata/bt

ViaDézo1er / cedric (viadezo1er) · 2026-03-27T01:52:42Z

bt setup instrument: interactive mode, language selection, and scoped permissions

Adds three new flags to bt setup instrument and wires them end-to-end through agent invocation and task generation.

--interactive / -i opens the agent in its interactive TUI (Claude Code, etc.) so the user can review and approve each tool use.
--yolo runs the agent in the background with bypassPermissions — no approval prompts.
--language <LANG> restricts instrumentation to specific languages (python, typescript, go, java, ruby, csharp); repeatable; omit to let the agent auto-detect.

Run-mode prompt (interactive terminal, no flags)

When none of the above flags are passed and the terminal is interactive, the user is asked how to run the agent. Background mode uses acceptEdits with --allowedTools scoped to the package managers for the selected language(s) only (e.g. uv for Python, npm/yarn/pnpm for TypeScript, dotnet for C#). Interactive TUI mode opens the agent's terminal UI.

Language selection prompt

A multi-select prompt is shown between the workflow and run-mode prompts. Selecting "All languages" (the default) lets the agent auto-detect; selecting specific languages also narrows the background tool allowlist.

Andrew Kent (realark) · 2026-03-27T20:08:37Z

skills/sdk-install/instrument-task.md

+
+**How to obtain the permalink:**
+
+Most language SDKs print a direct URL to the emitted trace after the app runs. Capture that URL and print it.


I don't think this is true outside of our sample applications. Especially with auto-instrumentation, the app will usually not print a url to the trace.

In mcp, the coding agent can fetch recent objects and get a permalink that way. Does the bt cli agent have a similar means to do this?

Would be super useful if we could figure this out. Getting a valid url was very helpful feedback to the agent when it was installing via mcp. Not sure if this is possible in bt cli though

Andrew Kent (realark) · 2026-03-27T20:16:49Z

src/setup/sdk_install_docs.rs

+    write_text_file(&dir.join("java.md"), JAVA_DOCS)?;
+    write_text_file(&dir.join("csharp.md"), CSHARP_DOCS)?;
+    write_text_file(&dir.join("braintrust-url-formats.md"), URL_FORMATS_DOCS)?;
+    write_text_file(&dir.join("_index.md"), INDEX)?;


should instrument-task.md be in this list too?

It's passed as the first prompt to the agent (wether the agent is in the background or in claude code/codex/...), and tweaked a bit depending on the options chosen.
See src/setup/mod.rs lines 27 and 1475

Adds an optional, repeatable `--language` flag to `bt setup instrument` that lets callers specify the target language(s) directly, bypassing the agent's language auto-detection step. Accepted values (case-insensitive): python, typescript, javascript, go, csharp, c#, java, ruby `typescript` and `javascript` are treated as the same language; duplicate values are deduplicated before being passed to the agent. When one or more languages are provided the rendered task prompt includes a "Language Override" section telling the agent to skip Step 2 (auto-detection) and instrument the specified language(s) directly. Also fixes a pre-existing compile error in tests where `render_instrument_task` was already called with a `workflows` argument that the implementation didn't accept, and adds the `{WORKFLOW_CONTEXT}` placeholder so non-instrument workflows inject `bt` CLI guidance. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…chosen

ViaDézo1er / cedric (viadezo1er) · 2026-03-27T20:45:36Z

CI passes after git rebase origin/main cedric/cli-install-sdk-evals

github-actions · 2026-03-27T20:54:42Z

Latest downloadable build artifacts for this PR commit bcec4727269d:

Workflow run: https://github.com/braintrustdata/bt/actions/runs/23666717791
Download all artifacts (GitHub CLI): gh run download 23666717791 --repo braintrustdata/bt
Installers are published from main automatically. To publish one for a PR branch, run release-canary manually via workflow_dispatch.

Available artifact names

``artifacts-build-global
``artifacts-build-local-x86_64-pc-windows-msvc
``artifacts-build-local-x86_64-apple-darwin
``artifacts-build-local-x86_64-unknown-linux-musl
``artifacts-build-local-aarch64-apple-darwin
``artifacts-build-local-x86_64-unknown-linux-gnu
``artifacts-build-local-aarch64-unknown-linux-musl
``artifacts-build-local-aarch64-unknown-linux-gnu
``artifacts-plan-dist-manifest
``cargo-dist-cache

ViaDézo1er / cedric (viadezo1er) force-pushed the cedric/cli-install-sdk-evals branch from 8dc2661 to b68b629 Compare March 27, 2026 19:55

ViaDézo1er / cedric (viadezo1er) requested review from Abhijeet Prasad (AbhiPrasad) and Olmo Maldonado (ibolmo) and removed request for Olmo Maldonado (ibolmo) March 27, 2026 19:57

ViaDézo1er / cedric (viadezo1er) marked this pull request as ready for review March 27, 2026 20:05

Andrew Kent (realark) reviewed Mar 27, 2026

View reviewed changes

ViaDézo1er / cedric (viadezo1er) and others added 3 commits March 27, 2026 13:43

feat: add mcp prompts to bt cli

a5af193

feat: bt setup install evals, either in the background or in the TUI …

bcec472

…chosen

ViaDézo1er / cedric (viadezo1er) force-pushed the cedric/cli-install-sdk-evals branch from b68b629 to bcec472 Compare March 27, 2026 20:44

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cli install sdk evals#85

Cli install sdk evals#85
ViaDézo1er / cedric (viadezo1er) wants to merge 3 commits intomainfrom
cedric/cli-install-sdk-evals

ViaDézo1er / cedric (viadezo1er) commented Mar 27, 2026

Uh oh!

Andrew Kent (realark) Mar 27, 2026

Uh oh!

Andrew Kent (realark) Mar 27, 2026

Uh oh!

Andrew Kent (realark) Mar 27, 2026

Uh oh!

ViaDézo1er / cedric (viadezo1er) Mar 27, 2026

Uh oh!

ViaDézo1er / cedric (viadezo1er) commented Mar 27, 2026 •

edited

Loading

Uh oh!

github-actions bot commented Mar 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants


		How to obtain the permalink:

		Most language SDKs print a direct URL to the emitted trace after the app runs. Capture that URL and print it.

Conversation

ViaDézo1er / cedric (viadezo1er) commented Mar 27, 2026

bt setup instrument: interactive mode, language selection, and scoped permissions

Run-mode prompt (interactive terminal, no flags)

Language selection prompt

Uh oh!

Andrew Kent (realark) Mar 27, 2026

Choose a reason for hiding this comment

Uh oh!

Andrew Kent (realark) Mar 27, 2026

Choose a reason for hiding this comment

Uh oh!

Andrew Kent (realark) Mar 27, 2026

Choose a reason for hiding this comment

Uh oh!

ViaDézo1er / cedric (viadezo1er) Mar 27, 2026

Choose a reason for hiding this comment

Uh oh!

ViaDézo1er / cedric (viadezo1er) commented Mar 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Mar 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

ViaDézo1er / cedric (viadezo1er) commented Mar 27, 2026 •

edited

Loading