docs: consolidated Evals docs (Performance redesign + workforce evals) by jordanc-relevanceai · Pull Request #662 · RelevanceAI/relevance-docs

jordanc-relevanceai · 2026-06-03T09:49:01Z

This PR consolidates 2 drafter PRs that both modify the agent Evals page. Opened as draft for review before the source PRs are closed.

Source PRs (being closed in favor of this one)

docs(TSP-1256): update agent evals page for multi-dashboard Performance tab redesign #645 — TSP-1256: Evals page rewrite for the multi-dashboard Performance tab
docs(TSP-1095): add workforce evaluations documentation #542 — TSP-1095: workforce evaluations

Why consolidated

Both edit build/agents/build-your-agent/evals.mdx. #645 is a near-total rewrite (renames the Monitor tab to Evaluate/Performance, "Evaluators" to "checks", documents multiple Performance dashboards). #542's additions to the same file would conflict badly with that rewrite and reintroduce the old "Monitor"/"Evaluators" terminology. Consolidating lets the rewrite land while keeping #542's genuinely new workforce-evals content.

Changes by source PR

#645 — TSP-1256

Rewrites evals.mdx: Monitor → Performance tab, dashboard list page, creating/configuring Performance dashboards, per-check charts, "Use cases for multiple dashboards", and replaces "evaluators" with "checks" throughout.

#542 — TSP-1095

Adds a workforce evals <Note> to the top of evals.mdx: evals also run against entire workforces, currently API-only via the Relevance AI MCP / eval API.
Adds an "Evaluate a workforce" example accordion to get-started/core-concepts/programmatic-gtm.mdx.

Reconciliation note

Kept docs(TSP-1256): update agent evals page for multi-dashboard Performance tab redesign #645's rewritten evals.mdx in full, then grafted in docs(TSP-1095): add workforce evaluations documentation #542's workforce-evals <Note> (it's still accurate — it doesn't reference the old tab names).
Dropped docs(TSP-1095): add workforce evaluations documentation #542's other evals.mdx hunks: they were cosmetic iframe-embed reformatting that conflicted with docs(TSP-1256): update agent evals page for multi-dashboard Performance tab redesign #645's rewrite and carried no content. Heads-up: docs(TSP-1095): add workforce evaluations documentation #542's PR description mentioned a separate workforce-features/evals.mdx page, but that file was not in its branch's actual diff — its only real contributions were the Note and the MCP example, both preserved here.
docs(TSP-1095): add workforce evaluations documentation #542's programmatic-gtm accordion merged with no conflict.

Adds evals page for workforces covering generate-and-score and score-only modes, evaluator types, key differences from agent evals, and when to use each mode. Updates docs.json navigation to include the new page. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Workforce evals BE shipped (relevance-api-node #12943) but FE is still in flight. Replacing the standalone workforce evals page with a small note on the agent evals page and a workforce prompt example on the Programmatic GTM intro — pointing users at the MCP/API path that actually works today. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

Both embeds were using kebab-case 'padding-top' (invalid in JSX style objects), 56.75% instead of 56.25%, and a single-line wrapper that didn't match the standard snippet. Swapped in the canonical wrapper from the style guide. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

… redesign Reflects the redesigned Performance tab (formerly Monitor): - Renames Monitor section to Performance throughout - Documents the new dashboard list page with preview cards - Adds steps for creating and configuring dashboards via the settings drawer - Describes individual dashboard views (charts, version markers, run history table) - Adds instructions for updating dashboard settings post-creation - Adds new "Use cases for multiple dashboards" section with CardGroup - Replaces all "evaluators" references with "checks" - Updates best practices and FAQs to cover Performance-specific questions Linear: https://linear.app/relevance/issue/TSP-1256/ Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…evals

645 rewrote evals.mdx (Monitor->Performance, Evaluators->checks). 542's only substantive evals.mdx change was a workforce-evals Note; its other hunks were cosmetic iframe reformatting, dropped in favor of 645's rewrite. Kept 542's workforce Note (still accurate) and its MCP 'Evaluate a workforce' example. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

mintlify · 2026-06-03T09:49:04Z

Preview deployment for your docs. Learn more about Mintlify Previews.

Project	Status	Preview	Updated (UTC)
relevanceai	🟢 Ready	View Preview	Jun 3, 2026, 9:51 AM

💡 Tip: Enable Workflows to automatically generate PRs for you.

github-actions Bot and others added 6 commits May 6, 2026 16:28

Merge remote-tracking branch 'origin/docs/TSP-1256' into consolidate/…

3b5f5a4

…evals

This was referenced Jun 3, 2026

docs(TSP-1256): update agent evals page for multi-dashboard Performance tab redesign #645

Closed

docs(TSP-1095): add workforce evaluations documentation #542

Closed

mintlify Bot deployed to staging June 3, 2026 09:51 View deployment

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs: consolidated Evals docs (Performance redesign + workforce evals)#662

docs: consolidated Evals docs (Performance redesign + workforce evals)#662
jordanc-relevanceai wants to merge 6 commits into
mainfrom
consolidate/evals

jordanc-relevanceai commented Jun 3, 2026

Uh oh!

mintlify Bot commented Jun 3, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

jordanc-relevanceai commented Jun 3, 2026

Source PRs (being closed in favor of this one)

Why consolidated

Changes by source PR

#645 — TSP-1256

#542 — TSP-1095

Reconciliation note

Uh oh!

mintlify Bot commented Jun 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

mintlify Bot commented Jun 3, 2026 •

edited

Loading