feat: warn on MCP tool response content-type drift between calls by julian-risch · Pull Request #3364 · deepset-ai/haystack-core-integrations

julian-risch · 2026-05-28T15:46:41Z

Related Issues

Fixes https://github.com/deepset-ai/haystack-private/issues/362

Proposed Changes:

Add _check_response_shape in mcp_toolset.py that records the set of content block types each MCP tool returns on its first invocation and emits a warning whenever a later call introduces a previously unseen type.
Wire the per-toolset baseline (MCPToolset._response_shapes) through create_invoke_tool so every tool produced by an MCPToolset participates in the check.
Move the JSON parse to the top of invoke_tool and share it with the existing outputs_to_state extraction path; on parse failure we still return the raw string so the no-outputs_to_state contract is preserved.
Add a RugPull fixture server that returns TextContent on the first call and a ResourceLink on the second, plus unit tests for the helper directly and one integration-style test against the rug-pull server.

This is intentionally a detection-only signal — it does not block the response — because the MCP protocol allows servers to legitimately vary their content types between calls, and an attacker who keeps the content type stable will not trip this check. The aim is to give pipeline owners a clear, loggable signal when a tool's response shape shifts under them.

How did you test it?

New tests:
- TestCheckResponseShape::test_first_call_establishes_baseline
- TestCheckResponseShape::test_drift_emits_warning_and_extends_baseline
- TestCheckResponseShape::test_same_shape_does_not_warn
- TestCheckResponseShape::test_non_dict_parsed_is_ignored
- TestCheckResponseShape::test_missing_or_malformed_content_field_is_ignored
- TestMCPToolset::test_response_shape_drift_logs_warning

Notes for the reviewer

Checklist

I have read the contributors guidelines and the code of conduct
I have updated the related issue with new insights and changes
I added unit tests and updated the docstrings
I've used one of the conventional commit types for my PR title: fix:, feat:, build:, chore:, ci:, docs:, style:, refactor:, perf:, test:.

Track the set of content block types each MCP tool returns and emit a warning when a subsequent invocation introduces a previously unseen type. This surfaces a class of server-side rug-pull where a benign tool silently substitutes different content (e.g. a ResourceLink with a sensitive URI) on later calls. Detection only — does not block. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

github-actions · 2026-05-28T15:48:17Z

Coverage report (mcp)

Click to see where and how coverage changed

File	Statements	Missing	Coverage	Coverage (new stmts)	Lines missing
integrations/mcp/src/haystack_integrations/tools/mcp
mcp_toolset.py					396-397
Project Total

_{This report was generated by python-coverage-comment-action}

github-actions Bot added integration:mcp type:documentation Improvements or additions to documentation labels May 28, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: warn on MCP tool response content-type drift between calls#3364

feat: warn on MCP tool response content-type drift between calls#3364
julian-risch wants to merge 1 commit into
mainfrom
feat/mcp-toolset-validate-response-shape

julian-risch commented May 28, 2026 •

edited

Loading

Uh oh!

github-actions Bot commented May 28, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

julian-risch commented May 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Related Issues

Proposed Changes:

How did you test it?

Notes for the reviewer

Checklist

Uh oh!

github-actions Bot commented May 28, 2026

Coverage report (mcp)

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

julian-risch commented May 28, 2026 •

edited

Loading