The Model Context Protocol (MCP) standardizes how applications expose tools and context to language models. From the official documentation:
> MCP is an open protocol that standardizes how applications provide context to LLMs. Think of MCP like a USB-C port for AI applications. Just as USB-C provides a standardized way to connect your devices to various peripherals and accessories, MCP provides a standardized way to connect AI models to different data sources and tools.
The Agents Python SDK understands multiple MCP transports. This lets you reuse existing MCP servers or build your own to expose filesystem, HTTP, or connector-backed tools to an agent.
Before wiring an MCP server into an agent, decide where the tool calls should execute and which transports you can reach. The matrix below summarizes the options that the Python SDK supports.
| What you need | Recommended option |
|---|---|
| Let OpenAI's Responses API call a publicly reachable MCP server on the model's behalf | Hosted MCP server tools via [HostedMCPTool][agents.tool.HostedMCPTool] |
| Connect to Streamable HTTP servers that you run locally or remotely | Streamable HTTP MCP servers via [MCPServerStreamableHttp][agents.mcp.server.MCPServerStreamableHttp] |
| Talk to servers that implement HTTP with Server-Sent Events | HTTP with SSE MCP servers via [MCPServerSse][agents.mcp.server.MCPServerSse] |
| Launch a local process and communicate over stdin/stdout | stdio MCP servers via [MCPServerStdio][agents.mcp.server.MCPServerStdio] |
The sections below walk through each option, how to configure it, and when to prefer one transport over another.
## Hosted MCP server tools

Hosted tools push the entire tool round-trip into OpenAI's infrastructure. Instead of your code listing and calling tools, the
[HostedMCPTool][agents.tool.HostedMCPTool] forwards a server label (and optional connector metadata) to the Responses API. The
model lists the remote server's tools and invokes them without an extra callback to your Python process. Hosted tools currently
work with OpenAI models that support the Responses API's hosted MCP integration.
Create a hosted tool by adding a [HostedMCPTool][agents.tool.HostedMCPTool] to the agent's `tools` list. The `tool_config`
dict mirrors the JSON you would send to the REST API:
```python
import asyncio

from agents import Agent, HostedMCPTool, Runner


async def main() -> None:
    agent = Agent(
        name="Assistant",
        tools=[
            HostedMCPTool(
                tool_config={
                    "type": "mcp",
                    "server_label": "gitmcp",
                    "server_url": "https://gitmcp.io/openai/codex",
                    "require_approval": "never",
                }
            )
        ],
    )

    result = await Runner.run(agent, "Which language is this repository written in?")
    print(result.final_output)


asyncio.run(main())
```

The hosted server exposes its tools automatically; you do not add it to `mcp_servers`.
### Streaming hosted MCP results

Hosted tools support streaming results in exactly the same way as function tools. Use `Runner.run_streamed` to consume incremental MCP output while the model is still working:
```python
result = Runner.run_streamed(agent, "Summarise this repository's top languages")
async for event in result.stream_events():
    if event.type == "run_item_stream_event":
        print(f"Received: {event.item}")
print(result.final_output)
```

### Optional approval flows

If a server can perform sensitive operations, you can require human or programmatic approval before each tool execution. Configure `require_approval` in the `tool_config` with either a single policy (`"always"`, `"never"`) or a dict mapping tool names to policies. To make the decision inside Python, provide an `on_approval_request` callback.
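When a static per-tool policy is enough, the dict form avoids a callback entirely. The exact dict schema is defined by the Responses API; the sketch below assumes the shape that groups tool names under each policy, so verify it against the REST reference before relying on it:

```python
HostedMCPTool(
    tool_config={
        "type": "mcp",
        "server_label": "gitmcp",
        "server_url": "https://gitmcp.io/openai/codex",
        # Assumed REST schema: tool names grouped under each policy.
        "require_approval": {
            "never": {"tool_names": ["read_project_metadata"]},
            "always": {"tool_names": ["write_file"]},
        },
    }
)
```

When the decision depends on runtime state, provide the callback instead: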
```python
from agents import MCPToolApprovalFunctionResult, MCPToolApprovalRequest

SAFE_TOOLS = {"read_project_metadata"}


def approve_tool(request: MCPToolApprovalRequest) -> MCPToolApprovalFunctionResult:
    if request.data.name in SAFE_TOOLS:
        return {"approve": True}
    return {"approve": False, "reason": "Escalate to a human reviewer"}


agent = Agent(
    name="Assistant",
    tools=[
        HostedMCPTool(
            tool_config={
                "type": "mcp",
                "server_label": "gitmcp",
                "server_url": "https://gitmcp.io/openai/codex",
                "require_approval": "always",
            },
            on_approval_request=approve_tool,
        )
    ],
)
```

The callback can be synchronous or asynchronous and is invoked whenever the model needs approval data to keep running.
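An asynchronous variant of the same callback is a drop-in replacement; a minimal sketch, with the policy lookup stubbed out:

```python
import asyncio

from agents import MCPToolApprovalFunctionResult, MCPToolApprovalRequest


async def approve_tool_async(request: MCPToolApprovalRequest) -> MCPToolApprovalFunctionResult:
    # Stand-in for an async policy lookup (database, review queue, etc.).
    await asyncio.sleep(0)
    if request.data.name == "read_project_metadata":
        return {"approve": True}
    return {"approve": False, "reason": "Escalate to a human reviewer"}
```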
### Connector-backed hosted servers

Hosted MCP also supports OpenAI connectors. Instead of specifying a `server_url`, supply a `connector_id` and an access token. The Responses API handles authentication and the hosted server exposes the connector's tools.
```python
import os

HostedMCPTool(
    tool_config={
        "type": "mcp",
        "server_label": "google_calendar",
        "connector_id": "connector_googlecalendar",
        "authorization": os.environ["GOOGLE_CALENDAR_AUTHORIZATION"],
        "require_approval": "never",
    }
)
```

Fully working hosted tool samples, including streaming, approvals, and connectors, live in `examples/hosted_mcp`.
## Streamable HTTP MCP servers

When you want to manage the network connection yourself, use
[MCPServerStreamableHttp][agents.mcp.server.MCPServerStreamableHttp]. Streamable HTTP servers are ideal when you control the
transport or want to run the server inside your own infrastructure while keeping latency low.
```python
import asyncio
import os

from agents import Agent, Runner
from agents.mcp import MCPServerStreamableHttp
from agents.model_settings import ModelSettings


async def main() -> None:
    token = os.environ["MCP_SERVER_TOKEN"]
    async with MCPServerStreamableHttp(
        name="Streamable HTTP Python Server",
        params={
            "url": "http://localhost:8000/mcp",
            "headers": {"Authorization": f"Bearer {token}"},
            "timeout": 10,
        },
        cache_tools_list=True,
        max_retry_attempts=3,
    ) as server:
        agent = Agent(
            name="Assistant",
            instructions="Use the MCP tools to answer the questions.",
            mcp_servers=[server],
            model_settings=ModelSettings(tool_choice="required"),
        )

        result = await Runner.run(agent, "Add 7 and 22.")
        print(result.final_output)


asyncio.run(main())
```

The constructor accepts additional options:
- `client_session_timeout_seconds` controls HTTP read timeouts.
- `use_structured_content` toggles whether `tool_result.structured_content` is preferred over textual output.
- `max_retry_attempts` and `retry_backoff_seconds_base` add automatic retries for `list_tools()` and `call_tool()`.
- `tool_filter` lets you expose only a subset of tools (see Tool filtering below).
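Putting those options together, a tuned constructor might look like the following sketch (the values are illustrative, not recommendations):

```python
server = MCPServerStreamableHttp(
    name="Tuned Streamable HTTP Server",
    params={"url": "http://localhost:8000/mcp"},
    client_session_timeout_seconds=30,  # generous HTTP read timeout
    use_structured_content=True,  # prefer tool_result.structured_content
    max_retry_attempts=5,  # retry list_tools() and call_tool()
    retry_backoff_seconds_base=2.0,  # exponential backoff between retries
    cache_tools_list=True,
)
```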
Agents convert each MCP tool into an SDK `FunctionTool`. If a remote tool raises an exception, the default behaviour is to bubble the error up and abort the run. When you want to return a string error message to the model instead, set `failure_error_function` in the agent's `mcp_config`. The callback receives the tool run context and the raised exception (with the original exception attached as `__cause__`) and should return the string that the model sees.
```python
from typing import Any

from agents import Agent, RunContextWrapper


def format_mcp_error(ctx: RunContextWrapper[Any], error: Exception) -> str:
    # Unwrap the root cause to keep the message concise.
    root_error = error.__cause__ or error
    return f"The MCP tool failed: {root_error}"


agent = Agent(
    name="Assistant",
    mcp_servers=[server],
    mcp_config={"failure_error_function": format_mcp_error},
)
```

If you prefer to let the exception terminate the run, omit `failure_error_function` or set it to `None`.
## HTTP with SSE MCP servers

If the MCP server implements the HTTP with SSE transport, instantiate [MCPServerSse][agents.mcp.server.MCPServerSse]. Apart from the transport, the API is identical to the Streamable HTTP server.
```python
from agents import Agent, Runner
from agents.mcp import MCPServerSse
from agents.model_settings import ModelSettings

workspace_id = "demo-workspace"

async with MCPServerSse(
    name="SSE Python Server",
    params={
        "url": "http://localhost:8000/sse",
        "headers": {"X-Workspace": workspace_id},
    },
    cache_tools_list=True,
) as server:
    agent = Agent(
        name="Assistant",
        mcp_servers=[server],
        model_settings=ModelSettings(tool_choice="required"),
    )
    result = await Runner.run(agent, "What's the weather in Tokyo?")
    print(result.final_output)
```

## stdio MCP servers

For MCP servers that run as local subprocesses, use [MCPServerStdio][agents.mcp.server.MCPServerStdio]. The SDK spawns the process, keeps the pipes open, and closes them automatically when the context manager exits. This option is helpful for quick proofs of concept or when the server only exposes a command-line entry point.
```python
from pathlib import Path

from agents import Agent, Runner
from agents.mcp import MCPServerStdio

current_dir = Path(__file__).parent
samples_dir = current_dir / "sample_files"

async with MCPServerStdio(
    name="Filesystem Server via npx",
    params={
        "command": "npx",
        "args": ["-y", "@modelcontextprotocol/server-filesystem", str(samples_dir)],
    },
) as server:
    agent = Agent(
        name="Assistant",
        instructions="Use the files in the sample directory to answer questions.",
        mcp_servers=[server],
    )
    result = await Runner.run(agent, "List the files available to you.")
    print(result.final_output)
```

## Tool filtering

Each MCP server supports tool filters so that you can expose only the functions that your agent needs. Filtering can happen at construction time or dynamically per run.
### Static tool filtering

Use [create_static_tool_filter][agents.mcp.create_static_tool_filter] to configure simple allow/block lists:
```python
from pathlib import Path

from agents.mcp import MCPServerStdio, create_static_tool_filter

samples_dir = Path("/path/to/files")

filesystem_server = MCPServerStdio(
    params={
        "command": "npx",
        "args": ["-y", "@modelcontextprotocol/server-filesystem", str(samples_dir)],
    },
    tool_filter=create_static_tool_filter(allowed_tool_names=["read_file", "write_file"]),
)
```

When both `allowed_tool_names` and `blocked_tool_names` are supplied, the SDK applies the allow-list first and then removes any blocked tools from the remaining set.
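For example, a filter that combines both lists behaves like this (the tool names mirror the filesystem server above):

```python
from agents.mcp import create_static_tool_filter

combined_filter = create_static_tool_filter(
    # The allow-list is applied first: read_file, write_file, delete_file survive.
    allowed_tool_names=["read_file", "write_file", "delete_file"],
    # The block-list then removes delete_file from the remaining set.
    blocked_tool_names=["delete_file"],
)
# Net effect: only read_file and write_file are exposed to the agent.
```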
### Dynamic tool filtering

For more elaborate logic, pass a callable that receives a [ToolFilterContext][agents.mcp.ToolFilterContext]. The callable can be synchronous or asynchronous and returns `True` when the tool should be exposed.
```python
from pathlib import Path

from agents.mcp import MCPServerStdio, ToolFilterContext

samples_dir = Path("/path/to/files")


async def context_aware_filter(context: ToolFilterContext, tool) -> bool:
    # Hide tools whose names flag them as dangerous from the code reviewer.
    if context.agent.name == "Code Reviewer" and tool.name.startswith("danger_"):
        return False
    return True


async with MCPServerStdio(
    params={
        "command": "npx",
        "args": ["-y", "@modelcontextprotocol/server-filesystem", str(samples_dir)],
    },
    tool_filter=context_aware_filter,
) as server:
    ...
```

The filter context exposes the active `run_context`, the agent requesting the tools, and the `server_name`.
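Those attributes make per-server and per-run decisions straightforward; a sketch, where the `allow_writes` flag on the run context is hypothetical:

```python
from agents.mcp import ToolFilterContext


def per_server_filter(context: ToolFilterContext, tool) -> bool:
    # Tools from other servers pass through untouched.
    if context.server_name != "Filesystem Server via npx":
        return True
    # `allow_writes` is a hypothetical attribute on your own context object,
    # reached through the active run context.
    allow_writes = getattr(context.run_context.context, "allow_writes", False)
    return allow_writes or not tool.name.startswith("write_")
```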
## Prompts

MCP servers can also provide prompts that dynamically generate agent instructions. Servers that support prompts expose two methods:

- `list_prompts()` enumerates the available prompt templates.
- `get_prompt(name, arguments)` fetches a concrete prompt, optionally with parameters.
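To see what a server offers before picking a prompt, you can enumerate the templates first. A minimal sketch, assuming `server` is a connected MCP server instance and that `list_prompts()` returns the MCP `ListPromptsResult` with `.prompts` entries:

```python
prompts_result = await server.list_prompts()
for prompt in prompts_result.prompts:
    print(f"{prompt.name}: {prompt.description}")
```

Fetching a concrete prompt and wiring it into an agent then looks like this: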
```python
from agents import Agent

prompt_result = await server.get_prompt(
    "generate_code_review_instructions",
    {"focus": "security vulnerabilities", "language": "python"},
)
instructions = prompt_result.messages[0].content.text

agent = Agent(
    name="Code Reviewer",
    instructions=instructions,
    mcp_servers=[server],
)
```

## Caching

Every agent run calls `list_tools()` on each MCP server. Remote servers can introduce noticeable latency, so all of the MCP server classes expose a `cache_tools_list` option. Set it to `True` only if you are confident that the tool definitions do not change frequently. To force a fresh list later, call `invalidate_tools_cache()` on the server instance.
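The full cache lifecycle looks like this sketch:

```python
async with MCPServerStreamableHttp(
    name="Cached Server",
    params={"url": "http://localhost:8000/mcp"},
    cache_tools_list=True,  # list_tools() is fetched once and reused across runs
) as server:
    # ... run agents against the cached tool list ...
    # After the server's tool definitions change, force a refresh:
    server.invalidate_tools_cache()
```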
## Tracing

Tracing automatically captures MCP activity, including:
- Calls to the MCP server to list tools.
- MCP-related information on tool calls.
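For example, wrapping several runs in the SDK's `trace` helper groups their MCP spans under one trace; a minimal sketch reusing the stdio `agent` from earlier:

```python
import asyncio

from agents import Runner, trace


async def main() -> None:
    # Both runs, including their MCP tool-listing and tool-call spans,
    # are grouped under a single workflow trace.
    with trace(workflow_name="MCP filesystem workflow"):
        first = await Runner.run(agent, "List the files available to you.")
        print(first.final_output)
        second = await Runner.run(agent, "Summarise the largest file.")
        print(second.final_output)
```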
## Further reading

- [Model Context Protocol](https://modelcontextprotocol.io/) – the specification and design guides.
- `examples/mcp` – runnable stdio, SSE, and Streamable HTTP samples.
- `examples/hosted_mcp` – complete hosted MCP demonstrations including approvals and connectors.
