examples/anthropic/mcp/README.md
This example wires Claude up to a Model Context Protocol server through the Anthropic Messages provider. Promptfoo exposes the MCP server's tools to Claude, executes any tool_use blocks the model emits, and feeds the tool_result back into the conversation until Claude produces a final reply.
npx promptfoo@latest init --example anthropic/mcp
cd anthropic/mcp
export ANTHROPIC_API_KEY=sk-ant-...
npx promptfoo@latest eval
The bundled config uses the public deepwiki MCP server so the example works out of the box without installing anything. To swap in your own server, replace the mcp block:
mcp:
enabled: true
# Local stdio server
server:
command: npx
args: ['-y', '@modelcontextprotocol/server-filesystem', '/tmp/workspace']
# …or remote SSE / streamable HTTP
# servers:
# - name: my-server
# url: https://example.com/mcp
tools you define.tool_use block matching an MCP tool, promptfoo invokes the tool with the model's arguments and appends a tool_result on the next user turn. The loop continues until Claude returns text — bounded by max_tool_calls (default 8).tool_result blocks with is_error: true so Claude can recover or report the failure.web_search) fall through to the existing tool-use output without being auto-executed.Promptfoo's disk response cache is skipped automatically when mcp.enabled is true, because tool results can vary between runs. Use max_tool_calls to bound the per-request cost.