Security Quarantine

MCPProxy includes an automatic quarantine system to protect against Tool Poisoning Attacks (TPA).

What is a Tool Poisoning Attack?

Tool Poisoning Attacks occur when malicious MCP servers:

Hidden Instructions: Embed malicious instructions in tool descriptions that AI agents might follow
Data Exfiltration: Trick AI agents into sending sensitive data to external servers
Credential Theft: Attempt to extract API keys or tokens
System Manipulation: Try to execute unauthorized commands

How Quarantine Works

Automatic Quarantine

When a new server is added via an AI client (using the upstream_servers tool):

Server is automatically placed in quarantine status
Tool calls to quarantined servers return a security analysis instead of executing
Server remains quarantined until manually approved

Tool Discovery and Search Isolation

Quarantined servers are completely isolated from the tool discovery and search system:

Feature	Quarantined Server	Approved Server
Tools indexed	❌ No	✅ Yes
Tools searchable via `retrieve_tools`	❌ No	✅ Yes
Tools appear in HTTP API search	❌ No	✅ Yes
Tool calls allowed	❌ No (returns security analysis)	✅ Yes

This isolation prevents Tool Poisoning Attacks from:

Injecting malicious descriptions into search results that AI agents might read and follow
Appearing in tool recommendations where they could be mistakenly selected
Influencing AI agent behavior through carefully crafted tool metadata

When a server is quarantined:

Its tools are immediately removed from the search index
retrieve_tools queries will never return tools from that server
The server remains visible in the server list (marked as quarantined) for management

When a server is unquarantined (approved):

The server connects to discover its tools
Tools are indexed and become searchable
Tool calls are allowed to execute normally
Any pending (newly-discovered, never-reviewed) tool-approval records for the server are auto-promoted to approved — approving a server means you trust its current tool snapshot (baseline trust). Tools whose description or schema later changes (changed, i.e. rug-pull) are not affected and stay blocked until you re-approve them explicitly.

Security Analysis

When a quarantined server's tool is called, MCPProxy returns:

json

{
  "status": "quarantined",
  "server": "suspicious-server",
  "analysis": {
    "tool_count": 15,
    "suspicious_patterns": [
      "Tool 'fetch_data' description contains external URL",
      "Tool 'execute' has overly broad permissions"
    ],
    "risk_level": "medium",
    "recommendation": "Review tool descriptions before approving"
  }
}

Managing Quarantine

View Quarantined Servers

Web UI:

Open the dashboard
Click "Quarantine" in the navigation
Review pending servers

CLI:

bash

mcpproxy upstream list
# Shows quarantine status for each server

Approve a Server

Web UI:

Click on the quarantined server
Review the security analysis
Click "Approve" to remove from quarantine

API:

bash

curl -X POST \
  -H "X-API-Key: your-key" \
  http://127.0.0.1:8080/api/v1/servers/server-name/unquarantine

Config File:

Edit ~/.mcpproxy/mcp_config.json and add "quarantined": false:

json

{
  "mcpServers": [
    {
      "name": "reviewed-server",
      "command": "npx",
      "args": ["@example/mcp-server"],
      "quarantined": false,
      "enabled": true
    }
  ]
}

Re-quarantine a Server

If you need to quarantine a previously approved server:

bash

curl -X POST \
  -H "X-API-Key: your-key" \
  http://127.0.0.1:8080/api/v1/servers/server-name/quarantine

Security Checklist

Before approving a server, verify:

Source: Is the server from a trusted source?
Code Review: Have you reviewed the server's code?
Tool Descriptions: Do tool descriptions look legitimate?
Network Access: Does the server need network access?
Permissions: Are requested permissions appropriate?

Detection Patterns

MCPProxy checks for these suspicious patterns:

Pattern	Risk Level	Description
External URLs in descriptions	Medium	May indicate data exfiltration
Credential keywords	High	Mentions of "password", "token", "key"
Execution commands	High	Shell execution capabilities
Hidden instructions	Critical	Base64 encoded or obfuscated content
Overly broad permissions	Medium	Access to all files or network

Best Practices

Review All Servers: Never auto-approve servers added by AI agents
Source Verification: Only approve servers from known, trusted sources
Minimal Permissions: Prefer servers with limited, specific capabilities
Regular Audits: Periodically review approved servers
Network Isolation: Use Docker isolation with network_mode: "none" for untrusted servers

Tool-Level Quarantine

In addition to server-level quarantine, MCPProxy provides tool-level quarantine that detects changes to individual tool descriptions and schemas using SHA256 hashing. This protects against "rug pull" attacks where a previously trusted server silently modifies tool behavior.

See Tool Quarantine for complete documentation on:

SHA256 hash-based tool approval
CLI commands: mcpproxy upstream inspect and mcpproxy upstream approve
Configuration: quarantine_enabled and skip_quarantine
REST API endpoints for tool approval management

Block (approve + disable)

When reviewing a pending or changed tool you may want to acknowledge it but keep it hidden from MCP clients — for example, dismissing a noisy "changed" flag for a tool you never intend to use. The block operation does this atomically: it approves the tool (clearing the quarantine flag) and disables it in a single, all-or-nothing server-side write, so a tool is never left in the approved+enabled state.

REST: POST /api/v1/servers/{id}/tools/block with {"tools":[...]} or {"block_all": true}.
MCP: quarantine_security operations block_tool (with name + tool_name) and block_all_tools (with name).

A blocked tool can be re-exposed later with the normal enable operation (POST /api/v1/servers/{id}/tools/{tool}/enabled with {"enabled": true}).

Disabling Quarantine

Not recommended, but you can opt out of quarantine globally by setting a single top-level flag in ~/.mcpproxy/mcp_config.json:

json

{
  "quarantine_enabled": false
}

When quarantine_enabled is false:

Servers added dynamically via the upstream_servers MCP tool or the POST /api/v1/servers REST endpoint default to not quarantined.
Tool-level quarantine (per-tool SHA-256 approval of descriptions and schemas, see Tool Quarantine) is skipped.

An explicit quarantined field in an add-server request still wins over the default, so client code can always override on a per-server basis. Per-server skip_quarantine: true continues to apply at the tool level.

Warning: Disabling quarantine exposes your system to Tool Poisoning Attacks. Only do this on machines where every MCP server you connect to is already trusted.