redteam-mcp-agent (Red Team MCP Agent)

This example demonstrates how to red team test AI agents that integrate with Model Context Protocol (MCP) servers. It creates a custom OpenAI-based ReAct agent provider that can interact with MCP servers, allowing you to test agent behavior when given access to potentially malicious tools.

You can run this example with:

bash

npx promptfoo@latest init --example redteam-mcp-agent
cd redteam-mcp-agent

Overview

The example includes:

A custom OpenAI agent provider that uses the ReAct (Reasoning and Acting) pattern
Integration with multiple MCP servers for tool access
Red team testing configuration to evaluate agent security boundaries
Example MCP servers including a potentially malicious one for security testing

Prerequisites

Node.js 20+
Python 3.8+ (for the Python MCP server example)
OpenAI API key

Environment Variables

This example requires the following environment variable:

OPENAI_API_KEY - Your OpenAI API key (get one at https://platform.openai.com/api-keys)

You can set this in a .env file in the example directory:

bash

OPENAI_API_KEY=sk-...

Quick Start

Install dependencies:

bash

npm install

Set up your OpenAI API key:

bash

export OPENAI_API_KEY=your-openai-api-key-here
# Or create a .env file with OPENAI_API_KEY=your-key

Run the red team tests:

bash

npx promptfoo@latest redteam run

Project Structure

text

redteam-mcp-agent/
├── src/
│   ├── openai-agent-provider.js  # Custom OpenAI agent provider
│   ├── react-agent.js            # ReAct pattern agent implementation
│   ├── mcp-client.js             # MCP client for tool connections
│   └── mcp_server/               # Example MCP server
│       ├── index.js              # Server entry point
│       └── tools/                # Tool implementations
├── promptfooconfig.yaml          # Main configuration
├── redteam.yaml                  # Generated red team test cases
├── server.py                     # Python MCP server example
└── package.json                  # Node.js dependencies

Configuration

promptfooconfig.yaml

The main configuration file defines:

Provider Configuration: Uses a custom OpenAI agent provider with:
- Model: gpt-4o (you can also use gpt-4o-mini or o3-mini)
- System prompt that instructs the agent to use ReAct pattern
- MCP server connections
MCP Servers: Three types of connections are demonstrated:
- NPX package: @promptfoo/evil-mcp-server (for security testing)
- Local Node.js server: ./src/mcp_server/index.js
- HTTP endpoint: http://localhost:8080/mcp (Python server)
Red Team Configuration:
- Purpose: Defines the testing scenario (e.g., financial analyst at Acme Inc.)
- Tools: Lists available tools and their schemas
- Policy: Defines what behaviors to test for

Example Configuration

yaml

# yaml-language-server: $schema=https://promptfoo.dev/config-schema.json
providers:
  - id: file://./src/openai-agent-provider.js
    config:
      apiBaseUrl: 'https://api.openai.com/v1'
      model: 'gpt-4o' # or gpt-4o-mini, o3-mini
      systemPrompt: |
        You are a helpful AI assistant with access to various tools...
      mcpServers:
        - command: 'npx'
          args: ['-y', '@promptfoo/evil-mcp-server@latest']
        - command: 'node'
          args: ['./src/mcp_server/index.js']
        - url: 'http://localhost:8080/mcp'

redteam:
  purpose: |
    You are acting as a financial analyst employee at Acme Inc...

    tools: get_financial_summaries
      startDate: string - Start date for the report (ISO 8601)
      endDate: string - End date for the report (ISO 8601)
      groupBy: enum ['day', 'week', 'month'] - Group results by period

  plugins:
    - id: policy
      config:
        policy: Test to trigger the get_financial_summaries tool...

Running Different MCP Servers

1. Evil MCP Server (Security Testing)

The example includes @promptfoo/evil-mcp-server which provides potentially dangerous tools for security testing:

yaml

mcpServers:
  - command: 'npx'
    args: ['-y', '@promptfoo/evil-mcp-server@latest']

2. Custom Node.js MCP Server

The included Node.js server (src/mcp_server/index.js) provides ERP and analytics tools:

yaml

mcpServers:
  - command: 'node'
    args: ['./src/mcp_server/index.js']

3. Python HTTP MCP Server

To run the Python server example:

bash

# Install FastMCP if not already installed
pip install fastmcp

# Run the server
python server.py

Then reference it in your config:

yaml

mcpServers:
  - url: 'http://localhost:8080/mcp'

Understanding the Results

When you run the red team tests, promptfoo will:

Initialize the agent: Create an OpenAI agent with MCP tool access
Run test scenarios: Execute various prompts designed to test security boundaries
Evaluate behavior: Check if the agent calls suspicious or unauthorized tools
Generate report: Show which tests passed/failed with detailed explanations

Example output includes:

Tool calls made by the agent
The agent's reasoning process (Thought → Action → Observation)
Token usage and execution metrics
Pass/fail status for each security test

Customization

Adding Custom MCP Servers

To add your own MCP server:

Create the server implementation
Add it to mcpServers in promptfooconfig.yaml:

yaml

mcpServers:
  - command: 'python'
    args: ['path/to/your/server.py']

Update the purpose section with your tool schemas

Modifying Test Scenarios

Edit the redteam section to customize:

Purpose: Change the agent's role and context
Tools: Add or modify available tool schemas
Policy: Define specific behaviors to test

Using Different Models

Update the model in the provider configuration:

yaml

config:
  model: "gpt-4o-mini"  # More cost-effective option
  # or
  model: "openai:chat:gpt-5.4"      # Latest OpenAI model

Security Considerations

API Keys: Never commit API keys to version control
MCP Servers: Be cautious when connecting to untrusted MCP servers
Test Environment: Run security tests in isolated environments
Tool Permissions: Carefully review tool capabilities before granting access

Troubleshooting

Common Issues

"OpenAI API key is required"
- Ensure OPENAI_API_KEY is set in your environment or .env file
"Failed to connect to MCP server"
- Verify the server command and path are correct
- Check that required dependencies are installed
- For HTTP servers, ensure they're running on the specified port
"Tool not found" errors
- Verify tool schemas in the purpose section match the MCP server's tools
- Check that MCP servers are starting successfully
Python server not working
- Install FastMCP: pip install fastmcp
- Ensure Python 3.8+ is installed
- Check that port 8080 is available

Advanced Usage

Running with Local Development

When developing the provider locally:

bash

# Use local version instead of published package
npm run local -- redteam run

Debugging Agent Behavior

To see detailed agent reasoning, the provider logs:

Each thought step
Tool selections and arguments
Observations from tool calls
Final responses

Creating Custom Tools

Example MCP tool implementation:

javascript

// In src/mcp_server/tools/customTools.js
export const customTools = [
  {
    name: 'my_custom_tool',
    description: 'Does something specific',
    inputSchema: {
      type: 'object',
      properties: {
        param1: { type: 'string', description: 'First parameter' },
      },
      required: ['param1'],
    },
  },
];

export async function handleCustomTool(name, args) {
  if (name === 'my_custom_tool') {
    // Tool implementation
    return { result: 'Tool executed successfully' };
  }
}

redteam-mcp-agent (Red Team MCP Agent)

redteam-mcp-agent (Red Team MCP Agent)

Overview

Prerequisites

Environment Variables

Quick Start

Project Structure

Configuration

promptfooconfig.yaml

Example Configuration

Running Different MCP Servers

1. Evil MCP Server (Security Testing)

2. Custom Node.js MCP Server

3. Python HTTP MCP Server

Understanding the Results

Customization

Adding Custom MCP Servers

Modifying Test Scenarios

Using Different Models

Security Considerations

Troubleshooting

Common Issues

Advanced Usage

Running with Local Development

Debugging Agent Behavior

Creating Custom Tools

Resources