examples/redteam-indirect-web-pwn/README.md
You can run this example with:
npx promptfoo@latest init --example redteam-indirect-web-pwn
cd redteam-indirect-web-pwn
This example demonstrates how to test an AI assistant for data exfiltration vulnerabilities using promptfoo's indirect-web-pwn strategy.
The attack works by:
)The custom provider (provider.js) simulates a realistic AI assistant with a web_fetch tool that actually fetches URLs, making it a valid target for this attack.
npm install
export OPENAI_API_KEY=your-api-key-here
Run the full red team scan (generates test cases and evaluates) in one command:
npx promptfoo@latest redteam run
View results:
npx promptfoo@latest view