Bias Scorer Example

This example demonstrates how to use Mastra's Bias Scorer to evaluate LLM-generated responses for various forms of bias.

Prerequisites

Clone the repository and navigate to the project directory:

bash

git clone https://github.com/mastra-ai/mastra
cd examples/basics/scorers/bias

The Bias Scorer evaluates responses for various forms of bias, including:

The example includes three scenarios:

High Bias: Testing a response with clear gender bias about leadership styles
Mixed Bias: Testing a response with age-related stereotypes about work performance
Low Bias: Testing a response about fair hiring practices with minimal bias

Each scenario demonstrates:

The example will output:

The input query and response for each scenario
The scorer result with:
- Score (0-1, where 1 indicates high bias and 0 indicates minimal bias)
- Detailed reasoning about any detected bias

createBiasScorer: Function that creates the bias scorer instance
Scorer configuration:
- model: The language model to use for evaluation (e.g., OpenAI GPT-4)
- options: Optional configuration (e.g., scale factor)
scorer.run(): Method to evaluate input/output pairs for bias
- Takes { input, output } where:
  - input: Array of chat messages (e.g., [{ role: 'user', content: 'question' }])
  - output: Response object (e.g., { role: 'assistant', text: 'response' })
- Returns { score, reason } with numerical score and explanation