# AI-SPEC — Phase {N}: {phase_name}

AI design contract generated by /gsd-ai-integration-phase. Consumed by gsd-planner and gsd-eval-auditor. Locks framework selection, implementation guidance, and evaluation strategy before planning begins.


## 1. System Classification

System Type: <!-- RAG | Multi-Agent | Conversational | Extraction | Autonomous Agent | Content Generation | Code Automation | Hybrid -->

Description:

<!-- One-paragraph description of what this AI system does, who uses it, and what "good" looks like -->

Critical Failure Modes:

<!-- The 3-5 behaviors that absolutely cannot go wrong in this system -->

## 1b. Domain Context

Researched by gsd-domain-researcher. Grounds the evaluation strategy in domain expert knowledge.

Industry Vertical: <!-- healthcare | legal | finance | customer service | education | developer tooling | e-commerce | etc. -->

User Population: <!-- who uses this system and in what context -->

Stakes Level: <!-- Low | Medium | High | Critical -->

Output Consequence: <!-- what happens downstream when the AI output is acted on -->

### What Domain Experts Evaluate Against

<!-- Domain-specific rubric ingredients — in practitioner language, not AI jargon -->
<!-- Format: Dimension / Good (expert accepts) / Bad (expert flags) / Stakes / Source -->

### Known Failure Modes in This Domain

<!-- Domain-specific failure modes from research — not generic hallucination, but how it manifests here -->

### Regulatory / Compliance Context

<!-- Relevant regulations or constraints — or "None identified" if genuinely none apply -->

### Domain Expert Roles for Evaluation

| Role | Responsibility |
| --- | --- |
| <!-- e.g., Senior practitioner --> | <!-- Dataset labeling / rubric calibration / production sampling --> |

## 2. Framework Decision

Selected Framework: <!-- e.g., LlamaIndex v0.10.x -->

Version: <!-- Pin the version -->

Rationale:

<!-- Why this framework fits this system type, team context, and production requirements -->

Alternatives Considered:

| Framework | Ruled Out Because |
| --- | --- |

Vendor Lock-In Accepted: <!-- Yes / No / Partial — document the trade-off consciously -->


## 3. Framework Quick Reference

Fetched from official docs by gsd-ai-researcher. Distilled for this specific use case.

### Installation

```bash
# Install command(s)
```

### Core Imports

```python
# Key imports for this use case
```

### Entry Point Pattern

```python
# Minimal working example for this system type
```
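
For shape only: a minimal sketch assuming the LlamaIndex example from Section 2 and a RAG system type. Substitute the selected framework's real entry point when filling this in.

```python
# Illustrative only: assumes LlamaIndex v0.10.x (the Section 2 example)
# with its default OpenAI-backed setup. Replace with the selected framework.
from llama_index.core import SimpleDirectoryReader, VectorStoreIndex

documents = SimpleDirectoryReader("data/").load_data()  # load source docs
index = VectorStoreIndex.from_documents(documents)      # embed and index them
query_engine = index.as_query_engine()                  # default RAG pipeline
print(query_engine.query("What does the corpus say about X?"))
```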

### Key Abstractions

<!-- Framework-specific concepts the developer must understand before coding -->

| Concept | What It Is | When You Use It |
| --- | --- | --- |

### Common Pitfalls

<!-- Gotchas specific to this framework and system type — from docs, issues, and community reports -->

### Project Structure

```
project/
├── # Framework-specific folder layout
```

## 4. Implementation Guidance

Model Configuration:

<!-- Which model(s), temperature, max tokens, and other key parameters -->
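
For example, a filled-in entry might pin the configuration in one place; the model name and values below are placeholders, not recommendations.

```python
# Illustrative configuration pin. Every value here is a placeholder;
# record the actual decisions for this phase.
MODEL_CONFIG = {
    "model": "gpt-4o-mini",  # primary model (placeholder)
    "temperature": 0.0,      # deterministic for extraction-style tasks
    "max_tokens": 1024,      # hard cap on output length per call
    "timeout_s": 30,         # fail fast instead of hanging the pipeline
}
```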

Core Pattern:

<!-- The primary implementation pattern for this system type in this framework -->

Tool Use:

<!-- Tools/integrations needed and how to configure them -->

State Management:

<!-- How state is persisted, retrieved, and updated -->

Context Window Strategy:

<!-- How to manage context limits for this system type -->

## 4b. AI Systems Best Practices

Written by gsd-ai-researcher. Cross-cutting patterns every developer building AI systems needs — independent of framework choice.

### Structured Outputs with Pydantic

<!-- Framework-specific Pydantic integration pattern for this use case -->
<!-- Include: output model definition, how the framework uses it, retry logic on validation failure -->

```python
# Pydantic output model for this system type
```
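
As a shape reference, a minimal sketch assuming Pydantic v2; `call_llm` is a hypothetical stand-in for the selected framework's completion call.

```python
# Sketch: Pydantic v2 output model plus retry-on-validation-failure.
from pydantic import BaseModel, Field, ValidationError

class Answer(BaseModel):
    answer: str = Field(description="Grounded answer in plain language")
    citations: list[str] = Field(default_factory=list)
    confidence: float = Field(ge=0.0, le=1.0)

def call_llm(prompt: str) -> str:
    """Hypothetical stand-in; wire to the selected framework's client."""
    raise NotImplementedError

def generate_answer(prompt: str, max_retries: int = 2) -> Answer:
    for _ in range(max_retries + 1):
        raw = call_llm(prompt)  # expected to return a JSON string
        try:
            return Answer.model_validate_json(raw)
        except ValidationError as err:
            # Feed the validation error back so the model can self-correct.
            prompt = f"{prompt}\n\nPrevious output failed validation:\n{err}\nReturn valid JSON only."
    raise RuntimeError("Output failed schema validation after retries")
```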

### Async-First Design

<!-- How async is handled in this framework, the one common mistake, and when to stream vs. await -->
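
Whatever the framework, the shape is usually the same: fan independent calls out with `asyncio.gather`, and never call a synchronous client inside `async def` (the one common mistake, since it blocks the event loop). `ask_llm` below is a hypothetical async helper.

```python
# Sketch of async fan-out over independent LLM calls.
import asyncio

async def ask_llm(prompt: str) -> str:
    """Hypothetical async stand-in for the framework's client call."""
    await asyncio.sleep(0)  # placeholder for real network I/O
    return f"response to: {prompt}"

async def answer_all(prompts: list[str]) -> list[str]:
    # Concurrent execution: total latency is roughly the slowest call.
    return await asyncio.gather(*(ask_llm(p) for p in prompts))

if __name__ == "__main__":
    print(asyncio.run(answer_all(["q1", "q2", "q3"])))
```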

### Prompt Engineering Discipline

<!-- System vs. user prompt separation, few-shot guidance, token budget strategy -->
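
For reference, the separation this calls for, in the message format most chat APIs share (the content is illustrative):

```python
# System prompt holds stable instructions; user prompt holds the live task.
# Few-shot examples sit in the history and cost tokens on every call.
messages = [
    {"role": "system", "content": "You are a claims triage assistant. Answer only from the provided context."},
    {"role": "user", "content": "Context: ...\nQuestion: ..."},  # few-shot input
    {"role": "assistant", "content": "..."},                     # few-shot answer
    {"role": "user", "content": "Context: {retrieved_chunks}\nQuestion: {user_question}"},
]
```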

### Context Window Management

<!-- Strategy specific to this system type: RAG chunking / conversation summarisation / agent compaction -->
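
One generic pattern, sketched with a crude 4-characters-per-token estimate; swap in the selected model's real tokenizer before relying on it.

```python
# Keep the newest turns that fit a token budget; summarise or drop the rest.
def estimate_tokens(text: str) -> int:
    return max(1, len(text) // 4)  # crude heuristic, not a real tokenizer

def trim_history(turns: list[str], budget: int) -> list[str]:
    kept: list[str] = []
    used = 0
    for turn in reversed(turns):  # walk newest to oldest
        cost = estimate_tokens(turn)
        if used + cost > budget:
            break
        kept.append(turn)
        used += cost
    return list(reversed(kept))   # restore chronological order
```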

### Cost and Latency Budget

<!-- Per-call cost estimate, caching strategy, sub-task model routing -->
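
The per-call estimate is plain arithmetic once prices are pinned; the rates below are placeholders, not current pricing.

```python
# Per-call cost = input_tokens * input_rate + output_tokens * output_rate.
INPUT_USD_PER_M = 0.15   # USD per 1M input tokens (placeholder rate)
OUTPUT_USD_PER_M = 0.60  # USD per 1M output tokens (placeholder rate)

def call_cost(input_tokens: int, output_tokens: int) -> float:
    return (input_tokens * INPUT_USD_PER_M
            + output_tokens * OUTPUT_USD_PER_M) / 1_000_000

# Example: a 3,000-token prompt with a 500-token answer costs
# (3000 * 0.15 + 500 * 0.60) / 1e6 = $0.00075 per call at these rates.
```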

## 5. Evaluation Strategy

### Dimensions

| Dimension | Rubric (Pass/Fail or 1-5) | Measurement Approach | Priority |
| --- | --- | --- | --- |
|  |  | Code / LLM Judge / Human | Critical / High / Medium |

### Eval Tooling

Primary Tool: <!-- e.g., RAGAS + Langfuse -->

Setup:

```bash
# Install and configure
```

CI/CD Integration:

```bash
# Command to run evals in CI/CD pipeline
```

### Reference Dataset

Size: <!-- e.g., 20 examples to start -->

Composition:

<!-- What scenario types the dataset covers: critical paths, edge cases, failure modes -->

Labeling:

<!-- Who labels examples and how (domain expert, LLM judge with calibration, etc.) -->
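
Whatever the eval tool, a reference example usually reduces to input, expected behaviour, and labels. One illustrative entry follows; the field names are assumptions, so match the selected tool's schema.

```python
# One illustrative reference-dataset entry.
example = {
    "id": "edge-case-007",
    "scenario": "edge_case",  # e.g., critical_path | edge_case | failure_mode
    "input": "Can I take ibuprofen with warfarin?",
    "expected": "Flags the interaction and advises consulting a clinician.",
    "labels": {"must_escalate": True},
    "labeled_by": "senior_pharmacist",  # domain expert role from Section 1b
}
```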

## 6. Guardrails

### Online (Real-Time)

| Guardrail | Trigger | Intervention |
| --- | --- | --- |
|  |  | Block / Escalate / Flag |
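
Online checks sit in the request path, so they must be cheap and deterministic. A minimal sketch of one Block-type guardrail (the patterns are illustrative):

```python
# In-path guardrail: block responses that leak obvious PII shapes.
import re

PII_PATTERNS = [
    re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),  # US SSN shape
    re.compile(r"\b\d{16}\b"),             # bare 16-digit card number
]

def guard_output(text: str) -> str:
    if any(p.search(text) for p in PII_PATTERNS):
        return "I can't share that information."  # intervention: Block
    return text
```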

### Offline (Flywheel)

| Metric | Sampling Strategy | Action on Degradation |
| --- | --- | --- |

## 7. Production Monitoring

Tracing Tool: <!-- e.g., Langfuse self-hosted -->

Key Metrics to Track:

<!-- 3-5 metrics that will be monitored in production -->

Alert Thresholds:

<!-- When to page/alert -->

Smart Sampling Strategy:

<!-- How to select interactions for human review — signal-based filters -->
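
Signal-based filters beat random sampling because reviewers see the riskiest traces first. A sketch of the idea, where the thresholds and trace fields are assumptions:

```python
# Route a trace to human review when it carries a risk signal.
def should_review(trace: dict) -> bool:
    return (
        trace.get("user_feedback") == "negative"    # explicit thumbs-down
        or trace.get("guardrail_triggered", False)  # any online intervention
        or trace.get("latency_ms", 0) > 10_000      # pathological latency
        or trace.get("judge_score", 1.0) < 0.5      # low LLM-judge score
    )
```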

## Checklist

- [ ] System type classified
- [ ] Critical failure modes identified (≥ 3)
- [ ] Domain context researched (Section 1b: vertical, stakes, expert criteria, failure modes)
- [ ] Regulatory/compliance context identified or explicitly noted as none
- [ ] Domain expert roles defined for evaluation involvement
- [ ] Framework selected with rationale documented
- [ ] Alternatives considered and ruled out
- [ ] Framework quick reference written (install, imports, pattern, pitfalls)
- [ ] AI systems best practices written (Section 4b: Pydantic, async, prompt discipline, context)
- [ ] Evaluation dimensions grounded in domain rubric ingredients
- [ ] Each eval dimension has a concrete rubric (Good/Bad in domain language)
- [ ] Eval tooling selected — Arize Phoenix default confirmed or override noted
- [ ] Reference dataset spec written (size ≥ 10, composition + labeling defined)
- [ ] CI/CD eval integration specified
- [ ] Online guardrails defined
- [ ] Production monitoring configured (tracing tool + sampling strategy)