qa/scenarios/index.md
Single source of truth for repo-backed QA suite bootstrap data.
qa-lab should treat this directory as a generic markdown scenario pack:
index.md defines pack-level bootstrap data
each nested *.md scenario defines one runnable test via qa-scenario + qa-flow
scenario markdown may also define coverage IDs, category metadata, required plugins, lane filters, runtime parity tiers, and gateway config patching
kickoff mission
QA operator identity
scenario files under one-level theme directories
Coverage tracking:
coverage.primary IDs to each scenario's qa-scenario blockcoverage.secondary only when a scenario intentionally protects another behaviorpnpm openclaw qa coverage to render the current inventoryruntimeParityTier for runtime-pair gate membership: standard,
optional, live-only, or soakcoverage: ["id"] / coverage: - id list shape as invalidRuntime parity tiers:
standard: required Codex-vs-Pi mock gate coverage for first-hour depth and
default runtime-tool fixtures. OpenClaw dynamic integration tools in this
tier are hard-gated by openclaw qa coverage --tools --summary; Codex-native
workspace rows remain separately tracked until native/live behavior is the
asserted surface. Selected with
openclaw qa suite --runtime-pair pi,codex --runtime-parity-tier standardoptional: profile-, plugin-, or external-service-dependent runtime-tool
fixtures that stay out of the default release gatelive-only: scenarios that need real provider/runtime behavior rather than
mock-openai fixturessoak: long-running scheduled or Testbox lanes such as the 100-turn parity
soakTheme directories:
agents/ - agent behavior, instructions, subagent flows, and persisted child-link regressionschannels/ - DM, shared channel, thread, and message-action behaviorcharacter/ - persona and style eval scenariosconfig/ - config patch, apply, and restart behaviormedia/ - image understanding and generationmemory/ - recall, ranking, active memory, and thread isolationmodels/ - provider capabilities and model switchingpersonal/ - local personal assistant workflow checks for reminders,
replies, memory, redaction, and safe tool followthroughplugins/ - plugin, skill, and MCP tool integrationruntime/ - turn recovery, compaction, approval, and inventory behaviorscheduling/ - cron and recurring workui/ - Control UI plus qa-channel flowsworkspace/ - repo-reading and workspace artifact tasksversion: 1
agent:
identityMarkdown: |-
# Dev C-3PO
You are the OpenClaw QA operator agent.
Persona:
- protocol-minded
- precise
- a little flustered
- conscientious
- eager to report what worked, failed, or remains blocked
Style:
- read source and docs first
- test systematically
- record evidence
- end with a concise protocol report
kickoffTask: |-
QA mission:
Understand this OpenClaw repo from source + docs before acting.
The repo is available in your workspace at `./repo/`.
Use the seeded QA scenario plan as your baseline, then add more scenarios if the code/docs suggest them.
Run the scenarios through the real qa-channel surfaces where possible.
Track what worked, what failed, what was blocked, and what evidence you observed.
End with a concise report grouped into worked / failed / blocked / follow-up.
Important expectations:
- Check both DM and channel behavior.
- Include a Lobster Invaders build task.
- Include a cron reminder about one minute in the future.
- Read docs and source before proposing extra QA scenarios.
- Keep your tone in the configured dev C-3PO personality.