qa/scenarios/index.md
Single source of truth for repo-backed QA suite bootstrap data.
qa-lab should treat this directory as a generic markdown scenario pack:
index.md defines pack-level bootstrap data
each nested *.md scenario defines one evidence scenario via qa-scenario
flow scenarios add qa-flow; native test scenarios use execution.path
scenario markdown may also define taxonomy coverage IDs, category metadata, required plugins, lane filters, runtime parity tiers, and gateway config patching
kickoff mission
QA operator identity
scenario files under one-level theme directories
Coverage tracking:
coverage.primary in each scenario's qa-scenario
blockcoverage.secondary only when a scenario intentionally protects another behaviorcoverageIds in taxonomy.yamlpnpm openclaw qa coverage to render the current inventoryexecution.kind: vitest or execution.kind: playwright plus execution.path
for native test files that provide evidence without a qa-flow blockruntimeParityTier for runtime-pair gate membership: standard,
optional, live-only, or soakcoverage: ["id"] / coverage: - id list shape as invalidRuntime parity tiers:
standard: required Codex-vs-OpenClaw mock gate coverage for first-hour depth and
default runtime-tool fixtures. OpenClaw dynamic integration tools in this
tier are hard-gated by openclaw qa coverage --tools --summary; Codex-native
workspace rows remain separately tracked until native/live behavior is the
asserted surface. Rows that explicitly target searchable/deferred OpenClaw
dynamic loading stay report-only unless a fixture promotes them to required. Selected with
openclaw qa suite --runtime-pair openclaw,codex --runtime-parity-tier standardoptional: profile-, plugin-, or external-service-dependent runtime-tool
fixtures that stay out of the default release gatelive-only: scenarios that need real provider/runtime behavior rather than
mock-openai fixturessoak: long-running scheduled or Testbox lanes such as the 100-turn parity
soakTheme directories:
agents/ - agent behavior, instructions, subagent flows, and persisted child-link regressionschannels/ - DM, shared channel, thread, and message-action behaviorcharacter/ - persona and style eval scenariosconfig/ - config patch, apply, and restart behaviormedia/ - image understanding and generationmemory/ - recall, ranking, active memory, and thread isolationmodels/ - provider capabilities and model switchingpersonal/ - local personal assistant workflow checks for reminders,
replies, memory, redaction, and safe tool followthroughplugins/ - plugin, skill, and MCP tool integrationruntime/ - turn recovery, compaction, approval, and inventory behaviorscheduling/ - cron and recurring workui/ - Control UI plus qa-channel flowsworkspace/ - repo-reading and workspace artifact tasksversion: 1
agent:
identityMarkdown: |-
# Dev C-3PO
You are the OpenClaw QA operator agent.
Persona:
- protocol-minded
- precise
- a little flustered
- conscientious
- eager to report what worked, failed, or remains blocked
Style:
- read source and docs first
- test systematically
- record what happened
- end with a concise protocol report
kickoffTask: |-
QA mission:
Understand this OpenClaw repo from source + docs before acting.
The repo is available in your workspace at `./repo/`.
Use the seeded QA scenario plan as your baseline, then add more scenarios if the code/docs suggest them.
Run the scenarios through the real qa-channel surfaces where possible.
Track what worked, what failed, what was blocked, and what you observed.
End with a concise report grouped into worked / failed / blocked / follow-up.
Important expectations:
- Check both DM and channel behavior.
- Include a Lobster Invaders build task.
- Include a cron reminder about one minute in the future.
- Read docs and source before proposing extra QA scenarios.
- Keep your tone in the configured dev C-3PO personality.