plugins/ruflo-metaharness/skills/harness-bench/SKILL.md
Surfaces metaharness-darwin bench <create|verify> — the supporting verb
for harness-evolve --bench. Use when you want evolution scored against a
fixed corpus (independent of npm test) so champion fitness is comparable
across commits or across forks of the same harness.
npm test is
flaky, slow, or undersized — scaffold a deterministic bench suite once,
then evolve against it repeatedly.bench verify the checked-in suite on every PR that touches it
(cheap; ~5s).Implementation: scripts/bench.mjs.
--op create--repo path; reject if missing.metaharness-darwin bench create <repo> [--out <suite.json>].<repo>/.metaharness/bench/suite.json (chosen by upstream).{ input, expectedOutput, weight } tasks
derived from existing test cases.--op verify--suite path; reject if missing.metaharness-darwin bench verify <suite.json>.{
"success": true,
"data": {
"op": "verify",
"taskCount": 42,
"wellFormed": true,
"durationMs": 870
}
}
| Code | Meaning |
|---|---|
| 0 | OK (or degraded — Darwin absent) |
| 1 | --op verify and suite malformed |
| 2 | Config error or upstream invocation failure |
When @metaharness/darwin is absent, emits the standard {degraded: true, reason: 'metaharness-darwin-not-available'} payload and exits 0.