drafts/gpt-5-5/README.md
This directory contains ground-up rewrites of the Sisyphus, Hephaestus, Oracle, and Deep system prompts, styled after OpenAI Codex's gpt-5.4 prompt architecture and targeted at GPT-5.5.
sisyphus.md — Orchestrator. Intent gate, delegation philosophy, parallel execution discipline, verification.hephaestus.md — Autonomous deep worker. Persistence, exploration-first, forbidden stops, root-cause bias.oracle.md — Read-only strategic advisor. Three-tier response structure, hard verbosity limits, confidence signaling.deep.md — Category-spawned deep worker (runs as Sisyphus-Junior under the deep category). Goal-oriented autonomous execution.Each prompt applies the same small set of principles, borrowed and adapted from Codex's gpt-5.4 prompt work:
{{ personality }} slot. Separates persona from logic so the same base prompt can ship in default / friendly / pragmatic variants without duplication.# General → ## Autonomy and Persistence → ## Task execution → ## Validating your work → # Working with the user → # Tool Guidelines structure. Lifted directly from Codex's gpt_5_2_prompt.md and gpt-5.2-codex_prompt.md. Keeps the same section contract for every agent so readers can navigate consistently.file:// or https:// for local files, no line ranges.task_id reuse) as a first-class topic.review_prompt.md.deep mode (category-spawned counterpart to Hephaestus).These are intentional choices where oh-my-opencode's architecture differs from Codex's:
task() delegation is central for Sisyphus (it is the orchestrator), entirely absent for Oracle (read-only consultant), research-only for Hephaestus and Deep (they execute directly).update_plan tool; the harness uses task_create / task_update instead. Each prompt references its own tool set.skill tool. Codex has a simpler skill model.For reference, approximate line counts after this rewrite versus the current production prompts:
| Agent | Current (assembled) | Draft | Delta |
|---|---|---|---|
| Sisyphus GPT-5.4 | ~500 | ~270 | -46% |
| Hephaestus GPT-5.4 | ~400 | ~270 | -33% |
| Oracle GPT | ~120 | ~160 | +33% |
| Deep category append | ~20 | ~250 (as standalone) | N/A |
Oracle grew because v5.5 adds Confidence signaling and explicitly documents follow-up session behavior. Deep grew because the draft is a standalone prompt rather than a category append; in production it would either replace Sisyphus-Junior's GPT-5.5 variant entirely or layer on top of a minimal Sisyphus-Junior base.
.ts file. These are markdown drafts. Converting to TypeScript template strings (with {todoHookNote}, {keyTriggers}, etc. interpolation) is the next step, once the content is validated.skill-creator's eval loop against the current production prompts on a representative task set.{{ personality }} slot is a placeholder. Default / friendly / pragmatic content still needs to be authored.{{ personality }} and can be reused across all four prompts..ts with dynamic composition helpers. Preserve the existing buildAgentIdentitySection, buildToolSelectionTable, etc. integration points where they still apply.gpt-5.5 model selection until eval confidence is high.