docs/workspace/test-area-auto-iterate-one-round/README.md
本目录记录“复用现有多槽位测试区,为目标模型引入一轮自动提示词迭代”的设计与过程。 本期不新建独立的 SPO/实验页面,而是在现有测试区内增加自动迭代入口与四槽位预设。
说明:目录名保留了最初的“一轮自动迭代”提法,但目录内容已经扩展为三部分:
- 一轮自动迭代的早期方案
- compare evaluation 的通用增强
- 薄
SPO的上层封装设计
./task_plan.md./findings.md./progress.mddocs/architecture/structured-compare-and-evaluation-rewrite.mddocs/architecture/spo-thin-loop-ui-and-stop-rules.md./implementation-split-compare-stop-signals-and-spo.mddocs/architecture/test-area-auto-iterate-one-round.mddocs/architecture/structured-compare-and-evaluation-rewrite.mddocs/architecture/spo-thin-loop-ui-and-stop-rules.mddocs/workspace/compare-evaluation-analysis/current-spec.mddocs/architecture/test-area-version-model-selection.mdbasic-system 优先workspace 提示词,不自动保存历史版本目标模型 + 参考模型 自动生成 4 槽位测试预设v-lastSPO 设计以“薄编排层”为目标,重点是多轮循环、停止条件和结果展示,而不是新增 judge 能力Generic CompareStructured Comparerewrite from evaluationstop signalsdocs/architecture/,本目录只保留任务计划、过程记录和拆分说明,避免同主题双份文档继续漂移。docs/workspace/compare-evaluation-analysis/current-spec.mddocs/architecture/structured-compare-and-evaluation-rewrite.mddocs/architecture/spo-thin-loop-ui-and-stop-rules.md./implementation-split-compare-stop-signals-and-spo.mddocs/archives/<id>-test-area-auto-iterate-one-round/