.agents/skills/heterogeneous-agent/references/debug-workflow.md
CLI raw stdout
-> HeterogeneousAgentCtr (Electron main)
-> heteroAgentRawLine broadcast
-> createAdapter(...)
-> executeHeterogeneousAgent(...)
-> persistToolBatch / persistToolResult
-> createGatewayEventHandler(...)
-> UI hydration
Start at the leftmost broken layer. Do not jump straight to UI rendering unless raw and adapted events already look correct.
Use a read-only prompt and save traces under the repo-local scratch directory .heerogeneous-tracing/.
ts=$(date +%Y%m%d-%H%M%S)
out=".heerogeneous-tracing/codex-${ts}.jsonl"
last=".heerogeneous-tracing/codex-${ts}.last.txt"
cat << 'EOF' | codex exec --json --skip-git-repo-check --sandbox read-only -C "$PWD" -o "$last" - > "$out"
You are being run only to collect a raw Codex JSON event trace.
Do not modify any files.
Use at least 4 separate shell tool invocations, one invocation per command.
Run a short sequence of read-only repo checks and then reply with a one-sentence summary.
EOF
What to look for in the JSONL:
thread.startedturn.starteditem.started / item.completeditem.type === 'command_execution'item.type === 'agent_message'turn.completedIf raw Codex already merges tools into one item, the adapter is innocent. If raw Codex emits independent items but UI collapses them, the bug is downstream.
If the repo already contains useful traces under .heerogeneous-tracing/, inspect them before reproducing.
Mirror the arguments from apps/desktop/src/main/modules/heterogeneousAgent/drivers/claudeCode.ts.
-p--input-format stream-json--output-format stream-json--verbose--include-partial-messages--permission-mode bypassPermissionsYou can capture a local raw trace like this:
ts=$(date +%Y%m%d-%H%M%S)
out=".heerogeneous-tracing/claude-${ts}.ndjson"
cat << 'EOF' | claude -p \
--input-format stream-json \
--output-format stream-json \
--verbose \
--include-partial-messages \
--permission-mode bypassPermissions \
> "$out"
{"type":"user","message":{"role":"user","content":[{"type":"text","text":"Do a few read-only repo checks, use several tool calls, and then summarize briefly."}]}}
EOF
What to look for in Claude Code raw traces:
type: 'system', subtype: 'init'type: 'assistant' blocks for thinking, tool_use, and texttype: 'user' blocks containing tool_resulttype: 'stream_event' with message_start, content_block_delta, and message_deltatype: 'result'type: 'rate_limit_event'Important Claude Code semantics:
message.id; that is still one turn.message.id change is the main-step boundary.message_delta.usage is the authoritative per-turn usage.parent_tool_use_id.If the repo already contains useful references, inspect these first:
.heerogeneous-tracing/cc-monitor-real-trace.jsonl.heerogeneous-tracing/cc-stream-chain-reference.mdIf you only need boundary semantics or tool persistence behavior, prefer existing adapter tests under:
packages/heterogeneous-agents/src/adapters/claudeCode.test.tspackages/heterogeneous-agents/src/adapters/claudeCode.e2e.test.tsIn dev builds, executeHeterogeneousAgent stores raw lines plus adapted events on:
window.__HETERO_AGENT_TRACEUse that trace to compare:
item.started / item.completedstream_chunk { chunkType: 'tools_calling' }tool_resulttool_endFor Codex, the usual mapping is:
item.started(command_execution) -> tools_calling + tool_startitem.completed(command_execution) -> tool_result + tool_enditem.completed(agent_message) -> stream_chunk(text)If the raw trace is right but adapted events are wrong, fix the adapter before touching persistence.
This is the first thing to verify for "mixed tools in one assistant" bugs.
Claude Code step boundaries are keyed off assistant message.id changes. The adapter should emit:
stream_endstream_start { newStep: true }Also verify these Claude-specific invariants:
message.id do not open a new stepcontent_block_delta text/thinking does not get duplicated by the later full assistant eventtool_result from type: 'user' updates the matching tool rowparent_tool_use_id creates thread-scoped subagent chunks instead of main-stream chunkstool_use.input is converted into synthesized pluginState.todos on tool_resultGood references:
packages/heterogeneous-agents/src/adapters/claudeCode.tspackages/heterogeneous-agents/src/adapters/claudeCode.test.tsCodex raw traces usually provide turn-level boundaries through:
turn.startedturn.completedThe executor only cuts a new assistant message when it receives a step-boundary signal it understands. If the adapter emits stream_start without newStep, multiple Codex tools and text chunks can accumulate under the same assistant longer than intended.
Relevant files:
packages/heterogeneous-agents/src/adapters/codex.tssrc/store/chat/slices/aiChat/actions/heterogeneousAgentExecutor.tsRead persistToolBatch and persistToolResult before changing UI code.
persistToolBatchThe expected order is:
tools[]role: 'tool' messagesresult_msg_id onto assistant tools[]If tool rows are created before assistant tools[] are registered, orphan tool messages are likely.
persistToolResulttool_result must resolve the tool row through toolMsgIdByCallId.
Warning signs:
tool_result for unknown toolCallIdresult_msg_idFor Claude Code, remember that tool results originate from raw type: 'user' events.
toolMsgIdByCallId is global across main and subagent scopes.If subagent events leak to the main handler, the main bubble can inherit the wrong tools[] and content.
Run the smallest useful test set first.
bunx vitest run --silent='passed-only' 'packages/heterogeneous-agents/src/adapters/codex.test.ts'
bunx vitest run --silent='passed-only' 'packages/heterogeneous-agents/src/adapters/claudeCode.test.ts'
bunx vitest run --silent='passed-only' 'src/store/chat/slices/aiChat/actions/__tests__/heterogeneousAgentExecutor.test.ts'
Especially useful places:
packages/heterogeneous-agents/src/adapters/codex.test.tspackages/heterogeneous-agents/src/adapters/claudeCode.test.tssrc/store/chat/slices/aiChat/actions/__tests__/heterogeneousAgentExecutor.test.tsClaude Code-specific assertions worth adding when fixing bugs:
message.id does not emit newStepmessage.id does emit stream_end plus stream_start { newStep: true }tool_result from user events reaches the right tool rowsubagent.parentToolCallIdpluginState.todosWhen the bug comes from a real trace, distill it into the closest existing test file instead of relying on manual UI-only repros.
.heerogeneous-tracing/.local-testing skill if UI confirmation is still needed.Do not start with a broad Electron repro if a raw trace or adapter test can prove the fault zone faster.