docs/dolt-health-guide.md
This guide covers evidence capture for Dolt outages and Gas Town behavior mismatches that look like Dolt trouble.
Use this checklist when any of these happen:
bd commands hang, time out, or return unexpected empty results.gt dolt status reports unhealthy server state, high latency, stale PIDs, or
orphan test databases.Do not restart Dolt before collecting diagnostics. A blind restart can destroy the state needed to explain the incident.
Capture non-fatal diagnostics first:
gt dolt dump 2>&1 | tee /tmp/dolt-hang-$(date +%s).log
gt dolt status 2>&1 | tee /tmp/dolt-status-$(date +%s).log
Then escalate with the evidence path:
gt escalate -s HIGH "Dolt: <symptom>" -m "Evidence: /tmp/dolt-status-..."
Attach this checklist to the escalation body, the follow-up bead, or the war-room
entry. Use N/A only when a field truly does not apply to a non-Dolt behavior
mismatch.
### RCA Capture
- Trigger command:
- Concurrent GT processes:
- Dolt pid/status:
- Stale pid status:
- Orphan test server status:
- Suspected GT code path:
- Expected behavior:
- Observed behavior:
- Evidence source:
- Likely root cause:
- Smallest fix direction:
gt dolt status or gt dolt dump.testdb_*, beads_t*, beads_pt*, or doctest_*.For documentation-only RCA work, use this smoke check to verify the checklist is available and wired into the escalation path:
test -f docs/dolt-health-guide.md
grep -n "Trigger command" docs/dolt-health-guide.md
grep -n "RCA capture checklist" internal/templates/townroot/claude.md docs/design/escalation.md