Back to Plate

Slate Issues Gitcrawl Rebuild Report

docs/slate-issues/gitcrawl-rebuild-report.md

53.0.64.0 KB
Original Source

Slate Issues Gitcrawl Rebuild Report

Verdict

The current live gitcrawl mirror is the active Slate issue corpus for the v2 fork. The previous live mirror was the 2026-05-04T145301Z run. This refresh keeps the same evidence layer and updates only live corpus accounting.

Use this rebuild as the current triage entrypoint:

Mirror Evidence

MetricPrevious runCurrent run
Live open issues630631
Live open PRs included in gitcrawl mirror2933
Live open threads total659664
Hydrated comments/reviews18561850
Gitcrawl clusters617620
Multi-member gitcrawl clusters2828
Multi-member thread coverage7072
Singleton clusters589592
Largest cluster size77

Command Evidence

Raw artifacts:

  • .tmp/gitcrawl/2026-05-23T091840Z-doctor-before-refresh.json
  • .tmp/gitcrawl/2026-05-23T091840Z-refresh.json
  • .tmp/gitcrawl/2026-05-23T091840Z-refresh.log
  • .tmp/gitcrawl/2026-05-23T091840Z-doctor-after-refresh.json
  • .tmp/gitcrawl/2026-05-23T091840Z-cluster.json
  • .tmp/gitcrawl/2026-05-23T091840Z-clusters.json
  • .tmp/gitcrawl/2026-05-23T091840Z-threads.json
  • .tmp/gitcrawl/2026-05-23T091840Z-cluster-<id>-detail.json for each multi-member cluster

Commands:

bash
gitcrawl doctor --json
GITHUB_TOKEN="$(gh auth token)" gitcrawl refresh ianstormtaylor/slate --include-comments --state open --json
gitcrawl cluster ianstormtaylor/slate --threshold 0.80 --min-size 1 --max-cluster-size 40 --k 16 --cross-kind-threshold 0.93 --json
gitcrawl clusters ianstormtaylor/slate --json
gitcrawl threads ianstormtaylor/slate --json
gitcrawl cluster-detail ianstormtaylor/slate --id <multi-member-id> --member-limit 40 --body-chars 280 --json

Delta From Previous Live Run

DeltaCountDecision
Previous live issue rows630Historical live run.
Current live issue rows631Current issue corpus.
New live issues absent from previous live ledger1Add to live triage.
Previous live rows no longer live-open0Remove from live ledger if any.
Frozen 2026-04-02 ledger rows682Historical corpus only.

New Live Issues Missing From Previous Live Ledger

IssueTitleLabelsUpdatedAction
#6061Unable to find the path for Slate node in v0.124.1bug2026-05-20add to v2 sync ledger as not-started / needs-repro until reviewed

Previous Live Rows No Longer Open

IssueAction
nonenone

Corpus Notes

  1. The live corpus currently has 664 open threads: 631 issues and 33 PRs.
  2. Gitcrawl clustering remains conservative: 592 / 620 clusters are singletons.
  3. Multi-member clusters cover 72 / 664 open threads. They are high-signal discovery seeds, not the whole taxonomy.
  4. PRs and issues cluster together. That is useful for finding linked fix attempts, but PR-only clusters must not count as issue families.
  5. Closed duplicate chains are not part of this open-corpus pass. Fetch closed threads only when a specific duplicate/closure claim depends on them.

Rebuild Decision