Back to Ruflo

ADR-075: Wire Self-Learning Pipeline End-to-End

v3/implementation/adrs/ADR-075-wire-learning-pipeline.md

3.6.302.9 KB
Original Source

ADR-075: Wire Self-Learning Pipeline End-to-End

Status: Implemented Date: 2026-04-06 Branch: feat/wire-learning-pipeline

Context

The learning pipeline had all building blocks (SONA, LocalReasoningBank, ONNX embeddings, trajectory recording, pattern search, file persistence) but they weren't connected end-to-end. Trajectories recorded without embeddings, so distillation produced 0 patterns. The memory bridge tried to use AgentDB's ReasoningBank/LearningSystem but they require a better-sqlite3 db handle that ControllerRegistry doesn't expose.

Decision

  1. Auto-generate embeddings in recordTrajectory — each step gets ONNX embedding (all-MiniLM-L6-v2, 384d) before distillation
  2. Store successful trajectory steps as patterns — directly into LocalReasoningBank with confidence scoring
  3. Wire intelligence module into hookssession-start initializes SONA+ReasoningBank, post-task records trajectories
  4. Wire intelligence into memory bridge — bridge registers LocalReasoningBank and SONA as controllers (bypassing AgentDB's constructor requirements)
  5. Add patternsLearned and signalsProcessed to global stats

Verification

CapabilityBeforeAfterEvidence
Trajectory recordingWorked but no patterns9 patterns from 3 trajectoriesstats.patternsLearned: 9
Embedding generationNot called during trajectoryAuto-generated per stepONNX all-MiniLM-L6-v2, 384d
Pattern search0 resultsCorrect semantic matches"auth" → JWT pattern (0.584), "security" → SQL injection (0.661)
Pattern persistenceFile never createdpatterns.json writtenpersist.patternsExist: true
SONA coordinatorInitialized but idleActive with trajectoriesstats.sonaEnabled: true
Intelligence in hooksNot wiredsession-start inits, post-task recordsAutomatic on every session

Architecture

Hook fires (session-start / post-task)
    │
    ├── intelligence.initializeIntelligence()
    │   ├── LocalSonaCoordinator (RL, <0.05ms adaptation)
    │   └── LocalReasoningBank (pattern storage + file persistence)
    │
    ├── intelligence.recordTrajectory(steps, verdict)
    │   ├── generateEmbedding(step.content)  ← ONNX all-MiniLM-L6-v2
    │   ├── sonaCoordinator.recordTrajectory()
    │   ├── sonaCoordinator.endTrajectory() → RL reward
    │   ├── sonaCoordinator.distillLearning() → EWC consolidation
    │   └── reasoningBank.store(pattern) → file persistence
    │
    └── intelligence.findSimilarPatterns(query)
        └── reasoningBank.findSimilar(embedding, {k, threshold})

References

  • Intelligence module: v3/@claude-flow/cli/src/memory/intelligence.ts
  • Memory bridge: v3/@claude-flow/cli/src/memory/memory-bridge.ts
  • Hooks wiring: v3/@claude-flow/cli/src/mcp-tools/hooks-tools.ts