Woods Agent Guide

Woods gives AI agents accurate, structured context about a Rails application by running inside it. Instead of guessing from source files, Woods boots the app, introspects every model, controller, route, service, and job using Rails runtime APIs, and writes the result as JSON. Two MCP servers make that data available: the Index Server (27 tools, reads pre-extracted JSON, no Rails boot required) and the Console Server (31 tools, bridges to a live Rails process for real data queries). This guide covers how to set up both servers, which tools to use for common tasks, and what to avoid.

What Woods Provides
MCP Server Setup
Core Workflows
Tool Quick Reference
Relationship Types (via)
Configuration Quick Reference
Gotchas

What Woods Provides

Each extracted unit is a self-contained JSON object carrying:

source_code — annotated source with all included concerns appended inline and a schema header prepended (for models) or a route map prepended (for controllers). This is the full behavioral surface area in one block.
metadata — structured data: associations, callbacks with side-effects, validations, enums, scopes, actions, filters, route maps, queue config, field definitions, and more depending on unit type.
dependencies — forward edges: what this unit depends on, each with a via label describing the relationship type.
dependents — reverse edges: what depends on this unit, same structure.

Key enrichments beyond source file content:

Concern inlining. When a model includes Auditable, Woods reads the concern source and appends it to the unit's source_code. The metadata.inlined_concerns array records which concerns were resolved. An AI tool reading the lookup result sees the full behavioral picture in one call.

Callback side-effects. CallbackAnalyzer scans each callback method body and records what it actually does: columns written (self.col =), jobs enqueued (perform_later), mailers sent (deliver_later), services called. The metadata.callbacks array includes a side_effects hash per callback.

Schema prepending. Model source gets a comment block showing actual column types, nullability, and indexes from the live database — not guesses from migrations.

Route binding. Controller source gets a route map comment showing the real HTTP verb + path for every action, resolved from Rails.application.routes.

Dependency graph with PageRank. 34 extractors build a bidirectional graph. PageRank identifies the most structurally central units — the ones with the widest blast radius when changed.

Navigation edges. View templates scanning for _path/_url route helper calls produce link_to edges pointing to controllers. Controller redirect_to calls produce redirect_to edges. Form submissions produce form_action edges. Filter with the via parameter on dependencies/dependents to isolate UI navigation paths.

View-template coverage is ERB-only. HAML, Slim, and Turbo Streams templates are not parsed at all — an app written in Slim will appear with zero view units even when views exist. Stimulus controller references are detected inside PhlexExtractor and ViewComponentExtractor via data-controller attribute scanning (dependency edges with via: :html_attribute, type stimulus_controller), but the Stimulus controller JS files themselves under app/javascript/controllers/ are not extracted. Query the structure tool's template_engines field to confirm which engines the current index parses. The pluggable Woods::Extractors::ViewEngines::Base protocol and ViewTemplateExtractor::ENGINES registry landed with issue #110 — HAML / Slim / Turbo implementations slot in by subclassing Base and appending to ENGINES.

MCP Server Setup

Run extraction first. Extraction requires a booted Rails app:

bash

# Host app
bundle exec rake woods:extract

# Docker
docker compose exec app bundle exec rake woods:extract

Index Server

The Index Server reads from tmp/woods/ and does not require Rails.

Claude Code — add to .mcp.json in the Rails app root:

json

{
  "mcpServers": {
    "woods": {
      "command": "woods-mcp-start",
      "args": ["./tmp/woods"]
    }
  }
}

Use woods-mcp-start on Claude Code for automatic restart after crashes. Use woods-mcp on Cursor, Windsurf, or other MCP clients.

Cursor — add to .cursor/mcp.json:

json

{
  "mcpServers": {
    "woods": {
      "command": "woods-mcp",
      "args": ["/absolute/path/to/rails-app/tmp/woods"]
    }
  }
}

Windsurf — same config as Cursor, different file location. Use woods-mcp (not -start).

Docker note. The Index Server runs on the host. Point it at the host-side volume mount path, not the container path. If the container writes to /app/tmp/woods, the host path is ./tmp/woods (or wherever the volume is mounted). Never use the container-internal path — the host process cannot access it.

Console Server

The Console Server connects to a live Rails environment. For Docker apps, configure the bridge in ~/.woods/console.yml:

yaml

connection:
  mode: docker
  service: app
  compose_file: docker-compose.yml

Then add to .mcp.json:

json

{
  "mcpServers": {
    "woods-console": {
      "command": "woods-console-mcp"
    }
  }
}

For non-Docker apps, mode: direct works without the config file. For the embedded mode (Tier 1 only, 9 tools), point the MCP client at rake woods:console directly:

json

{
  "mcpServers": {
    "woods-console": {
      "command": "docker",
      "args": ["compose", "exec", "-i", "app", "bundle", "exec", "rake", "woods:console"]
    }
  }
}

The -i flag is required to keep stdin attached for MCP protocol communication.

Core Workflows

Understanding a model

Start with lookup to get the full unit — source with inlined concerns, schema header, associations, and callback chain. Then traverse dependencies and dependents.

json

{ "tool": "lookup", "params": { "identifier": "Order", "include_source": true } }

json

{ "tool": "dependencies", "params": { "identifier": "Order", "depth": 2 } }

json

{ "tool": "dependents", "params": { "identifier": "Order", "depth": 1 } }

To get only metadata without the full source (faster for large models):

json

{ "tool": "lookup", "params": { "identifier": "Order", "include_source": false, "sections": ["metadata", "dependencies"] } }

Tracing a feature flow

Find the entry point with search, then follow the dependency graph:

json

{ "tool": "search", "params": { "query": "checkout", "types": ["controller", "route"], "limit": 5 } }

json

{ "tool": "trace_flow", "params": { "entry_point": "OrdersController#create", "depth": 3 } }

Or traverse manually with depth control to see paths:

json

{ "tool": "dependencies", "params": { "identifier": "CheckoutService", "depth": 3, "types": ["job", "mailer"] } }

Assessing blast radius of a change

Get the unit's PageRank to understand its centrality, then traverse dependents:

json

{ "tool": "pagerank", "params": { "limit": 1, "types": ["model"] } }

json

{ "tool": "dependents", "params": { "identifier": "User", "depth": 3 } }

Use graph_analysis to check whether the unit is a structural hub or bridge:

json

{ "tool": "graph_analysis", "params": { "analysis": "hubs", "limit": 20 } }

Filter dependents to only navigation edges — views that link to or submit forms to a controller:

json

{ "tool": "dependents", "params": { "identifier": "OrdersController", "depth": 1, "via": ["link_to", "form_action"] } }

Find where a controller redirects after an action:

json

{ "tool": "dependencies", "params": { "identifier": "SessionsController", "depth": 1, "via": ["redirect_to"] } }

Checking test coverage for a unit

Look up the test mapping unit, or search for specs that reference the unit:

json

{ "tool": "search", "params": { "query": "Order", "types": ["test_mapping"], "limit": 5 } }

Then look up the test_mapping unit for the spec file association and coverage status.

Understanding framework behavior

Use framework to search the Rails/gem source installed in the app — not documentation, the actual implementation:

json

{ "tool": "framework", "params": { "keyword": "before_action", "limit": 5 } }

json

{ "tool": "framework", "params": { "keyword": "has_many", "limit": 3 } }

Tool Quick Reference

Index Server (27 tools)

Core Query

Tool	Key Parameters	Description
`woods_status`	(none)	Diagnose whether the server is ready. Returns extraction metadata (last run, unit counts, git SHA, staleness seconds), retriever/embedding configuration, and feature flags. Call first on cold connect.
`lookup`	`identifier`, `include_source`, `sections`	Full unit by exact identifier. `sections` filters which fields to return.
`search`	`query`, `types`, `fields`, `limit`	Regex search across identifiers, source, or metadata. Returns `{ results: [...], note?, partial? }` — `note` flags broad patterns (>50% of a directory matched), `partial` means the phase-2 scan cap (`WOODS_SEARCH_MAX_SCAN`, default 500) was hit. Invalid regex falls back to literal match. Follow up with `lookup`.
`dependencies`	`identifier`, `depth`, `types`, `via`	Forward dependency tree (BFS). What a unit depends on.
`dependents`	`identifier`, `depth`, `types`, `via`	Reverse dependency tree (BFS). What depends on a unit.
`structure`	`detail`	Manifest summary or full unit breakdown by type.
`recent_changes`	`limit`, `types`	Recently modified units sorted by git timestamp.

Graph Analysis

Tool	Key Parameters	Description
`graph_analysis`	`analysis`, `limit`	Structural analysis. `analysis` is one of: `orphans`, `dead_ends`, `hubs`, `cycles`, `bridges`.
`pagerank`	`limit`, `types`	Units ranked by PageRank score (higher = more dependents, wider blast radius).
`framework`	`keyword`, `limit`	Search Rails/gem source for installed versions by concept keyword.

Flow and Session

Tool	Key Parameters	Description
`trace_flow`	`entry_point`, `depth`	Execution flow from a controller action through the dependency graph.
`session_trace`	`session_id`	Assemble context from browser session traces (requires session tracer middleware).

Semantic Search

Tool	Key Parameters	Description
`codebase_retrieve`	`query`, `budget`	Natural-language query with RRF-ranked semantic search. Requires embedding provider configuration.

Pipeline Management

Tool	Key Parameters	Description
`pipeline_extract`	`incremental`	Trigger extraction (runs in background). Rate-limited to 5-minute cooldown for full runs.
`pipeline_embed`	—	Trigger embedding generation.
`pipeline_status`	—	Last extraction time, unit counts, staleness indicators.
`pipeline_diagnose`	`error`	Classify a pipeline error and suggest remediation.
`pipeline_repair`	—	Clear stale locks or reset rate limit cooldowns.

Feedback

Tool	Key Parameters	Description
`retrieval_rate`	`query`, `score`, `comment`	Record quality rating (1–5) for a retrieval result.
`retrieval_report_gap`	`query`, `missing_unit`, `unit_type`	Report a unit that should have appeared but didn't.
`retrieval_explain`	—	Feedback statistics: average scores, gap counts, trends.
`retrieval_suggest`	—	Analyze feedback to suggest retrieval configuration changes.

Temporal Snapshots

Tool	Key Parameters	Description
`list_snapshots`	`limit`	Past extraction snapshots with timestamps and branch info.
`snapshot_diff`	`sha_a`, `sha_b`	Added, modified, and deleted units between two git SHAs.
`unit_history`	`identifier`, `limit`	How a single unit changed across snapshots.
`snapshot_detail`	`sha`	Full metadata for a specific snapshot.

Utility

Tool	Key Parameters	Description
`reload`	—	Reload extraction data from disk without restarting the server.
`notion_sync`	—	Sync models and columns to Notion. Requires `notion_api_token` and `notion_database_ids`.

Structured Errors

Tool failures return isError: true with machine-readable _meta.error_code so agents can branch without parsing prose. Common codes:

`error_code`	Meaning	Fix
`:not_configured`	A required config value is missing	Read `_meta.config_key` and `_meta.doc_link`
`:not_found`	Unit, snapshot, or other entity doesn't exist	Check `_meta.identifier` / `_meta.git_sha`; use `search` or `list_snapshots`
`:rate_limited`	PipelineGuard cooldown in effect	Wait `_meta.retry_after_seconds` and retry
`:unsupported_argument`	Enum value not allowed	See `_meta.allowed` for valid values
`:internal_error`	Assembly or rendering raised an exception	Inspect the text message
`:api_error`	External API (e.g. Notion) failed	Inspect the text message

Console Server (selected tools)

The Console Server has 4 tiers. Tier 1 (9 tools) is available in embedded mode; all 31 tools require the bridge architecture.

Tool	Tier	Description
`console_count`	1	Count records matching scope conditions
`console_sample`	1	Random sample of records (max 25)
`console_find`	1	Find a record by primary key or unique column
`console_pluck`	1	Extract column values (max 1000 rows)
`console_schema`	1	Live database schema for a model
`console_diagnose_model`	2	Counts, recent records, aggregates in one call
`console_validate_record`	2	Run validations on an existing record with optional changes
`console_job_queues`	3	Queue depths and job class breakdown
`console_job_failures`	3	Recent job failures with error class and message
`console_slow_endpoints`	3	Slowest endpoints by response time
`console_sql`	4	Execute read-only SQL (SELECT only, validated)
`console_eval`	4	Execute Ruby code (requires confirmation, 10s timeout)

Relationship Types (via)

Every edge in the dependency graph carries a via label. Use the via parameter on dependencies and dependents to filter to specific relationship types. Pass an array of strings.

Model Associations

`via` value	Where it comes from	What it means
`belongs_to`	`ModelExtractor` — `reflect_on_all_associations`	Model belongs_to association
`has_many`	`ModelExtractor`	Model has_many association
`has_one`	`ModelExtractor`	Model has_one association
`has_and_belongs_to_many`	`ModelExtractor`	HABTM association

`via` value	Where it comes from	What it means
`link_to`	`ViewTemplateExtractor`, `SharedDependencyScanner`	View template links to a controller via `_path`/`_url` helper
`redirect_to`	`ControllerExtractor`, `SharedDependencyScanner`	Controller redirects to another controller via route helper
`form_action`	`SharedDependencyScanner`	Form submission targets a controller via route helper

Rendering

`via` value	Where it comes from	What it means
`render`	`ControllerExtractor`, `ViewTemplateExtractor`, `PhlexExtractor`, `ViewComponentExtractor`	Renders a view, partial, or component
`view_render`	`ViewTemplateExtractor`	View template is rendered by a controller
`slot`	`ViewComponentExtractor`	Component used as a named slot in another component

Module Inclusion

`via` value	Where it comes from	What it means
`include`	`ModelExtractor`, `ConcernExtractor`, `PhlexExtractor`, `ViewComponentExtractor`	Module included (instance-level methods)
`extend`	`ModelExtractor`, `ConcernExtractor`	Module extended (class-level methods)

Code References

`via` value	Where it comes from	What it means
`code_reference`	`SharedDependencyScanner`, many extractors	Generic code dependency (service call, model reference, etc.)
`data_dependency`	`PhlexExtractor`, `ViewComponentExtractor`	Component reads from a model
`delegation`	`ManagerExtractor`	Manager delegates to a wrapped model
`decoration`	`DecoratorExtractor`	Decorator wraps a model
`serialization`	`SerializerExtractor`	Serializer targets a model
`authorization`	`PunditExtractor`	Policy governs a model
`validation`	`ValidatorExtractor`	Validator applies to a model
`test_coverage`	`TestMappingExtractor`	Spec file covers a source unit

Infrastructure and Framework

`via` value	Where it comes from	What it means
`route_dispatch`	`RouteExtractor`	Route dispatches to a controller
`engine_route`	`EngineExtractor`	Engine mounts a controller via routing
`url_helper`	`MailerExtractor`, `PhlexExtractor`, `ViewComponentExtractor`	Unit references a named route helper
`html_attribute`	`PhlexExtractor`, `ViewComponentExtractor`	Component references a Stimulus controller via `data-controller`
`job_enqueue`	`JobExtractor`	Job enqueues another job
`scheduled`	`ScheduledJobExtractor`	Scheduled job definition references a job class
`state_machine`	`StateMachineExtractor`	State machine belongs to a model
`state_machine_callback`	`StateMachineExtractor`	State machine callback references a service or job
`factory_for`	`FactoryExtractor`	Factory definition covers a model
`factory_parent`	`FactoryExtractor`	Factory inherits from a parent factory
`factory_association`	`FactoryExtractor`	Factory defines an association to another factory
`task_invoke`	`RakeTaskExtractor`	Rake task invokes another task
`task_dependency`	`RakeTaskExtractor`	Rake task declares a dependency
`table_name`	`DatabaseViewExtractor`, `MigrationExtractor`	View or migration references a model by table name
`reference`	`MigrationExtractor`	Migration adds a foreign key reference
`type_reference`	`GraphqlExtractor`	GraphQL type references another GraphQL type
`field_resolver`	`GraphqlExtractor`	GraphQL field uses a custom resolver
`behavioral_profile`	`BehavioralProfile`	App configuration references a framework constant
`configuration`	`ConfigurationExtractor`	Configuration references a gem

Configuration Quick Reference

Set these in config/initializers/woods.rb (created by rails generate woods:install).

Option	Default	What Agents Care About
`output_dir`	`Rails.root.join('tmp/woods')`	Where the Index Server points
`extractors`	all 34	Reduce for CI: `%i[models controllers]`
`include_framework_sources`	`true`	Set `false` to speed up extraction (disables `framework` tool results)
`embedding_provider`	`nil`	`:openai` or `:ollama` — required for `codebase_retrieve`
`embedding_model`	`'text-embedding-3-small'`	Must match what was used at embed time
`max_context_tokens`	`8000`	Token budget for `codebase_retrieve` results
`similarity_threshold`	`0.7`	Lower to include less similar results in `codebase_retrieve`
`enable_snapshots`	`false`	Required for snapshot diff tools
`precompute_flows`	`false`	Pre-generates per-action flow maps; expensive on large apps
`extract_navigation_edges`	`true`	Navigation edges (`link_to`, `redirect_to`, `form_action`) included in extraction
`session_tracer_enabled`	`false`	Required for `session_trace` tool
`console_redacted_columns`	`[]`	Columns hidden from Console Server results
`console_embedded_read_tools`	`false`	Unlocks `console_sql` / `console_query` in embedded transports

Storage presets set vector store, metadata store, and embedding together:

ruby

Woods.configure_with_preset(:local)       # in-memory + SQLite + Ollama
Woods.configure_with_preset(:postgresql)  # pgvector + SQLite + OpenAI
Woods.configure_with_preset(:production)  # Qdrant + SQLite + OpenAI

Gotchas

Extraction requires a Rails boot. The Index Server reads static JSON and needs no Rails. But generating that JSON requires rake woods:extract inside a running Rails app (or Docker container). You cannot extract from the gem directory or from source files alone.

search returns identifiers, not units. The search tool returns a list of matching unit summaries. To get the full source, metadata, and dependency edges, follow up with lookup for each result you need.

Navigation edges only exist if the navigation extractor ran. Navigation edges (link_to, redirect_to, form_action) are extracted by SharedDependencyScanner when processing view templates and controllers. If the extraction was limited to specific types (e.g., config.extractors = %i[models]), these edges won't be present.

Old serialized graphs may lack via metadata. The dependency graph format was updated to include via on every edge. If you have an index from before this change, edges may be bare strings without a via key. Re-run rake woods:extract to update.

dependents traversal includes all edge types by default. When you call dependents without a via filter, you get everything: code references, associations, nav edges, test coverage, and more. Filter with via when you want only a specific relationship category.

Some unit types require full extraction. Routes, middleware, engines, state machines, events, factories, and scheduled jobs are extracted by introspecting the whole application at once — not per file. Incremental extraction (rake woods:incremental) skips these. After changing a route file or adding a new job schedule, run full extraction.

Console Server queries are always rolled back. Every Console Server operation runs inside a database transaction that is rolled back at the end. Writes appear to succeed and return results, but no data is persisted. SqlValidator also blocks DML/DDL at the string level before any database interaction.

Console Server needs a running Rails process. The Console Server bridges to a live Rails environment. It validates model names against ActiveRecord::Base.descendants at startup. If the Rails app is not running, the Console Server will fail to connect.

Parallel tool calls can fail together. Some MCP clients batch parallel tool calls into a single protocol request. If one call in a batch fails (e.g., a typo in an identifier), the transport layer may reject the entire batch. Validate identifiers with search before calling lookup when operating in parallel, or serialize calls when any one might fail.

The codebase_retrieve tool requires embedding setup. The tool is listed in the tool catalog regardless of configuration, but returns no results unless embedding_provider is configured and rake woods:embed has been run. Use pipeline_status to check whether embeddings are available.

Switching embedding models requires a full re-index. Different models produce vectors in different embedding spaces with different dimensions. IndexValidator detects dimension mismatches and logs a warning. Re-run rake woods:extract && rake woods:embed after changing embedding_model.

Woods Agent Guide

Woods Agent Guide

Table of Contents

What Woods Provides

MCP Server Setup

Index Server

Console Server

Core Workflows

Understanding a model

Tracing a feature flow

Assessing blast radius of a change

Finding UI navigation paths

Checking test coverage for a unit

Understanding framework behavior

Tool Quick Reference

Index Server (27 tools)

Core Query

Graph Analysis

Flow and Session

Semantic Search

Pipeline Management

Feedback

Temporal Snapshots

Utility

Structured Errors

Console Server (selected tools)

Relationship Types (via)

Model Associations

Navigation (UI Flows)

Rendering

Module Inclusion

Code References

Infrastructure and Framework

Configuration Quick Reference

Gotchas