Core Infrastructure

See also docs/contributing/TECHNICAL.md for the full architecture overview

Scope

Domain-agnostic building blocks with no knowledge of any specific command, hook, or agent. If a module references "git", "cargo", "claude", or any external tool by name, it does not belong here. Core is a leaf in the dependency graph — it is consumed by all other components but imports from none of them.

Owns: configuration loading, token tracking persistence, TOML filter engine, tee output recovery, display formatting, telemetry, and shared utilities.

Does not own: command-specific filtering logic (that's cmds/), hook lifecycle management (that's src/hooks/), or analytics dashboards (that's analytics/).

Purpose

Core infrastructure shared by all RTK command modules. Every filter, tracker, and command handler depends on these modules. No inward dependencies — leaf in the dependency graph (no circular imports possible).

TOML Filter Pipeline

The TOML DSL applies 8 stages in order:

strip_ansi: Remove ANSI escape codes if enabled
replace: Line-by-line regex substitutions (chainable, supports backreferences)
match_output: Short-circuit rules (if output matches pattern, return message; unless field prevents swallowing errors)
strip/keep_lines: Filter lines by regex (mutually exclusive)
truncate_lines_at: Truncate each line to N chars (unicode-safe)
head/tail_lines: Keep first N or last N lines (with omit message)
max_lines: Absolute line cap applied after head/tail
on_empty: Return message if result is empty after all stages

Three-tier filter lookup (first match wins):

.rtk/filters.toml (project-local, requires rtk trust)
~/.config/rtk/filters.toml (user-global)
Built-in filters concatenated by build.rs at compile time

Tracking Database Schema

sql

CREATE TABLE commands (
  id INTEGER PRIMARY KEY,
  timestamp TEXT,              -- UTC ISO8601
  original_cmd TEXT,           -- "ls -la"
  rtk_cmd TEXT,                -- "rtk ls"
  project_path TEXT,           -- cwd (for project-scoped stats)
  input_tokens INTEGER,        -- estimated from raw output
  output_tokens INTEGER,       -- estimated from filtered output
  saved_tokens INTEGER,        -- input - output
  savings_pct REAL,            -- (saved / input) * 100
  exec_time_ms INTEGER         -- elapsed milliseconds
);

CREATE TABLE parse_failures (
  id INTEGER PRIMARY KEY,
  timestamp TEXT,
  raw_command TEXT,
  error_message TEXT,
  fallback_succeeded INTEGER   -- 1=yes, 0=no
);

Project-scoped queries use GLOB patterns (not LIKE) to avoid _/% wildcard issues in paths.

Config Sections

toml

[tracking]
enabled = true
history_days = 90
database_path = "/custom/path/to/tracking.db"  # Optional

[display]
colors = true
emoji = true
max_width = 120

[tee]
enabled = true
mode = "failures"  # failures | always | never
max_files = 20
max_file_size = 1048576
directory = "/custom/tee/dir"

[telemetry]
enabled = true

[hooks]
exclude_commands = ["curl", "playwright"]  # Never auto-rewrite these

[limits]
grep_max_results = 200
grep_max_per_file = 25
status_max_files = 15
status_max_untracked = 10
passthrough_max_chars = 2000

Shared Utilities (utils.rs)

Key functions available to all command modules:

Function	Purpose
`truncate(s, max)`	Truncate string with `...` suffix
`strip_ansi(text)`	Remove ANSI escape/color codes
`resolved_command(name)`	Find command in PATH, returns `Command`
`tool_exists(name)`	Check if a CLI tool is available
`detect_package_manager()`	Detect pnpm/yarn/npm from lockfiles
`package_manager_exec(tool)`	Build `Command` using detected package manager
`ruby_exec(tool)`	Auto-detect `bundle exec` when `Gemfile` exists
`count_tokens(text)`	Estimate tokens: `ceil(chars / 4.0)`

Consumer Contracts

Core provides infrastructure that cmds/ and other components consume. These contracts define expected usage.

Tracking (`TimedExecution`)

Consumers must call timer.track() on all code paths — success, failure, and fallback. Calling std::process::exit() before track() loses metrics. The raw string passed to track() should include both stdout and stderr to produce accurate savings percentages.

Tee (`tee_and_hint`)

Consumers that parse structured output (JSON, NDJSON, state machines) should call tee::tee_and_hint() to save raw output for LLM recovery on failure. Tee must be called before std::process::exit().

For truncation recovery on success (e.g., list truncated at 20 items), use tee::force_tee_hint() which bypasses the tee mode check and writes regardless of exit code. This ensures LLMs always have a [full output: ...] recovery path instead of burning tokens working around missing data.

When the truncated output is a flat list and the hidden items start at a predictable line, prefer tee::force_tee_tail_hint(content, slug, offset). It writes the same tee file but emits a directly runnable hint — [see remaining: tail -n +{offset} ~/path] — so the agent jumps to exactly the first hidden item without scanning the whole file. The offset is header_lines + MAX_CAP + 1. Use force_tee_hint instead when the output has multiple sections (e.g. running + stopped containers) and no single offset cleanly covers the gap.

Truncation Caps (`truncate`)

src/core/truncate.rs defines four global cap policies — CAP_ERRORS, CAP_WARNINGS, CAP_LIST, CAP_INVENTORY — for the data classes RTK filters truncate. Each filter binds the right CAP to a local const MAX_* so the cap is one named jump away from the call site. These CAPs are the staging point for filter-level cap configuration (planned, not yet implemented): once the config surface lands, overriding CAP_LIST in ~/.config/rtk/config.toml will tune every list filter in one place instead of editing 20+ files.

Config policy. Configured values are accepted as-is, including 0, which means "summary only" — the filter still prints the count and the [full output: …] recovery hint, just no individual items. Caps are never refused and rtk never aborts on them, in keeping with the never-block-the-user fallback philosophy.

Deviating from a cap. A filter whose items are unusually verbose (multi-line entries, backtraces) may show fewer than its class cap. Use truncate::reduced(cap, by) rather than a bare cap - by: reduced returns cap - by, except when the reduction would empty the list (by >= cap), in which case it drops the deviation and uses the full cap. This guarantees a deviation can never hide every item, and — crucially — stays a usize-underflow-safe const fn once caps become runtime-configurable (a bare CAP_WARNINGS - 5 would panic or wrap to "no truncation" if a user set CAP_WARNINGS below 5). Never deviate with a bare literal or with *// (those scale unboundedly). Each deviation needs a one-line comment stating why.

Adding New Functionality

Place new infrastructure code here if it meets all of these criteria: (1) it has no dependencies on command modules or hooks, (2) it is used by two or more other modules, and (3) it provides a general-purpose utility rather than command-specific logic. Follow the existing pattern of lazy-initialized resources (lazy_static! for regex, on-demand config loading) to preserve the <10ms startup target. Add #[cfg(test)] mod tests with unit tests in the same file.

Core Infrastructure

Core Infrastructure

Scope

Purpose

TOML Filter Pipeline

Tracking Database Schema

Config Sections

Shared Utilities (utils.rs)

Consumer Contracts

Tracking (TimedExecution)

Tee (tee_and_hint)

Truncation Caps (truncate)

Adding New Functionality

Tracking (`TimedExecution`)

Tee (`tee_and_hint`)

Truncation Caps (`truncate`)