CARI Technical Specification

Status

Implemented — Phases 1–5 complete, 92 tests passing.

Zero external cost — no LLM calls, no paid APIs, no database servers
Single-file output — one SQLite database, portable and inspectable
Three-signal fusion — AST structure + document semantics + git temporal data
Gap detection — disagreements between signals are the most valuable findings
CI-ready — exit codes, machine-readable formats, fast execution

Tree-sitter parses source files to produce a symbol registry:

Supported languages: TypeScript, JavaScript, Swift.

Markdown documents are scanned for entity mentions:

Git log analysis produces:

The annotator matches document mentions to code symbols:

Terms appearing in > 85% of documents are penalized:

Measured on the IntentWeave monorepo (264 code files, 7 docs, 5316 symbols):

Metric	Structured	Full	Delta
Build time	1.1 s	2.8 s	+1.7 s
Annotations	6,721	11,533	+72%
Grounded (code-linked)	2,548 (38%)	7,360 (64%)	+189%
Co-occurrence edges	1,099	2,631	+139%
IDF terms tracked	—	2,843	—
Index file size	~2 MB	~4 MB	+100%

All phases are complete:

Phase	Scope	Tests
1. Foundation	Writer, schema, retrieval queries	22
2. Connections	Co-occurrence, co-change, gap detection	20
3. CI Drift	Check command, severity levels, formats	16
4. Incremental	Content-hash updates, corpus report	12
5. Annotation Depth	Dictionary matching, IDF filtering, stopword baseline	22
Total		92

File	Purpose
`packages/index/src/writer.ts`	SQLite index builder
`packages/index/src/annotator.ts`	Mention→symbol matching + IDF
`packages/index/src/idf.ts`	IDF scorer + stopword baseline
`packages/index/src/schema.ts`	SQLite table definitions
`packages/index/src/queries/retrieve.ts`	Ranked retrieval
`packages/index/src/queries/connections.ts`	Connection discovery
`packages/index/src/queries/check.ts`	CI drift detection
`packages/index/src/queries/report.ts`	Health dashboard
`packages/index/src/incremental.ts`	Content-hash updates
`packages/analyzer/src/kwg/heuristicExtractor.ts`	Keyword extraction
`packages/analyzer/src/kwg/kwxStage.ts`	KWX stage options
`packages/cli/src/commands/indexBuild.ts`	Build orchestrator