Semantic search over your repository. Finds the exact functions, classes, and dependencies a task needs, then packs them into AI-ready XML.
From raw repository to AI-ready context in 4 stages
Respects .gitignore. Lazy repomix XML. Collects only changed files on re-index.
TypeScript AST extractor. Functions, classes, interfaces, types. Smart chunking for large functions.
all-MiniLM-L6-v2 (384d) via transformers.js. SHA256 cache. Cross-encoder reranking with an ms-marco model.
Repomix-compatible XML. Import graph expansion. [RELATED] markers. Truncation where code is irrelevant.
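As a rough illustration of the SHA256 cache in the embedding stage, the sketch below keys each vector on the hash of the chunk text, so unchanged code is never embedded twice. The names `EmbeddingCache`, `Embedder`, and `keyFor` are hypothetical, the embedder is injected and synchronous for brevity (a real model call would be async), and the store is an in-memory `Map` rather than the on-disk cache the tool describes.

```typescript
import { createHash } from "node:crypto";

// Hypothetical embedder signature. A real one would wrap the embedding model;
// it is injected here so the sketch runs without downloading anything.
type Embedder = (text: string) => number[];

class EmbeddingCache {
  private store = new Map<string, number[]>();
  private embed: Embedder;
  hits = 0;
  misses = 0;

  constructor(embed: Embedder) {
    this.embed = embed;
  }

  // The cache key is the SHA256 hex digest of the chunk text.
  static keyFor(text: string): string {
    return createHash("sha256").update(text, "utf8").digest("hex");
  }

  // Return the cached vector if the text was seen before, else embed and store it.
  get(text: string): number[] {
    const key = EmbeddingCache.keyFor(text);
    const cached = this.store.get(key);
    if (cached !== undefined) {
      this.hits++;
      return cached;
    }
    this.misses++;
    const vector = this.embed(text);
    this.store.set(key, vector);
    return vector;
  }
}
```

Because the key depends only on content, moving or renaming a file does not invalidate its chunks; only edited code misses the cache.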
Built for teams that ship fast with AI assistance
Only re-indexes changed files via git diff. Full rebuild takes seconds, not minutes. SHA256 embedding cache persists across runs.
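One way to picture the git-diff step: parse the output of `git diff --name-status` into files to re-index and files to drop. This is a minimal sketch, not the tool's actual code; `ChangeSet` and `parseNameStatus` are hypothetical names, and it assumes the tab-separated status format that git prints by default.

```typescript
interface ChangeSet {
  reindex: string[]; // files whose chunks must be re-extracted and re-embedded
  remove: string[];  // files whose chunks must be dropped from the index
}

// Parse `git diff --name-status` output, e.g. "M\tsrc/a.ts" or "R100\told.ts\tnew.ts".
function parseNameStatus(output: string): ChangeSet {
  const reindex = new Set<string>();
  const remove = new Set<string>();
  for (const line of output.split(/\r?\n/)) {
    if (line.trim() === "") continue;
    const fields = line.split("\t");
    const code = (fields[0] ?? "").charAt(0);
    const first = fields[1];
    const second = fields[2];
    if (first === undefined) continue; // malformed line, skip it
    if (code === "D") {
      remove.add(first);
    } else if (code === "R") {
      // Rename: the old path leaves the index, the new path is indexed.
      remove.add(first);
      if (second !== undefined) reindex.add(second);
    } else if (code === "C") {
      // Copy: the source is unchanged, only the new copy needs indexing.
      if (second !== undefined) reindex.add(second);
    } else {
      reindex.add(first); // added, modified, or type-changed
    }
  }
  return { reindex: Array.from(reindex), remove: Array.from(remove) };
}
```

Feeding it `"M\tsrc/auth/login.ts\nD\tsrc/old.ts"` yields one file to re-index and one to remove; everything else in the index stays untouched.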
Not file-level — function-level. Finds the exact `authenticate()` method, not the entire 2000-line auth module. Chunks large functions by control flow.
Resolves imports and call references automatically. Pulls in type definitions, utilities, and middleware your task depends on — even if they don't match the query semantically.
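The import expansion described above can be sketched as a breadth-first walk over an import graph, starting from the files that matched the query. Everything here is illustrative: `expandWithImports`, `ContextFile`, and the `maxDepth` parameter are hypothetical, and the `related` flag is only a guess at how the [RELATED] markers might be driven.

```typescript
interface ContextFile {
  path: string;
  related: boolean; // true if pulled in via imports rather than matched by the query
}

// Walk the import graph breadth-first from the search hits, up to maxDepth hops.
function expandWithImports(
  hits: string[],
  graph: Map<string, string[]>,
  maxDepth: number
): ContextFile[] {
  const result: ContextFile[] = [];
  const seen = new Set<string>();
  for (const hit of hits) {
    if (seen.has(hit)) continue;
    seen.add(hit);
    result.push({ path: hit, related: false });
  }
  let frontier = result.map((f) => f.path);
  for (let depth = 0; depth < maxDepth && frontier.length > 0; depth++) {
    const next: string[] = [];
    for (const file of frontier) {
      for (const dep of graph.get(file) ?? []) {
        if (seen.has(dep)) continue; // also guards against import cycles
        seen.add(dep);
        result.push({ path: dep, related: true });
        next.push(dep);
      }
    }
    frontier = next;
  }
  return result;
}
```

Capping the depth is a design choice in this sketch: one hop usually covers direct type definitions and helpers, while deeper walks trade precision for completeness.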
Everything runs on your machine. No API keys, no cloud uploads, no data leakage. transformers.js + LanceDB are fully local.
Output follows the standard repomix XML format. Kimi, Claude, and GPT all read it without custom prompts.
Index and search within specific directories. `--scope src/auth` keeps focus tight when your codebase grows.
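How `--scope` matches paths internally is not documented here, but a directory-prefix check is one plausible reading. The sketch below is an assumption, not the tool's implementation; `inScope` is a hypothetical helper that normalizes separators and matches on whole path segments, so `src/auth` does not accidentally include `src/authz`.

```typescript
// Return true if filePath lies inside the scope directory (or is the scope itself).
function inScope(filePath: string, scope: string): boolean {
  // Normalize Windows separators, a leading "./", and trailing slashes.
  const norm = (p: string): string =>
    p.replace(/\\/g, "/").replace(/^\.\//, "").replace(/\/+$/, "");
  const file = norm(filePath);
  const dir = norm(scope);
  if (dir === "" || dir === ".") return true; // repository root: everything is in scope
  return file === dir || file.startsWith(dir + "/");
}

// Example: keep only the files under src/auth.
const scoped = ["src/auth/login.ts", "src/authz/policy.ts", "src/db/pool.ts"].filter(
  (p) => inScope(p, "src/auth")
);
```

Here `scoped` contains only `src/auth/login.ts`; the trailing-slash comparison is what keeps `src/authz/policy.ts` out.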
Includes Node.js runtime, transformer models, and LanceDB. Download and run.
Linux, macOS, Windows. Same CLI, same behavior everywhere.
Not walls of code. Only the exact blocks needed for your Jira task, with context preserved.
Stop copy-pasting entire files into AI chat. Give your agent exactly what it needs.