Token Savior Recall
An MCP server that reduces Claude's active token usage by 77% and wall time by 76% through structural code navigation and persistent memory, achieving 100% on a real coding benchmark.
At a Glance
Fully free and open-source under the MIT License. Install via pip or uvx with no cost.
Engagement
Available On
Alternatives
Listed May 2026
About Token Savior Recall
Token Savior Recall is an open-source MCP server that dramatically improves Claude's coding performance by replacing whole-file reads with symbol-level navigation and adding a persistent memory engine. It indexes codebases by symbol — functions, classes, imports, and call graphs — so the model navigates by pointer instead of reading entire files, reducing injected characters by up to 97%. A SQLite-backed memory engine stores decisions, bugfixes, conventions, and session rollups with Bayesian validity scoring, contradiction detection, and per-type TTL decay, re-injecting a compact delta at the start of each new session.
- Structural code navigation — Index your codebase by symbol and navigate via 90 tools covering core navigation, dependency graphs, git diffs, safe editing, checkpoints, and more; install via
pip install "token-savior-recall[mcp]"oruvx token-savior-recall. - Persistent memory engine — 21 memory tools backed by SQLite WAL + FTS5 + optional vector embeddings (
all-MiniLM-L6-v2, 384d), with hybrid BM25 + vector search fused via RRF for cross-session knowledge retention. - Progressive disclosure contract — Three-layer memory search (
memory_index→memory_search→memory_get) keeps lookup cost bounded at ~15, ~60, and ~200 tokens per result respectively. - Bayesian validity & contradiction detection — Each observation carries a validity prior with update rules; contradictions are flagged at save time and stale observations surface explicitly rather than being silently trusted.
- Configurable tool profiles — Six profiles (
full,core,nav,lean,ultra,tiny) filter the advertisedtools/listpayload to match manifest budget constraints, from ~1,070 tokens (tiny) to ~8,770 tokens (full). - Defer-loading via
ts_search— Thetinyprofile exposes 6 tools and makes ~60 others reachable just-in-time via semantic tool search using Nomic 768d embeddings, mirroring the Tool Attention pattern for −95% prefix tokens. - Benchmark-validated performance — Achieves 192/192 (100%) on the tsbench benchmark with Claude Opus 4.7, versus 141/180 (78.3%) for plain Claude, with zero losses across 96 tasks and 12 coding categories.
- Broad MCP client compatibility — Works with Claude Code, Cursor, Codex CLI, Cline, Continue, Windsurf, Aider, Gemini CLI, Copilot CLI, Zed, and any custom MCP client.
- Optional integrations — Telegram bot feed for critical observations, opt-in web viewer via htmx + SSE, and LLM auto-extraction (
TS_AUTO_EXTRACT=1) for passive observation capture from tool use.
Community Discussions
Be the first to start a conversation about Token Savior Recall
Share your experience with Token Savior Recall, ask questions, or help others learn from your insights.
Pricing
Open Source (MIT)
Fully free and open-source under the MIT License. Install via pip or uvx with no cost.
- 90 tools across 13 categories
- Structural code navigation
- Persistent memory engine
- Hybrid BM25 + vector search
- 6 configurable tool profiles
Capabilities
Key Features
- Structural code navigation by symbol (functions, classes, imports, call graph)
- Persistent memory engine with SQLite WAL + FTS5 + vector embeddings
- Hybrid BM25 + vector search fused via RRF
- Bayesian validity scoring for observations
- Contradiction detection at save time
- Per-type TTL decay and LRU scoring
- Symbol staleness detection via content-hash linking
- ROI tracking and auto-promotion of observations
- MDL distillation of redundant observations into conventions
- Progressive disclosure 3-layer memory search contract
- 90 tools across 13 categories
- 6 configurable tool profiles (full, core, nav, lean, ultra, tiny)
- Defer-loading via ts_search with Nomic 768d embeddings
- 8 Claude Code lifecycle hooks
- Optional web viewer (htmx + SSE)
- Optional Telegram bot feed for critical observations
- LLM auto-extraction of observations (opt-in)
- 100% score on tsbench benchmark (192/192 tasks)
- −77% active tokens vs plain Claude
- −76% wall time vs plain Claude
