MCP server · Python · PyPI
Cortex
Memory that remembers what matters.
Cortex gives Claude Code persistent memory through a 3-layer retrieval system that returns only what's relevant, using ~66% fewer tokens on average. It automatically extracts structured facts from your memories and code into a queryable knowledge graph, with entity normalization and multi-hop traversal. Everything stays local — SQLite + Python, zero API calls, zero cost. It's the memory the rest of the family plugs into, and the project that named the lab.
$ pip install cortex-claude && cortex-claude setup What it does
Progressive recall
3-layer retrieval (facts → summaries → full content) stops at the cheapest layer that answers — facts cost 5–15 tokens.
Knowledge graph
Auto-extracted structured facts with entity normalization, deduplication, and multi-hop traversal across connections.
Code graph
Tree-sitter indexes symbols across 7 languages — ~100 tokens to look one up vs 1000+ to read the file.
Smart auto-capture
Hooks capture results from your tools and a background daemon saves them in about 0.3 seconds.
Self-organizing clusters
Memories group into semantic sub-graphs with human-readable labels — browse topics top-down before drilling in.
Decay & temporal awareness
Unused memories fade, contradicted facts lose confidence, and timestamps keep recall fresh instead of stale.
Specs
- Version
- 0.6.0
- Type
- MCP server + CLI for Claude Code
- Platform
- Python 3.11+ · macOS, Linux, Windows
- Dashboard
- Web UI at localhost:37800
- Storage
- Local SQLite + sqlite-vec
- Cost
- Zero API calls, zero cost