EveryDev.ai
Sign inSubscribe
Home
Tools

2,685+ AI tools

  • New
  • Trending
  • Featured
  • Compare
  • Arena
Categories
  • Agents1815
  • Coding1295
  • Infrastructure600
  • Marketing467
  • Projects433
  • Research403
  • Analytics351
  • Design338
  • Security243
  • MCP242
  • Testing238
  • Data230
  • Integration178
  • Prompts160
  • Learning159
  • Communication154
  • Extensions150
  • Voice130
  • Commerce125
  • DevOps108
  • Web80
  • Finance21
AI Tools by Topic
  • AI Coding Assistants
  • Agent Frameworks
  • MCP Servers
  • AI Prompt Tools
  • Vibe Coding Tools
  • AI Design Tools
  • AI Database Tools
  • AI Website Builders
  • AI Testing Tools
  • LLM Evaluations
Follow Us
  • X / Twitter
  • LinkedIn
  • Reddit
  • Discord
  • Threads
  • Bluesky
  • Mastodon
  • YouTube
  • GitHub
  • Instagram
Get Started
  • About
  • Editorial Standards
  • Corrections & Disclosures
  • Community Guidelines
  • Advertise
  • Contact Us
  • Newsletter
  • Submit a Tool
  • Start a Discussion
  • Write A Blog
  • Share A Build
  • Terms of Service
  • Privacy Policy
Explore with AI
  • ChatGPT
  • Gemini
  • Claude
  • Grok
  • Perplexity
Agent Experience
  • llms.txt
Theme
With AI, Everyone is a Dev. EveryDev.ai © 2026
    1. Home
    2. Tools
    3. mnemo
    mnemo icon

    mnemo

    Agent Memory

    Local-first AI memory layer for any LLM that builds a persistent knowledge graph with entity extraction and semantic retrieval — no cloud required.

    Visit Website

    At a Glance

    Pricing
    Open Source

    Fully free and open source under the MIT license. Self-host the binary, Docker image, or install via Cargo.

    Engagement

    Available On

    API
    CLI
    SDK

    Resources

    WebsiteDocsGitHubllms.txt

    Topics

    Agent MemoryLLM OrchestrationRetrieval-Augmented Generation

    Alternatives

    ByteRoverManexHyperspell
    Developer
    zaydmulani09zaydmulani09 builds mnemo, a local-first AI memory layer for…

    Listed Jun 2026

    About mnemo

    mnemo is an open-source, local-first memory sidecar for LLM applications, built in Rust by zaydmulani09 and released under the MIT license. It ships as a single static binary with zero cloud dependency, targeting developers who build custom LLM pipelines and need persistent, structured memory they fully control. The project was created in June 2025 and has accumulated 193 GitHub stars as of early June 2026.

    What It Is

    mnemo is a sidecar service that watches every conversation or document you feed it, extracts named entities and relationships using an LLM, builds a persistent knowledge graph in SQLite, and injects relevant context back into future prompts automatically — the README claims retrieval in under 50ms. It works with Ollama (fully local), OpenAI, Anthropic, or any OpenAI-compatible API. The core differentiator over simpler memory tools is a graph layer powered by petgraph: entities are deduplicated across sessions, relationships are weighted, and multi-hop graph traversal expands retrieval results at query time.

    How the Retrieval Pipeline Works

    When you POST text to /ingest, mnemo sends it to your configured LLM to extract entities (people, tools, places, concepts) and the relationships between them. Entities are deduplicated by name and type, aliases are merged, and everything is written to SQLite while the in-memory petgraph is updated atomically. On /retrieve, mnemo runs a 6-stage pipeline:

    • Full-text chunk search
    • Entity name search
    • Graph expansion via BFS over the knowledge graph
    • Relation filter
    • Score and rank
    • Assemble a context_prompt string for injection into your LLM's system prompt

    Graph-expanded results score at 0.5× so direct matches always rank higher than inferred ones.

    Architecture and Stack

    Four Rust crates are wired together: mnemo-core (entity extraction, graph ops, retrieval engine, DB layer), mnemo-api (Axum REST API), mnemo-cli (CLI tool using blocking reqwest), and mnemo-bench (12 performance benchmark suites). Storage is SQLite in WAL mode with an in-memory petgraph overlay. The README benchmarks on Apple M2 show full retrieval pipeline averaging ~4.2ms and entity inserts at ~0.12ms (~8,300 ops/s) in debug builds; release builds are described as 3–5× faster.

    Setup Paths

    Three quickstart paths are supported:

    • Docker + Ollama — docker compose up -d plus ollama pull llama3; fully free and self-contained
    • Binary — cargo install --path crates/mnemo-api with environment variables pointing at Ollama or OpenAI
    • Python SDK — pip install mnemo-sdk for sync and async clients

    Configuration is via environment variables or a TOML config file. The REST API exposes endpoints for ingest, retrieve, entity CRUD, chunk management, full-text search, graph neighbor traversal, stats, and a wipe operation.

    Who It Is For

    The README explicitly states mnemo is not for everyone: "If you're using a managed agent harness that handles memory for you, you don't need it." It targets developers building custom LLM pipelines who want persistent, structured, local memory without vendor lock-in. The comparison table in the README contrasts mnemo's single Rust binary and zero cloud dependency against Python-daemon alternatives that require cloud storage or lack a graph layer.

    mnemo - 1

    Community Discussions

    Be the first to start a conversation about mnemo

    Share your experience with mnemo, ask questions, or help others learn from your insights.

    Pricing

    OPEN SOURCE

    Open Source

    Fully free and open source under the MIT license. Self-host the binary, Docker image, or install via Cargo.

    • Single static Rust binary
    • SQLite-backed persistent knowledge graph
    • REST API with all endpoints
    • CLI tool
    • Python SDK (sync and async)

    Capabilities

    Key Features

    • Local-first persistent knowledge graph using SQLite and petgraph
    • Entity extraction via any OpenAI-compatible LLM (Ollama, OpenAI, Anthropic)
    • 6-stage scored retrieval pipeline with graph expansion (BFS)
    • Entity deduplication and alias merging across sessions
    • REST API with ingest, retrieve, entity CRUD, chunk management, and search endpoints
    • Single static Rust binary with zero cloud dependency
    • Python SDK with sync and async clients
    • CLI tool for memory management
    • Docker Compose quickstart with Ollama integration
    • TOML and environment variable configuration
    • Full-text search over entities and chunks
    • Multi-hop graph traversal up to depth 5
    • Session-scoped memory grouping
    • 12 performance benchmark suites

    Integrations

    Ollama
    OpenAI
    Anthropic
    Any OpenAI-compatible API
    SQLite
    Docker
    Python
    API Available
    View Docs

    Reviews & Ratings

    No ratings yet

    Be the first to rate mnemo and help others make informed decisions.

    Developer

    zaydmulani09

    zaydmulani09 builds mnemo, a local-first AI memory layer for LLM applications written in Rust. The project ships as a single static binary with a REST API, CLI, and Python SDK, targeting developers who need persistent, structured memory without cloud dependencies. mnemo is open source under the MIT license and available on GitHub.

    Read more about zaydmulani09
    WebsiteGitHub
    1 tool in directory

    Similar Tools

    ByteRover icon

    ByteRover

    ByteRover is a local-first AI memory system that gives agents and developers a portable, hierarchically structured knowledge store with 92.2% retrieval accuracy.

    Manex icon

    Manex

    Manex turns private documents, grounded answers, corrections, and team decisions into reusable memory for teams working with sensitive documents.

    Hyperspell icon

    Hyperspell

    Memory and context layer for AI agents that connects to user data sources for automatic memory and context-aware responses.

    Browse all tools

    Related Topics

    Agent Memory

    Memory layers, frameworks, and services that enable AI agents to store, recall, and manage information across sessions. These tools provide persistent, semantic, and contextual memory for agents, supporting personalization, long-term context retention, graph-based relationships, and hybrid RAG + memory workflows.

    75 tools

    LLM Orchestration

    Platforms and frameworks for designing, managing, and deploying complex LLM workflows with visual interfaces, allowing for the coordination of multiple AI models and services.

    145 tools

    Retrieval-Augmented Generation

    RAG Systems that enhance LLM outputs by retrieving relevant information from external knowledge bases, combining the power of generative AI with information retrieval for more accurate and contextual responses.

    81 tools
    Browse all topics
    Back to all tools
    Discussions