EveryDev.ai
Subscribe
Home
Tools

2,911+ AI tools

  • New
  • Trending
  • Featured
  • Compare
  • Arena
Categories
  • Agents1815
  • Coding1295
  • Infrastructure600
  • Marketing467
  • Projects433
  • Research403
  • Analytics351
  • Design338
  • Security243
  • MCP242
  • Testing238
  • Data230
  • Integration178
  • Prompts160
  • Learning159
  • Communication154
  • Extensions150
  • Voice130
  • Commerce125
  • DevOps108
  • Web80
  • Finance21
AI Tools by Topic
  • AI Coding Assistants
  • Agent Frameworks
  • MCP Servers
  • AI Prompt Tools
  • Vibe Coding Tools
  • AI Design Tools
  • AI Database Tools
  • AI Website Builders
  • AI Testing Tools
  • LLM Evaluations
Follow Us
  • X / Twitter
  • LinkedIn
  • Reddit
  • Discord
  • Threads
  • Bluesky
  • Mastodon
  • YouTube
  • GitHub
  • Instagram
Get Started
  • About
  • Editorial Standards
  • Corrections & Disclosures
  • Community Guidelines
  • Advertise
  • Contact Us
  • Newsletter
  • Submit a Tool
  • Start a Discussion
  • Write A Blog
  • Share A Build
  • Terms of Service
  • Privacy Policy
Explore with AI
  • ChatGPT
  • Gemini
  • Claude
  • Grok
  • Perplexity
Agent Experience
  • llms.txt
Theme
With AI, Everyone is a Dev. EveryDev.ai © 2026
    1. Home
    2. Tools
    3. LEANN
    LEANN icon

    LEANN

    Retrieval-Augmented Generation

    A low-storage vector index that enables private, on-device RAG on millions of documents using 97% less storage than traditional vector databases.

    Visit Website

    At a Glance

    Pricing
    Open Source

    Fully free and open-source under the MIT License. No cost to use, modify, or distribute.

    Engagement

    Available On

    Windows
    macOS
    Linux
    Web
    API

    Resources

    WebsiteDocsGitHubllms.txt

    Topics

    Retrieval-Augmented GenerationVector DatabasesLocal Inference

    Alternatives

    PixelRAGPineconeRivestack
    Developer
    StarTrail-orgBerkeley, CAEst. 2025$64M raised

    Listed Jun 2026

    About LEANN

    LEANN is an open-source vector database and RAG framework developed at the Berkeley Sky Computing Lab, designed to run entirely on personal devices without cloud dependencies. It achieves dramatic storage reductions through graph-based selective recomputation, computing embeddings on-demand rather than storing them all, and is published as a research paper on arXiv (arXiv:2506.08276).

    What It Is

    LEANN is a lightweight, privacy-first vector index that lets users build semantic search and retrieval-augmented generation (RAG) systems on their laptops. Instead of storing every embedding like traditional vector databases (e.g., FAISS), LEANN stores a pruned graph structure and recomputes embeddings only for nodes visited during search. The project claims this approach delivers the same search accuracy as heavyweight solutions while using up to 97% less storage—for example, indexing 60 million text chunks in 6 GB instead of 201 GB.

    Core Architecture

    LEANN's storage efficiency rests on two main techniques:

    • Graph-based selective recomputation: Embeddings are computed on-demand only for nodes traversed during graph search, not stored persistently.
    • High-degree preserving pruning: Important "hub" nodes in the graph are retained while redundant connections are removed, keeping the graph compact.
    • Two backends: HNSW (default, maximum storage savings) and DiskANN (better speed-accuracy trade-off using PQ-based graph traversal with real-time reranking).
    • Dynamic batching: Embedding computations are batched for efficient GPU utilization when available.

    The index is stored in a Compressed Sparse Row (CSR) format to further minimize graph storage overhead.

    Data Sources and RAG Applications

    LEANN ships with ready-made application modules for a wide range of personal data sources:

    • Documents: PDF, TXT, MD, DOCX, PPTX, and code files with AST-aware chunking for Python, Java, C#, and TypeScript
    • Email: Apple Mail (macOS)
    • Browser history: Chrome (macOS and Linux)
    • Chat history: WeChat, iMessage, ChatGPT exports, Claude exports
    • Live data via MCP: Slack channels, Twitter bookmarks, and any MCP-compatible platform
    • Multimodal PDFs: ColQwen/ColPali vision-language models for documents with figures and diagrams

    The CLI supports building, searching, interactive chat, file-change detection via Merkle tree snapshots (leann watch), and index management.

    LLM and Embedding Provider Support

    LEANN supports multiple LLM backends for text generation and embedding:

    • Local inference: Ollama, LM Studio, vLLM, llama.cpp, SGLang, LiteLLM
    • Cloud providers: OpenAI, Anthropic, Gemini, Groq, DeepSeek, Mistral, and others via OpenAI-compatible APIs
    • Embedding modes: sentence-transformers, OpenAI, MLX (Apple Silicon), Ollama

    Users can mix providers—for example, using a local Ollama model for generation while using Jina AI for embeddings.

    MCP Integration and Claude Code Support

    LEANN includes a native MCP (Model Context Protocol) server (leann_mcp) that integrates directly with Claude Code, providing semantic search over indexed codebases as a drop-in replacement for Claude Code's built-in keyword search. Setup requires a single claude mcp add command after global installation via uv tool install.

    Update: v0.3.7

    The latest release is v0.3.7, published in March 2026. The repository was created in June 2025 and has seen active development, with the community survey for v0.4 soliciting votes on GPU acceleration and additional integrations. The project tracks zero telemetry and relies on the community survey as its primary feedback mechanism.

    LEANN - 1

    Community Discussions

    Be the first to start a conversation about LEANN

    Share your experience with LEANN, ask questions, or help others learn from your insights.

    Pricing

    OPEN SOURCE

    Open Source

    Fully free and open-source under the MIT License. No cost to use, modify, or distribute.

    • Full LEANN vector index and RAG framework
    • HNSW and DiskANN backends
    • CLI and Python API
    • All data source integrations (documents, email, browser, chat, MCP)
    • MCP server for Claude Code

    Capabilities

    Key Features

    • 97% storage reduction vs traditional vector databases
    • Graph-based selective recomputation of embeddings
    • High-degree preserving graph pruning
    • HNSW and DiskANN backends
    • RAG on documents (PDF, TXT, MD, DOCX, PPTX)
    • RAG on Apple Mail
    • RAG on Chrome browser history
    • RAG on WeChat, iMessage, ChatGPT, Claude chat history
    • Live data RAG via MCP (Slack, Twitter)
    • Multimodal PDF retrieval with ColQwen/ColPali
    • AST-aware code chunking for Python, Java, C#, TypeScript
    • Native MCP server for Claude Code integration
    • CLI with build, search, ask, watch, list, remove commands
    • Metadata filtering with rich operator support
    • Grep (exact text) search mode
    • File change detection via Merkle tree snapshots
    • Support for Ollama, OpenAI, Anthropic, HuggingFace LLM backends
    • OpenAI-compatible API support for embeddings and generation
    • Zero telemetry
    • Fully local and private operation

    Integrations

    Ollama
    OpenAI
    Anthropic (Claude)
    HuggingFace
    LM Studio
    vLLM
    llama.cpp
    SGLang
    LiteLLM
    Jina AI
    Groq
    DeepSeek
    Mistral AI
    Gemini
    OpenRouter
    LlamaIndex
    LangChain
    FAISS
    DiskANN
    MCP (Model Context Protocol)
    Claude Code
    Slack MCP server
    Twitter MCP server
    Apple Mail
    Google Chrome
    WeChat
    iMessage
    ChatGPT
    ColQwen2
    ColPali
    API Available
    View Docs

    Ratings & Reviews

    No ratings yet

    Be the first to rate LEANN and help others make informed decisions.

    Developer

    StarTrail-org

    StarTrail-org builds open-source AI infrastructure for pixel-native search and retrieval, with research roots at Berkeley SkyLab, BAIR, and the Berkeley NLP Group. The team develops PixelRAG, a visual RAG pipeline that replaces text parsing with screenshot-based embedding, and LEANN, a related project in efficient approximate nearest-neighbor search. Contributors include researchers from UC Berkeley with backgrounds in distributed systems, NLP, and vision-language models. The organization releases full training datasets, model adapters, and pre-built indexes alongside its codebases.

    Founded 2025
    Berkeley, CA
    $64M raised
    10 employees

    Used by

    Open-source AI community
    Berkeley research labs
    Read more about StarTrail-org
    WebsiteGitHubLinkedIn
    2 tools in directory

    Similar Tools

    PixelRAG icon

    PixelRAG

    Pixel-native visual retrieval-augmented generation system that searches documents by screenshot tiles using vision embeddings instead of text parsing.

    Pinecone icon

    Pinecone

    Pinecone is a fully managed, serverless vector database built for powering AI applications like semantic search, RAG, agents, and recommendations at production scale.

    Rivestack icon

    Rivestack

    Managed PostgreSQL with pgvector built in, running on NVMe storage in the EU, offering fixed monthly pricing and no platform overhead for AI vector workloads.

    Browse all tools

    Related Topics

    Retrieval-Augmented Generation

    RAG Systems that enhance LLM outputs by retrieving relevant information from external knowledge bases, combining the power of generative AI with information retrieval for more accurate and contextual responses.

    91 tools

    Vector Databases

    Specialized databases optimized for storing and retrieving vector embeddings that power semantic search, recommendation systems, and other AI applications with similarity matching.

    30 tools

    Local Inference

    Tools and platforms for running AI inference locally without cloud dependence.

    131 tools
    Browse all topics
    Back to all toolsSuggest an edit
    ratings
    discussions