Main Menu
  • Tools
  • Developers
  • Topics
  • Discussions
  • News
  • Blogs
  • Builds
  • Contests
  • Compare
  • Arena
Create
    EveryDev.ai
    Sign inSubscribe
    Home
    Tools

    1,959+ AI tools

    • New
    • Trending
    • Featured
    • Compare
    • Arena
    Categories
    • Agents1079
    • Coding989
    • Infrastructure422
    • Marketing403
    • Design350
    • Projects317
    • Analytics306
    • Research293
    • Testing188
    • Data165
    • Integration163
    • Security161
    • MCP148
    • Learning138
    • Communication121
    • Extensions115
    • Commerce112
    • Prompts109
    • Voice103
    • DevOps88
    • Web71
    • Finance18
    1. Home
    2. Tools
    3. QMD
    QMD icon

    QMD

    Agent Memory

    QMD is a local search engine that indexes Markdown files and combines BM25 keyword search, vector semantic search, and LLM re-ranking for AI agent memory and retrieval.

    Visit Website

    At a Glance

    Pricing
    Open Source

    Fully free and open-source CLI tool available on GitHub under the MIT license.

    Engagement

    Available On

    macOS
    Linux
    SDK

    Resources

    WebsiteDocsGitHubllms.txt

    Topics

    Agent MemoryLocal InferenceMCP Tools

    Alternatives

    TIMPsContextPoolGBrain
    Developer
    Tobi LütkeOttawa, CanadaEst. 2004$122M raised

    Listed Feb 2026

    About QMD

    QMD (Query Markup Documents) is an open-source, on-device search engine that indexes Markdown files and provides hybrid retrieval combining BM25 full-text search, vector semantic search, and LLM re-ranking. Built by Tobi Lütke, it runs entirely locally using GGUF models via node-llama-cpp with no API keys or cloud dependencies required. QMD is widely adopted as a memory backend for AI coding agents such as Claude Code and OpenClaw, replacing basic keyword search with intelligent, context-aware retrieval.

    • Hybrid search pipeline - Combines BM25 keyword matching via SQLite FTS5 with vector semantic search and LLM-based re-ranking for high-quality results across different query types.
    • Query expansion - Uses a fine-tuned 1.7B parameter model to generate alternative phrasings of your search query, broadening recall without sacrificing precision.
    • LLM re-ranking - A local Qwen3 reranker model re-scores the top candidates using yes/no classification with log-probability confidence, improving result ordering.
    • Collection and context management - Organize documents into named collections with glob patterns and attach hierarchical context descriptions that are returned alongside search results, giving LLMs richer information for decision-making.
    • MCP server integration - Exposes search, retrieval, and status tools via Model Context Protocol over stdio or HTTP transport, enabling direct integration with Claude Desktop, Claude Code, and other MCP-compatible agents.
    • Smart document chunking - Splits documents into approximately 900-token chunks with 15 percent overlap using a scoring algorithm that finds natural Markdown break points rather than cutting at arbitrary token boundaries.
    • Multiple output formats - Supports JSON, CSV, Markdown, XML, and file-list output modes designed for agentic workflows where structured data is needed.
    • Document retrieval by ID - Each indexed document receives a six-character hash identifier, enabling fast retrieval by docid, file path with optional line offset, or glob pattern via multi-get.
    • Fully local and private - All three GGUF models (embedding, reranker, query expansion) totaling approximately 2 GB run on-device. No data leaves the machine.

    To get started, install with npm install -g @tobilu/qmd or bun install -g @tobilu/qmd, add collections pointing to your Markdown directories, run qmd embed to generate vector embeddings, and search with qmd search, qmd vsearch, or qmd query for the full hybrid pipeline.

    QMD - 1

    Community Discussions

    Be the first to start a conversation about QMD

    Share your experience with QMD, ask questions, or help others learn from your insights.

    Pricing

    OPEN SOURCE

    Open Source

    Fully free and open-source CLI tool available on GitHub under the MIT license.

    • BM25 full-text search via SQLite FTS5
    • Vector semantic search with local embeddings
    • LLM re-ranking with Qwen3 reranker
    • Query expansion with fine-tuned model
    • MCP server with stdio and HTTP transport

    Capabilities

    Key Features

    • Hybrid search combining BM25, vector, and LLM re-ranking
    • Local vector embeddings via embeddinggemma-300M GGUF model
    • LLM re-ranking with qwen3-reranker-0.6b
    • Fine-tuned query expansion model for broader recall
    • Reciprocal Rank Fusion with position-aware blending
    • MCP server for Claude Desktop and Claude Code integration
    • HTTP transport mode with daemon support for shared server
    • Collection-based document organization with glob patterns
    • Hierarchical context annotations for search results
    • Smart ~900-token chunking with natural Markdown break points
    • Document retrieval by path, docid hash, or glob pattern
    • Multi-get for batch document retrieval
    • JSON, CSV, XML, Markdown, and file-list output formats
    • Runs fully on-device with no API keys or cloud services
    • Auto-downloads GGUF models from HuggingFace on first use

    Integrations

    Claude Desktop
    Claude Code
    OpenClaw
    MCP (Model Context Protocol)
    node-llama-cpp
    SQLite FTS5
    HuggingFace GGUF models
    Obsidian
    Git

    Reviews & Ratings

    No ratings yet

    Be the first to rate QMD and help others make informed decisions.

    Developer

    Tobi Lütke

    Tobi Lütke is the founder and CEO of Shopify. He builds open-source developer tools including QMD, a local hybrid search engine for Markdown files designed for AI agent workflows. His projects focus on local-first, privacy-respecting tooling that runs entirely on-device.

    Founded 2004
    Ottawa, Canada
    $122M raised
    8,100 employees

    Used by

    Allbirds
    Gymshark
    Heinz
    Mattel
    +2 more
    Read more about Tobi Lütke
    WebsiteGitHubX / Twitter
    1 tool in directory

    Similar Tools

    TIMPs icon

    TIMPs

    Open source AI memory agent that stores facts, preferences, goals, and reflections with persistent memory across sessions using PostgreSQL and Qdrant.

    ContextPool icon

    ContextPool

    ContextPool gives AI coding agents persistent memory across sessions by extracting and storing engineering insights from past sessions via MCP.

    GBrain icon

    GBrain

    An open-source CLI and TypeScript library that gives AI agents a persistent, searchable knowledge base built from markdown files, with hybrid vector+keyword search powered by Postgres and pgvector.

    Browse all tools

    Related Topics

    Agent Memory

    Memory layers, frameworks, and services that enable AI agents to store, recall, and manage information across sessions. These tools provide persistent, semantic, and contextual memory for agents, supporting personalization, long-term context retention, graph-based relationships, and hybrid RAG + memory workflows.

    36 tools

    Local Inference

    Tools and platforms for running AI inference locally without cloud dependence.

    68 tools

    MCP Tools

    Tools built with the Model Context Protocol for specific tasks.

    33 tools
    Browse all topics
    Back to all tools
    Explore AI Tools
    • AI Coding Assistants
    • Agent Frameworks
    • MCP Servers
    • AI Prompt Tools
    • Vibe Coding Tools
    • AI Design Tools
    • AI Database Tools
    • AI Website Builders
    • AI Testing Tools
    • LLM Evaluations
    Follow Us
    • X / Twitter
    • LinkedIn
    • Reddit
    • Discord
    • Threads
    • Bluesky
    • Mastodon
    • YouTube
    • GitHub
    • Instagram
    Get Started
    • About
    • Editorial Standards
    • Corrections & Disclosures
    • Community Guidelines
    • Advertise
    • Contact Us
    • Newsletter
    • Submit a Tool
    • Start a Discussion
    • Write A Blog
    • Share A Build
    • Terms of Service
    • Privacy Policy
    Explore with AI
    • ChatGPT
    • Gemini
    • Claude
    • Grok
    • Perplexity
    Agent Experience
    • llms.txt
    Theme
    With AI, Everyone is a Dev. EveryDev.ai © 2026
    57views
    Discussions