EveryDev.ai

    Sift

    Knowledge Management

    A capture-first personal knowledge base that collects links, text, screenshots, and notes, then uses AI to process them into searchable, reusable knowledge pages.


    At a Glance

    Pricing
    Free

    Free to run locally or self-host under the source-available license for personal, educational, research, and internal organizational use.

    Available On

    Web
    API
    CLI

    Resources

    Website
    Docs
    GitHub
    llms.txt

    Topics

    Knowledge Management
    Retrieval-Augmented Generation
    MCP Servers

    Alternatives

    Project N.O.M.A.D.
    Smart Connections
    GBrain
    Developer
    Yuanlw
    Yuanlw builds Sift, a capture-first personal knowledge base…

    Listed May 2026

    About Sift

    Sift is a capture-first personal knowledge base built by Yuanlw, written primarily in TypeScript and hosted on GitHub. It is designed to close the gap between saving information and actually reusing it: users dump links, text, screenshots, and notes first, and AI analysis, association, and indexing happen in the background. The project is currently a functional personal MVP, not yet a mature public SaaS product.

    What It Is

    Sift sits in the personal knowledge management category, but with a specific philosophy: saving must be fast, and understanding can happen later. Rather than requiring users to assign titles, categories, or tags at capture time, Sift accepts raw material — URLs, copied text, images, quick notes — and processes it asynchronously into structured source records, readable wiki-style knowledge pages, and retrieval-ready vector chunks. The result is a personal knowledge asset that can be searched, queried, and fed into external Agent workflows.

    Core Workflow

    The pipeline Sift implements follows a clear sequence:

    • Collect — Quick capture of links, text, screenshots, and notes via an inbox interface; supports batch URL import, browser bookmark HTML import, and bulk photo/screenshot import.
    • Process — Background extraction, structuring, source record generation, knowledge page generation, semantic chunking, and vector indexing.
    • Organize — Inbox views for today's captures, in-progress, failed, pending notes, ignored, and test data; failed items can be retried, supplemented, or ignored.
    • Retrieve — Full-library Q&A, per-knowledge-page Q&A with history, full-text search, semantic recall, recent review, knowledge discovery, and duplicate detection.
    • Expose — Agent API and MCP endpoint so external tools can read Sift's knowledge context.
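
The capture-first idea behind the Collect, Process, and Organize steps above can be sketched as a small state machine. This is an illustrative sketch only, not Sift's actual code: the `InboxItem` type, the `Status` values, and the fixed-size chunking are all assumptions made for the example, and the real pipeline runs extraction and indexing with AI models in the background.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import List, Optional


class Status(Enum):
    # Hypothetical states, loosely mirroring the inbox views described above.
    CAPTURED = "captured"
    PROCESSING = "processing"
    DONE = "done"
    FAILED = "failed"
    IGNORED = "ignored"


@dataclass
class InboxItem:
    raw: str                                  # URL, pasted text, or note, stored as-is
    status: Status = Status.CAPTURED
    error: Optional[str] = None
    chunks: List[str] = field(default_factory=list)


def capture(inbox: List[InboxItem], raw: str) -> InboxItem:
    """Collect: store the raw material immediately, with no title or tags."""
    item = InboxItem(raw=raw)
    inbox.append(item)
    return item


def process(item: InboxItem, chunk_size: int = 40) -> None:
    """Process: a stand-in for background extraction and chunking."""
    item.status = Status.PROCESSING
    text = item.raw.strip()
    if not text:
        item.status, item.error = Status.FAILED, "empty capture"
        return
    # Naive fixed-size chunking; a real system would chunk semantically.
    item.chunks = [text[i:i + chunk_size] for i in range(0, len(text), chunk_size)]
    item.status, item.error = Status.DONE, None


def retry(item: InboxItem, supplement: str = "") -> None:
    """Organize: a failed item can be supplemented and retried."""
    if item.status is not Status.FAILED:
        return
    item.raw += supplement
    process(item)


def ignore(item: InboxItem) -> None:
    """Organize: a failed item can also be set aside."""
    item.status = Status.IGNORED
```

The point of the design is that `capture` never blocks on analysis: an item is saved first and only later moves to done or failed, where it can be retried, supplemented, or ignored.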

    Architecture and Model Configuration

    Sift requires three model types: a text/chat model for extraction, structuring, knowledge page generation, and Q&A; an embedding model for retrieval; and an optional vision model for image OCR. The /settings page offers two modes — using Sift's default models (with quota tracking) or configuring a custom OpenAI-compatible endpoint. Custom API keys are never returned to the frontend; multi-user deployments can encrypt keys server-side. The model layer is designed to support OpenAI, Anthropic, Google Gemini, Qwen, DeepSeek, and local model gateways without changing the product's core boundaries.
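
To make the three model roles and the key-handling rule concrete, here is a minimal sketch of how such settings could be represented. Every name in it (`ModelEndpoint`, `ModelSettings`, `validate`, `public_view`, the `key_configured` flag, and the example URL) is hypothetical and is not taken from Sift's codebase; it only illustrates the described behavior that custom API keys stay on the server.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass(frozen=True)
class ModelEndpoint:
    base_url: str   # an OpenAI-compatible endpoint (hypothetical field name)
    model: str
    api_key: str    # secret; must never be sent to the frontend


@dataclass(frozen=True)
class ModelSettings:
    mode: str                                 # "default" or "custom"
    text: ModelEndpoint                       # extraction, structuring, Q&A
    embedding: ModelEndpoint                  # retrieval
    vision: Optional[ModelEndpoint] = None    # optional, for image OCR


def validate(settings: ModelSettings) -> List[str]:
    """Return a list of problems; an empty list means the settings look usable."""
    problems: List[str] = []
    if settings.mode not in ("default", "custom"):
        problems.append("mode must be 'default' or 'custom'")
    if settings.mode == "custom":
        for role, ep in (("text", settings.text), ("embedding", settings.embedding)):
            if not ep.base_url:
                problems.append(f"{role}: base_url is required")
            if not ep.api_key:
                problems.append(f"{role}: api_key is required")
    return problems


def public_view(settings: ModelSettings) -> Dict[str, object]:
    """Build the payload that is safe to return to a browser: no key material."""
    def redact(ep: Optional[ModelEndpoint]) -> Optional[Dict[str, object]]:
        if ep is None:
            return None
        return {
            "base_url": ep.base_url,
            "model": ep.model,
            "key_configured": bool(ep.api_key),  # a flag only, never the key itself
        }

    return {
        "mode": settings.mode,
        "text": redact(settings.text),
        "embedding": redact(settings.embedding),
        "vision": redact(settings.vision),
    }
```

Returning only a boolean flag lets a settings page show that a key is configured without the secret ever leaving the server.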

    Knowledge Graph and Retrieval

    Phases P10 and P11 introduced an invisible knowledge relationship layer and graph-aware retrieval. Ask and Agent queries can now expand recall using Source-Wiki relationships, related wiki pages, and duplicate source signals. Phase P12 added one-click human-confirmed merging of high-confidence related or duplicate discoveries into existing wiki pages, preserving merge history, source relationships, and chunk reconstruction.
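
Graph-aware recall of the kind described above can be illustrated with a one-hop expansion over a relationship map. This is a sketch under stated assumptions: the `expand_recall` function, the `related` mapping, and the page names are invented for the example, and Sift's actual relationship model (Source-Wiki links, related pages, duplicate signals) is likely richer than a plain adjacency map.

```python
from typing import Dict, List, Set


def expand_recall(
    seed_pages: List[str],
    related: Dict[str, Set[str]],
    max_extra: int = 5,
) -> List[str]:
    """Return the seed pages followed by up to max_extra one-hop neighbours.

    seed_pages: pages found by ordinary full-text or semantic search.
    related:    hypothetical adjacency map of page -> related pages.
    """
    result = list(dict.fromkeys(seed_pages))   # de-duplicate, keep ranking order
    seen = set(result)
    extra = 0
    for page in list(result):                  # iterate over the seeds only (one hop)
        for neighbour in sorted(related.get(page, set())):
            if extra >= max_extra:
                return result
            if neighbour not in seen:
                seen.add(neighbour)
                result.append(neighbour)
                extra += 1
    return result


if __name__ == "__main__":
    graph = {
        "rag-notes": {"embeddings", "chunking"},
        "embeddings": {"rag-notes", "pgvector"},
    }
    print(expand_recall(["rag-notes"], graph))
```

Capping the expansion with `max_extra` keeps related pages from crowding out the directly matched results, which stay at the front of the list.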

    Current Status

    The README explicitly states that Sift is a usable personal MVP suitable for daily personal use and ongoing product review, but not yet a mature public hosted SaaS. Completed phases (P0–P12) cover the full capture-first foundation, extraction, source/knowledge page generation, search, Q&A, Agent API, MCP, mobile-first capture, external import, review/discovery, model metering, account security, knowledge graph, and merge workflows. Still needed before broader deployment: email verification, password recovery, team/multi-tenant support, production task queues, model provider expansion, regression testing, and a clearer account/deployment system. The license is source-available (not OSI open source): personal, educational, research, and internal organizational use is permitted, but offering it as a public SaaS or resale service without explicit written permission is not allowed.

    Pricing

    FREE

    Self-Hosted

    • Full capture-first inbox
    • AI extraction and knowledge page generation
    • Full-text and semantic search
    • Q&A with history
    • Agent API and MCP endpoint

    Capabilities

    Key Features

    • Capture-first inbox for links, text, screenshots, and notes
    • Background AI extraction and structuring
    • Source record and knowledge page generation
    • Full-text search and semantic recall
    • Full-library and per-page Q&A with history
    • Batch URL import, browser bookmark HTML import, and bulk screenshot import
    • Recent review, knowledge discovery, and duplicate detection
    • Agent API and MCP endpoint for external tool integration
    • Knowledge relationship graph and graph-aware retrieval
    • One-click human-confirmed merge of related/duplicate content
    • Model configuration via settings UI (default or custom OpenAI-compatible)
    • Model usage metering and quota tracking
    • Inbox views: today, in-progress, failed, pending notes, ignored

    Integrations

    OpenAI-compatible text/chat models
    OpenAI-compatible embedding models
    Vision/OCR models
    Anthropic
    Google Gemini
    Qwen
    DeepSeek
    Local model gateways
    MCP (Model Context Protocol)
    Stripe (for SaaS billing in hosted deployments)
    Docker / Docker Compose
    PostgreSQL

    Developer

    Yuanlw

    Yuanlw builds Sift, a capture-first personal knowledge base that turns daily information intake into reusable knowledge assets. The project is written in TypeScript and self-hosted via Docker and PostgreSQL. It exposes an Agent API and MCP endpoint, positioning it as a long-term knowledge layer for personal and agent-driven workflows.


    Similar Tools

    Project N.O.M.A.D.

    A self-contained, offline-first knowledge and education server with AI chat, Wikipedia, maps, and data tools — accessible entirely through a browser.

    Smart Connections

    An Obsidian plugin that uses local embeddings to surface semantically related notes in your vault, helping you rediscover and connect ideas without manual tagging or linking.

    GBrain

    An open-source CLI and TypeScript library that gives AI agents a persistent, searchable knowledge base built from markdown files, with hybrid vector+keyword search powered by Postgres and pgvector.


    Related Topics

    Knowledge Management

    AI-powered systems for organizing, discovering, and accessing collective team knowledge with intelligent search, tagging, and contextual recommendations across knowledge bases and wikis.

    92 tools

    Retrieval-Augmented Generation

    RAG Systems that enhance LLM outputs by retrieving relevant information from external knowledge bases, combining the power of generative AI with information retrieval for more accurate and contextual responses.

    73 tools

    MCP Servers

    Model Context Protocol servers that extend AI capabilities.

    93 tools