EveryDev.ai
Sign inSubscribe
Home
Tools

2,790+ AI tools

  • New
  • Trending
  • Featured
  • Compare
  • Arena
Categories
  • Agents1916
  • Coding1362
  • Infrastructure646
  • Marketing508
  • Projects459
  • Research417
  • Design399
  • Analytics362
  • MCP249
  • Security249
  • Testing243
  • Data235
  • Integration181
  • Prompts171
  • Learning164
  • Communication163
  • Extensions158
  • Voice140
  • Commerce128
  • DevOps113
  • Web83
  • Finance24
AI Tools by Topic
  • AI Coding Assistants
  • Agent Frameworks
  • MCP Servers
  • AI Prompt Tools
  • Vibe Coding Tools
  • AI Design Tools
  • AI Database Tools
  • AI Website Builders
  • AI Testing Tools
  • LLM Evaluations
Follow Us
  • X / Twitter
  • LinkedIn
  • Reddit
  • Discord
  • Threads
  • Bluesky
  • Mastodon
  • YouTube
  • GitHub
  • Instagram
Get Started
  • About
  • Editorial Standards
  • Corrections & Disclosures
  • Community Guidelines
  • Advertise
  • Contact Us
  • Newsletter
  • Submit a Tool
  • Start a Discussion
  • Write A Blog
  • Share A Build
  • Terms of Service
  • Privacy Policy
Explore with AI
  • ChatGPT
  • Gemini
  • Claude
  • Grok
  • Perplexity
Agent Experience
  • llms.txt
Theme
With AI, Everyone is a Dev. EveryDev.ai © 2026
    1. Home
    2. Tools
    3. ScreenMind
    ScreenMind icon

    ScreenMind

    Agent Memory

    AI-powered screen memory tool that captures your screen, analyzes it with Gemma 4, and builds a searchable, conversational local memory — 100% private with zero cloud dependencies.

    Visit Website

    At a Glance

    Pricing
    Open Source

    Fully free and open-source under the MIT License. Self-host on your own machine.

    Engagement

    Available On

    Windows
    macOS
    Linux
    Web
    API

    Resources

    WebsiteDocsGitHubllms.txt

    Topics

    Agent MemoryLocal InferenceMCP Servers

    Alternatives

    MemPalaceMemvidKeel
    Developer
    Ayush ShekharAyush Shekhar builds open-source AI tooling focused on local…

    Listed Jun 2026

    About ScreenMind

    ScreenMind is an open-source, privacy-first screen memory tool built by Ayush Shekhar and released under the MIT License. It continuously captures your screen, runs multimodal AI analysis locally using Gemma 4 E2B via llama.cpp, and stores everything in a searchable SQLite database — no cloud, no telemetry, no data leaving your machine. The project positions itself explicitly as a privacy-respecting alternative to Microsoft Recall, which the README notes "stores data in plaintext, sends telemetry, and was met with massive privacy backlash."

    What It Is

    ScreenMind is a local AI memory system for your desktop. It sits in the background, detects meaningful screen changes (rather than firing on a fixed timer), and sends each screenshot through a multi-model pipeline: EasyOCR extracts raw text, Gemma 4 E2B analyzes the image and text together to produce structured JSON (app name, activity category, mood, scene description, spatial layout), MiniLM-L6-v2 generates semantic embeddings, and FTS5 indexes everything for keyword search. The result is a personal knowledge base you can query conversationally — asking things like "what did Ishaa say on Discord?" and getting the actual message back.

    Architecture and AI Pipeline

    The system runs four AI models in concert:

    • EasyOCR — extracts raw screen text fed as context to Gemma
    • Gemma 4 E2B (via llama.cpp) — vision + audio + reasoning in one model; handles screenshot analysis, voice memo transcription, meeting transcription, daily summaries, and chat answers
    • MiniLM-L6-v2 — 80MB CPU-resident model generating 384-dim semantic vectors
    • SQLite WAL + FTS5 — zero-config database with concurrent reads and full-text search

    Three analysis modes trade speed for depth: Accurate (~76s, deep thinking + layout detection), Balanced (~40s, thinking enabled), and Fast (~12s, no thinking, layout via OCR clustering). A per-app perceptual hash cache with app-aware staleness (communication apps refresh faster than IDEs) significantly reduces inference calls.

    Privacy and Security Model

    All computation happens on-device after the one-time model download (~5GB GGUF). Key privacy controls include:

    • Sensitive data filter — auto-redacts credit cards, SSNs, API keys, and passwords before storage
    • AES encryption at rest — Fernet encryption for screenshots with OS keyring integration
    • Dashboard PIN lock — session-based auth with configurable auto-lock timeout
    • Incognito mode — one-click pause; nothing recorded
    • App blocklist — silently skips capture for specified applications

    Agent Platform and MCP Integration

    ScreenMind ships a full agent/plugin system. Markdown agents (.md files with YAML frontmatter) let anyone write English prompts that Gemma executes against screen data on a schedule. Python plugins (.py) get full SDK access with state persistence and direct LLM calls. Four built-in agents cover daily journaling, focus reporting, meeting action extraction, and code changelog summarization.

    The project also exposes an MCP server (mcp_server.py) over stdio transport, making screen history available to Claude Desktop, Cursor, and VS Code. MCP tools include search_screen, get_recent_activity, get_daily_summary, capture_now, and search_audio for meeting transcripts.

    Setup Path and System Requirements

    ScreenMind requires Python 3.10+, approximately 5GB of disk space for the Gemma 4 E2B GGUF model, and a GPU with 4GB+ VRAM (recommended). It runs on Windows, macOS, and Linux via OS-specific platform adapters. Setup is: clone the repo, create a virtual environment, install requirements, and run main.py. On first launch it auto-downloads the model, starts llama-server, and opens a web dashboard at http://127.0.0.1:7777. Configuration is available via .env, environment variables, or the Settings tab in the dashboard.

    Current Status

    The repository was created in May 2026 and last pushed in June 2026, with 80 stars and 3 forks at time of indexing. The project is actively maintained with CI via GitHub Actions and Codecov coverage tracking. The README notes macOS/Linux platform adapters exist but flags real-hardware testing as a high-impact contribution area, suggesting Windows is the most tested platform currently.

    ScreenMind - 1

    Community Discussions

    Be the first to start a conversation about ScreenMind

    Share your experience with ScreenMind, ask questions, or help others learn from your insights.

    Pricing

    OPEN SOURCE

    Open Source

    Fully free and open-source under the MIT License. Self-host on your own machine.

    • Full screen capture and AI analysis
    • Gemma 4 E2B multimodal inference via llama.cpp
    • Hybrid semantic + keyword search
    • Conversational chat with screen memory
    • Voice memo and meeting transcription

    Capabilities

    Key Features

    • Smart screen capture with content-change detection
    • Gemma 4 E2B multimodal analysis (vision + audio + reasoning)
    • Hybrid semantic + keyword search (MiniLM + FTS5)
    • Conversational RAG chat with screen memory
    • Voice memo recording and transcription
    • Meeting auto-detection and transcription (Zoom/Teams/Meet)
    • Analytics dashboard with category breakdown and hourly heatmap
    • Day Rewind timelapse playback
    • Three analysis modes: Accurate, Balanced, Fast
    • Per-app perceptual hash cache
    • 100% local inference via llama.cpp
    • Sensitive data auto-redaction (credit cards, SSNs, API keys)
    • AES encryption at rest with OS keyring
    • Dashboard PIN lock with auto-lock timeout
    • Incognito mode (one-click pause)
    • Agent platform: Markdown AI agents and Python plugins
    • MCP server for Claude Desktop, Cursor, VS Code
    • Obsidian and Notion integration
    • Webhook support (HMAC signed, auto-retry)
    • System-wide hotkeys (bookmark, pause, voice memo)
    • FastAPI REST server with Swagger docs
    • Cross-platform support: Windows, macOS, Linux

    Integrations

    Claude Desktop (MCP)
    Cursor (MCP)
    VS Code (MCP)
    Obsidian
    Notion
    Slack (webhooks)
    Discord (webhooks)
    IFTTT (webhooks)
    Zoom (meeting detection)
    Microsoft Teams (meeting detection)
    Google Meet (meeting detection)
    HuggingFace (model download)
    llama.cpp / llama-server
    EasyOCR
    MiniLM-L6-v2
    API Available
    View Docs

    Ratings & Reviews

    No ratings yet

    Be the first to rate ScreenMind and help others make informed decisions.

    Developer

    Ayush Shekhar

    Ayush Shekhar builds open-source AI tooling focused on local, privacy-first computing. ScreenMind, the flagship project, delivers a fully on-device screen memory system powered by Gemma 4 and llama.cpp — no cloud, no telemetry. The project demonstrates deep integration of multimodal AI (vision, audio, reasoning) with practical desktop productivity workflows.

    Read more about Ayush Shekhar
    WebsiteGitHub
    1 tool in directory

    Similar Tools

    MemPalace icon

    MemPalace

    A local, open-source AI memory system that stores all your conversations verbatim in ChromaDB and makes them findable via semantic search, achieving 96.6% LongMemEval R@5 with zero API calls.

    Memvid icon

    Memvid

    A portable, single-file memory layer for AI agents with instant retrieval, long-term memory, and no database required.

    Keel icon

    Keel

    A local-first AI desktop assistant whose memory lives in plain markdown files on your disk, with support for Claude, GPT, OpenRouter, and Ollama.

    Browse all tools

    Related Topics

    Agent Memory

    Memory layers, frameworks, and services that enable AI agents to store, recall, and manage information across sessions. These tools provide persistent, semantic, and contextual memory for agents, supporting personalization, long-term context retention, graph-based relationships, and hybrid RAG + memory workflows.

    79 tools

    Local Inference

    Tools and platforms for running AI inference locally without cloud dependence.

    129 tools

    MCP Servers

    Model Context Protocol servers that extend AI capabilities.

    106 tools
    Browse all topics
    Back to all toolsSuggest an edit
    ratings
    discussion