EveryDev.ai
Subscribe
Home
Tools

2,911+ AI tools

  • New
  • Trending
  • Featured
  • Compare
  • Arena
Categories
  • Agents1815
  • Coding1295
  • Infrastructure600
  • Marketing467
  • Projects433
  • Research403
  • Analytics351
  • Design338
  • Security243
  • MCP242
  • Testing238
  • Data230
  • Integration178
  • Prompts160
  • Learning159
  • Communication154
  • Extensions150
  • Voice130
  • Commerce125
  • DevOps108
  • Web80
  • Finance21
AI Tools by Topic
  • AI Coding Assistants
  • Agent Frameworks
  • MCP Servers
  • AI Prompt Tools
  • Vibe Coding Tools
  • AI Design Tools
  • AI Database Tools
  • AI Website Builders
  • AI Testing Tools
  • LLM Evaluations
Follow Us
  • X / Twitter
  • LinkedIn
  • Reddit
  • Discord
  • Threads
  • Bluesky
  • Mastodon
  • YouTube
  • GitHub
  • Instagram
Get Started
  • About
  • Editorial Standards
  • Corrections & Disclosures
  • Community Guidelines
  • Advertise
  • Contact Us
  • Newsletter
  • Submit a Tool
  • Start a Discussion
  • Write A Blog
  • Share A Build
  • Terms of Service
  • Privacy Policy
Explore with AI
  • ChatGPT
  • Gemini
  • Claude
  • Grok
  • Perplexity
Agent Experience
  • llms.txt
Theme
With AI, Everyone is a Dev. EveryDev.ai © 2026
    1. Home
    2. Tools
    3. PixelRAG
    PixelRAG icon

    PixelRAG

    Retrieval-Augmented Generation
    Featured

    Pixel-native visual retrieval-augmented generation system that searches documents by screenshot tiles using vision embeddings instead of text parsing.

    Visit Website

    At a Glance

    Pricing
    Open Source

    Fully open-source under Apache-2.0. Includes hosted public API endpoint (no API key required), full pipeline source code, pre-built FAISS indexes, and LoRA adapter weights on Hugging Face.

    Engagement

    Available On

    Windows
    macOS
    Linux
    Web
    API

    Resources

    WebsiteDocsGitHubllms.txt

    Topics

    Retrieval-Augmented GenerationVector DatabasesAI Development Libraries

    Alternatives

    LEANNturbovecFaiss
    Developer
    StarTrail-orgBerkeley, CAEst. 2025$64M raised

    Listed Jun 2026

    About PixelRAG

    PixelRAG is an open-source RAG system from Berkeley SkyLab, BAIR, and the Berkeley NLP Group that renders documents to screenshot tiles and retrieves over the images directly using a vision-language embedding model. Rather than parsing HTML or extracting text, it embeds page screenshots into a vector space where tables, charts, layout, and infographics remain intact and searchable. The project ships with a live hosted API indexing 8.28M Wikipedia articles across 28.1M screenshot tiles, and the full pipeline is available for self-hosting under the Apache-2.0 license.

    What It Is

    PixelRAG is a pixel-native retrieval pipeline that replaces text chunking with screenshot-based embedding. The core insight, documented in the accompanying research paper "PIXELRAG: Web Screenshots Beat Text for Retrieval-Augmented Generation," is that text extraction discards layout, tables, figures, and styling — signals that make a page legible and answerable. By rendering pages to image tiles and embedding them with Qwen3-VL-Embedding (LoRA fine-tuned on screenshot data), PixelRAG retrieves visually structured content that text-based RAG cannot reach. The pipeline covers four stages: render (Playwright CDP or PDF), embed (Qwen3-VL-Embedding), index (FAISS IVF), and serve (FastAPI).

    Architecture and Pipeline

    The system is modular and installable in stages via pip extras:

    • pixelshot — renders web pages or PDFs to image tiles using headless Chromium via Playwright CDP
    • pixelrag chunk / embed / build-index — converts tiles to vectors and builds a FAISS IVF index
    • pixelrag serve — exposes a FastAPI search endpoint accepting text, image, or hybrid (text + image) queries
    • pixelrag index — orchestrates the full source-to-index pipeline from a YAML config

    The hosted Wikipedia index spans 214 GB of FAISS data with 2048-dimensional embeddings. Pre-built indexes and the LoRA-fine-tuned adapter weights are published on Hugging Face, and the full training dataset (Chrisyichuan/screenshot-training-natural-filtered-v2) is also released for adapting other backbones.

    Agent Integration and Claude Plugin

    The repository includes a Claude Code plugin called pixelbrowse that gives Claude the ability to screenshot any URL and read the resulting image rather than fetching raw HTML. This lets Claude see charts, diagrams, tables, and layout as a person would. The plugin calls the pixelshot CLI locally — no MCP server or backend required. The search API is also a plain HTTP endpoint compatible with any agent framework that supports tool use, including Claude tool-use, OpenAI function calling, and LangChain.

    Self-Hosting and Deployment

    PixelRAG runs on Linux (CUDA) and macOS (Apple Silicon / MPS), with CPU fallback. Building a custom index from local documents or PDFs requires only a pixelrag.yaml config file pointing at a source directory. The training pipeline lives in a separate uv project inside train/ with pinned dependencies (torch==2.9.1+cu129, transformers==4.57.1, cuDNN 9.20). The hosted public endpoint at https://api.pixelrag.ai requires no API key and accepts text or base64-encoded image queries.

    Update: v0.3.0

    The latest release is v0.3.0, published on 2026-06-23, with the repository last pushed the same day. The GitHub repository was created in May 2026 and has accumulated over 3,500 stars and 320 forks since launch, reflecting rapid early adoption in the research and developer community. The project is actively maintained under the StarTrail-org GitHub organization, which also maintains the LEANN project.

    PixelRAG - 1

    Community Discussions

    Be the first to start a conversation about PixelRAG

    Share your experience with PixelRAG, ask questions, or help others learn from your insights.

    Pricing

    OPEN SOURCE

    Open Source

    Fully open-source under Apache-2.0. Includes hosted public API endpoint (no API key required), full pipeline source code, pre-built FAISS indexes, and LoRA adapter weights on Hugging Face.

    • Apache-2.0 license
    • Hosted public search API (no API key)
    • 8.28M Wikipedia article index
    • Full pipeline source code
    • Pre-built FAISS indexes on Hugging Face

    Capabilities

    Key Features

    • Pixel-native visual retrieval over screenshot tiles
    • Text, image, and hybrid (text + image) query support
    • Pre-built hosted index of 8.28M Wikipedia articles (28.1M tiles)
    • Qwen3-VL-Embedding with LoRA fine-tuning for screenshot retrieval
    • FAISS IVF index with 2048-dimensional embeddings
    • FastAPI search server (CPU and GPU)
    • pixelshot CLI for rendering web pages and PDFs to image tiles
    • Claude Code plugin (pixelbrowse) for agent visual browsing
    • Self-hostable pipeline with YAML config
    • Modular pip extras (render, embed, index, serve)
    • Pre-built FAISS indexes and LoRA adapters on Hugging Face
    • Public training dataset released for custom backbone fine-tuning
    • No API key required for hosted endpoint

    Integrations

    Claude Code (pixelbrowse plugin)
    OpenAI function calling
    LangChain
    Playwright (Chromium CDP)
    FAISS
    Qwen3-VL-Embedding
    Hugging Face
    FastAPI
    Google Colab
    API Available
    View Docs

    Ratings & Reviews

    No ratings yet

    Be the first to rate PixelRAG and help others make informed decisions.

    Developer

    StarTrail-org

    StarTrail-org builds open-source AI infrastructure for pixel-native search and retrieval, with research roots at Berkeley SkyLab, BAIR, and the Berkeley NLP Group. The team develops PixelRAG, a visual RAG pipeline that replaces text parsing with screenshot-based embedding, and LEANN, a related project in efficient approximate nearest-neighbor search. Contributors include researchers from UC Berkeley with backgrounds in distributed systems, NLP, and vision-language models. The organization releases full training datasets, model adapters, and pre-built indexes alongside its codebases.

    Founded 2025
    Berkeley, CA
    $64M raised
    10 employees

    Used by

    Open-source AI community
    Berkeley research labs
    Read more about StarTrail-org
    WebsiteGitHubLinkedIn
    2 tools in directory

    Similar Tools

    LEANN icon

    LEANN

    A low-storage vector index that enables private, on-device RAG on millions of documents using 97% less storage than traditional vector databases.

    turbovec icon

    turbovec

    A Rust vector index with Python bindings built on Google Research's TurboQuant algorithm, offering 2–4 bit compression and SIMD-accelerated search faster than FAISS.

    Faiss icon

    Faiss

    A library for efficient similarity search and clustering of dense vectors, developed by Meta's Fundamental AI Research group, supporting both CPU and GPU execution.

    Browse all tools

    Related Topics

    Retrieval-Augmented Generation

    RAG Systems that enhance LLM outputs by retrieving relevant information from external knowledge bases, combining the power of generative AI with information retrieval for more accurate and contextual responses.

    91 tools

    Vector Databases

    Specialized databases optimized for storing and retrieving vector embeddings that power semantic search, recommendation systems, and other AI applications with similarity matching.

    30 tools

    AI Development Libraries

    Programming libraries and frameworks that provide machine learning capabilities, model integration, and AI functionality for developers.

    228 tools
    Browse all topics
    Back to all toolsSuggest an edit
    ratings
    discussions