Main Menu
  • Tools
  • Developers
  • Topics
  • Discussions
  • Communities
  • News
  • Blogs
  • Builds
  • Contests
  • Compare
  • Arena
Create
    EveryDev.ai
    Sign inSubscribe
    Home
    Developers

    1,995+ AI companies

    • Radar
    • Trending
    1. Home
    2. Developers
    3. Patronus AI, Inc.

    Patronus AI, Inc.

    To boost enterprise confidence in generative AI through automated evaluation, optimization, and security platforms for LLMs and AI agents.

    Visit Website

    At a Glance

    1Tool Listed
    6Products
    29Tool Views
    8Capabilities
    Discussions
    San Francisco, CaliforniaHeadquarters
    2023Est.
    40Employees
    $20MRaised
    Focus Areas
    LLM Evaluations
    Automated Testing
    Observability Platforms
    Connect
    Latest News
    Introducing Generative Simulators: Autonomously Scaling Environments for AgentsDec 17, 2025
    Introducing MEMTRACK: A Benchmark for Agent MemoryOct 14, 2025
    Markets
    • Technology
    • Financial Services
    • Healthcare
    • Education

    AI Tools by Patronus AI, Inc.

    (1)
    View Patronus AI
    Patronus AI tool icon

    Patronus AI

    LLM Evaluation and Monitoring Platform

    LLM EvaluationsAutomated TestingObservability

    Discussions

    No discussions yet

    Be the first to start a discussion about Patronus AI, Inc.

    Latest News

    12/17/2025

    Introducing Generative Simulators: Autonomously Scaling Environments for Agents

    patronus.ai
    10/14/2025

    Introducing MEMTRACK: A Benchmark for Agent Memory

    patronus.ai
    09/25/2025

    Percival Chat: An Eval Copilot for Agentic Systems

    patronus.ai
    08/20/2025

    Patronus Evaluators Launch

    patronus.ai

    Products & Services

    6
    Core Evaluation Platform
    2023

    A centralized solution for running LLM experiments, logging results, and comparing model performance using automated scoring.

    Lynx
    June 2024

    A 70B-parameter hallucination detection model that outperforms GPT-4 on identifying mistakes in LLM outputs.

    Percival
    2025

    An AI evaluation copilot for analyzing complex traces in agentic systems, identifying over 20 failure modes.

    FinanceBench
    October 2023

    A benchmark of 10,000 Q&A pairs used to evaluate LLM performance on financial questions using public documents.

    Market Position

    Leading automated AI evaluation and security platform focused on enterprise reliability and research-backed benchmarks.

    Leadership

    Founders

    AK

    Anand Kannappan

    Co-founder and CEO. Former machine learning researcher at Meta AI, where he focused on large language model evaluation and safety.

    RQ

    Rebecca Qian

    Co-founder and CTO. Former Research Engineer and Machine Learning Engineer at Facebook AI (Meta AI), specializing in agent simulation and LLMs.

    Executive Team

    AK

    Anand Kannappan

    Co-Founder and CEO

    Former ML Researcher at Meta AI.

    RQ

    Rebecca Qian

    Co-Founder and CTO

    Former Research Engineer at Meta AI.

    Board of Directors

    GS
    Glenn Solomon
    Managing Partner at Notable Capital (Board Member)

    Founding Story

    Founded by machine learning experts from Meta AI to address the critical need for automated evaluation tools that detect LLM hallucinations, copyright issues, and safety risks, enabling safe enterprise deployment.

    Business Model

    Revenue
    $629.3K (2024 estimate)

    Revenue Model

    SaaS platform subscription and usage-based API pricing.

    Pricing Tiers

    Individual
    Free

    Includes 20 pages and 5 pages per project.

    Base
    $25/month

    Includes 600 pages with add-ons on demand.

    Enterprise
    Custom

    Unlimited pages, premium security features (on-prem/VPC), and custom fine-tuning.

    API - Small Evaluator
    $10 / 1k calls

    Usage-based API pricing for small evaluator calls.

    API - Large Evaluator
    $20 / 1k calls

    Usage-based API pricing for large evaluator calls.

    Private

    Target Markets

    Industries & Segments
    • Technology
    • Financial Services
    • Healthcare
    • Education
    Use Cases
    • RAG system evaluation
    • AI agent training
    • Financial document QA
    • Model comparison and benchmarking
    • Safety and compliance testing
    Notable Customers
    • AngelList
    • Etsy
    • Pearson

    Quick Facts

    Headquarters
    San Francisco, California
    Founded
    2023
    Entity Type
    Inc.
    Employees
    40
    Total Funding
    $20M
    Investors
    Notable Capital, Lightspeed Venture Partners
    Office Locations
    San Francisco
    New York

    Funding History

    Seed$3,000,000
    September 2023
    Lightspeed Venture Partners
    Series A$17,000,000
    May 2024
    Notable Capital

    History & Milestones

    2025

    Released Percival, an evaluation copilot for agentic systems.

    October 2025

    Launched MEMTRACK, a benchmark for agent memory and state tracking.

    December 2025

    Introduced Generative Simulators for autonomously scaling agent environments.

    May 2024

    Raised $17M Series A funding led by Notable Capital, bringing total funding to $20M.

    June 2024

    Released Lynx, a state-of-the-art 70B hallucination detection model.

    Key Capabilities

    8
    Hallucination detection
    Automated evaluation scoring
    Adversarial testing
    Multimodal model support
    Agent simulation
    Memory benchmarking

    Integrations & Partnerships

    Platform Integrations

    • Datadog
    • GitHub
    • AWS
    • Google Cloud
    • Various LLM providers (OpenAI, Anthropic, etc.)

    Key Partnerships

    Datadog
    Lightspeed Venture Partners

    Connect

    Website
    patronus.ai/
    GitHub
    patronus-ai
    X / Twitter
    PatronusAI

    AI Topics

    3

    Patronus AI, Inc. focuses on these topics:

    LLM Evaluations(1)
    Automated Testing(1)
    Observability Platforms(1)
    Back to all developers
    Explore AI Tools
    • AI Coding Assistants
    • Agent Frameworks
    • MCP Servers
    • AI Prompt Tools
    • Vibe Coding Tools
    • AI Design Tools
    • AI Database Tools
    • AI Website Builders
    • AI Testing Tools
    • LLM Evaluations
    Follow Us
    • X / Twitter
    • LinkedIn
    • Reddit
    • Discord
    • Threads
    • Bluesky
    • Mastodon
    • YouTube
    • GitHub
    • Instagram
    Get Started
    • About
    • Editorial Standards
    • Corrections & Disclosures
    • Community Guidelines
    • Advertise
    • Contact Us
    • Newsletter
    • Submit a Tool
    • Start a Discussion
    • Write A Blog
    • Share A Build
    • Terms of Service
    • Privacy Policy
    Explore with AI
    • ChatGPT
    • Gemini
    • Claude
    • Grok
    • Perplexity
    Agent Experience
    • llms.txt
    Theme
    With AI, Everyone is a Dev. EveryDev.ai © 2026