    LM Arena

    To build the world's most trusted AI evaluation platform that measures AI reliability through real-world human preferences, serving as the voice of humans shaping and improving AI to ensure responsible deployment.

    At a Glance

    Tools listed: 1
    Products: 14
    Tool views: 209
    Capabilities: 22
    Discussions: none yet
    Headquarters: San Francisco, California
    Established: 2023
    Employees: 40
    Raised: $250M
    Focus Areas
    Performance Metrics
    User Research
    LLM Evaluations

    AI Tools by LM Arena (1)

    LM Arena

    LLM Evaluation and Deployment Platform

    Performance Metrics, User Research, LLM Evaluations

    Latest News

    05/03/2023: Launch of Chatbot Arena research project at UC Berkeley (lmarena.ai)
    03/01/2024: Claude 3 surpasses GPT-4 for the first time on Chatbot Arena (Wikipedia)
    04/17/2025: Chatbot Arena rebrands as LMArena and becomes a formal company, Arena Intelligence Inc. (maginative.com)
    05/21/2025: LMArena secures $100M in seed funding at a $600M valuation (prnewswire.com)

    Products & Services (14)

    Chatbot Arena / Text Arena
    May 3, 2023

    Blind, side-by-side model comparisons in which users vote for the better response. The crowdsourced pairwise votes are fitted with a Bradley-Terry model to produce Elo-style rankings.

    WebDev Arena
    December 2024

    AI coding competition for web development challenges. As of November 12, 2025, it is powered by the Code Arena experience.

    Copilot Arena
    November 2024

    VSCode extension for benchmarking AI coding assistants on real-world code completion with paired autocomplete and in-line editing features.

    RepoChat Arena
    November 2024

    Benchmarking environment for AI software engineers working with real-world GitHub codebases.
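
    The Chatbot Arena entry above mentions turning pairwise votes into Elo-style rankings with a Bradley-Terry model. The following is a minimal sketch of that general technique, not LMArena's actual implementation. It assumes votes arrive as (winner, loser) pairs with no ties, that every model has at least one win and one loss, and it uses the textbook minorization-maximization update. The function names, the model names, and the 1000-point base with a 400-point scale are illustrative assumptions.

```python
import math
from collections import defaultdict


def fit_bradley_terry(votes, iterations=200):
    """Estimate Bradley-Terry strengths from (winner, loser) pairs.

    Uses the standard minorization-maximization (MM) update. Strengths are
    rescaled each round so that their geometric mean is 1.
    """
    wins = defaultdict(float)         # total wins per model
    losses = defaultdict(float)       # total losses per model
    pair_counts = defaultdict(float)  # number of comparisons per unordered pair
    models = set()
    for winner, loser in votes:
        if winner == loser:
            raise ValueError("a model cannot be compared with itself")
        wins[winner] += 1.0
        losses[loser] += 1.0
        pair_counts[frozenset((winner, loser))] += 1.0
        models.update((winner, loser))

    # Assumption: without at least one win and one loss per model,
    # the estimate drifts to zero or infinity.
    for m in models:
        if wins[m] == 0 or losses[m] == 0:
            raise ValueError(f"{m} needs at least one win and one loss")

    strength = {m: 1.0 for m in models}
    for _ in range(iterations):
        updated = {}
        for m in models:
            denom = 0.0
            for pair, n in pair_counts.items():
                if m in pair:
                    (other,) = pair - {m}
                    denom += n / (strength[m] + strength[other])
            updated[m] = wins[m] / denom
        log_mean = sum(math.log(s) for s in updated.values()) / len(updated)
        scale = math.exp(log_mean)
        strength = {m: s / scale for m, s in updated.items()}
    return strength


def win_probability(strength, a, b):
    """Bradley-Terry probability that model a beats model b."""
    return strength[a] / (strength[a] + strength[b])


def to_elo_scale(strength, base=1000.0, scale=400.0):
    """Map strengths to an Elo-like scale (base and scale are assumptions)."""
    return {m: base + scale * math.log10(s) for m, s in strength.items()}


# Hypothetical votes: model-a wins three of four head-to-head comparisons.
votes = [("model-a", "model-b")] * 3 + [("model-b", "model-a")]
strengths = fit_bradley_terry(votes)
ratings = to_elo_scale(strengths)
```

    Unlike sequential Elo updates, this fit does not depend on the order in which votes arrive, which is one reason a Bradley-Terry fit suits a static leaderboard built from a large pool of votes.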

    Market Position

    LMArena positions itself as a neutral, science-driven alternative to static academic benchmarks, aiming to be the gold standard for evaluating real-world model performance through human judgment. It differentiates itself from Scale AI (expert-driven, private services) and Hugging Face (automated, objective benchmarks) by providing crowdsourced human preference signals based on actual user interactions. The platform is best known for its crowdsourced AI leaderboards, which have become an industry standard for model makers. It serves as a transparent, reproducible, community-driven infrastructure layer that evaluates models on real-world prompts rather than proprietary or closed tests.

    Leadership

    Founders

    Anastasios N. Angelopoulos

    CEO. PhD from UC Berkeley with expertise in trustworthy AI systems, black-box decision-making, and medical machine learning. Former researcher at Google DeepMind. UC Berkeley postdoc and researcher at Sky Computing Lab.

    Wei-Lin Chiang

    CTO. Studied distributed systems and deep learning frameworks at UC Berkeley SkyLab. Former research experience at Google Research, Amazon, and Microsoft. UC Berkeley postdoc.

    Ion Stoica

    Co-founder and Advisor. UC Berkeley professor and serial founder of Databricks, Anyscale, and Conviva. Advisor to the founding team at Berkeley Sky Computing Lab.

    Executive Team

    Anastasios N. Angelopoulos: Co-Founder and CEO

    Wei-Lin Chiang: Co-Founder and CTO

    Board of Directors

    Anjney Midha: Board Member (Andreessen Horowitz General Partner)
    Peter Deng: Board Observer (Felicis General Partner)
    Ion Stoica: Advisor / Co-founder (UC Berkeley Professor)

    Founding Story

    LMArena began in early 2023 as Chatbot Arena, an academic side project by two UC Berkeley Ph.D. roommates, Anastasios Angelopoulos and Wei-Lin Chiang, at the Berkeley Sky Computing Lab. Originally built to test their own open-source large language model, Vicuna, it addressed an industry-wide problem: technical benchmarks often fail to reflect real-world user experience. The platform was created as a blind taste test for AI models, to determine which ones provide the best user experience for tasks like coding, content creation, and conversation.

    Within one week of launch, the site received 4,700 votes. By December 2023, the Wall Street Journal described it as the AI industry's obsession. In April 2025, the founders announced that Chatbot Arena had become a startup called LMArena (Arena Intelligence Inc.), and in May 2025 the company raised $100 million in seed funding.

    Business Model

    Revenue
    $30 million ARR, reached within four months of launching its commercial AI Evaluations product in September 2025

    Revenue Model

    Freemium model with commercial evaluation services. The core platform is free to the public, which generates the crowdsourced data. Revenue comes from charging model providers and enterprise clients for private arenas, evaluation tooling, analytics dashboards, API/SDK access, and premium support for custom assessments. The reported ARR is an annualized consumption rate based on commercial model evaluations.

    Pricing Tiers

    Free Public Platform
    Free

    Open access to compare AI models, vote on responses, and view public leaderboards. Core participation is free and open to the public.

    AI Evaluations (Enterprise)
    Custom pricing

    Commercial service for enterprises, model labs, and developers. Includes private arenas for proprietary datasets, evaluation tooling, analytics dashboards, diagnostic reports, API/SDK access, and premium support for custom assessments. Pay-as-you-go consumption-based pricing model.

    Private company. No IPO plans announced.

    Target Markets

    Industries & Segments
    • AI developers and researchers
    • Model providers (OpenAI, Google, Anthropic, Meta, xAI, etc.)
    • Enterprises implementing AI systems
    • Software developers and engineers
    • Web developers
    • Content creators and designers
    Use Cases
    • Model benchmarking and performance comparison
    • Specialized task evaluation (coding, math, creative writing)
    • Product development and model selection
    • Search and grounding evaluation
    • Media editing and generation assessment
    • Enterprise model evaluation on proprietary data
    Notable Customers
    • OpenAI
    • Google DeepMind
    • Anthropic
    • Meta

    Quick Facts

    Headquarters: San Francisco, California
    Founded: 2023
    Entity Type: Inc.
    Employees: 40
    Total Funding: $250 million
    Investors: Andreessen Horowitz (a16z), Felicis
    Office Locations: San Francisco

    Funding History

    Seed: $100 million
    Date: May 21, 2025
    Valuation: $600 million
    Investors: Andreessen Horowitz (a16z), UC Investments (University of California), Lightspeed Venture Partners, Laude Ventures, Felicis, Kleiner Perkins, The House Fund

    Series A: $150 million
    Date: January 6, 2026
    Valuation: $1.7 billion
    Investors: Felicis, UC Investments (University of California), Andreessen Horowitz, The House Fund, LDVP, Kleiner Perkins, Lightspeed Venture Partners, Laude Ventures

    History & Milestones

    January 6, 2026

    Raised a $150 million Series A at a $1.7 billion valuation; reached unicorn status

    January 2026

    Reached $30 million ARR; 5 million monthly users across 150 countries and 60 million conversations per month; 50 million community votes

    January 2025

    Users accurately predicted the success of the DeepSeek-R1 model via the rankings

    March 2025

    Launch of Search Arena

    April 2025

    Formal launch of Arena Intelligence Inc. (LMArena) as a company; Chatbot Arena rebrands as LMArena

    Key Capabilities (22)

    Side-by-side blind model comparisons
    Crowdsourced pairwise voting
    Elo-style rankings using Bradley-Terry model
    Mobile-first UI
    Saved chat history
    Endless chat

    Integrations & Partnerships

    Platform Integrations

    • Visual Studio Code (VSCode extension for Copilot Arena)
    • Hugging Face (datasets and leaderboard hosting)
    • GitHub (RepoChat Arena for codebase evaluation)
    • Discord (access to video arenas)
    • Third-party AI models from Qwen, Anthropic, Meta, Minimax, Perplexity, and others
    • fal.ai (platform integration for image generation models)
    • Gradio library (interface framework)

    Key Partnerships

    UC Berkeley Sky Computing Lab
    Hugging Face (hosting leaderboards and datasets)
    Andreessen Horowitz (a16z)

    Connect

    Website: lmarena.ai
    GitHub: lmarena
    X / Twitter: arena

    With AI, Everyone is a Dev. EveryDev.ai © 2026