EveryDev.ai
Sign inSubscribe
Explore AI Tools
  • AI Coding Assistants
  • Agent Frameworks
  • MCP Servers
  • AI Prompt Tools
  • Vibe Coding Tools
  • AI Design Tools
  • AI Database Tools
  • AI Website Builders
  • AI Testing Tools
  • LLM Evaluations
Follow Us
  • X / Twitter
  • LinkedIn
  • Reddit
  • Discord
  • Threads
  • Bluesky
  • Mastodon
  • YouTube
  • GitHub
  • Instagram
Get Started
  • About
  • Editorial Standards
  • Corrections & Disclosures
  • Community Guidelines
  • Advertise
  • Contact Us
  • Newsletter
  • Submit a Tool
  • Start a Discussion
  • Write A Blog
  • Share A Build
  • Terms of Service
  • Privacy Policy
Explore with AI
  • ChatGPT
  • Gemini
  • Claude
  • Grok
  • Perplexity
Agent Experience
  • llms.txt
Theme
With AI, Everyone is a Dev. EveryDev.ai © 2026
Main Menu
  • Tools
  • Developers
  • Topics
  • Discussions
  • Communities
  • News
  • Podcasts
  • Blogs
  • Builds
  • Contests
  • Compare
  • Arena
Create
    Home
    Developers

    2,193+ AI companies

    • Radar
    • Trending
    1. Home
    2. Developers
    3. web-arena-x

    web-arena-x

    To build realistic, reproducible web environments for training and evaluating autonomous web agents that can handle complex, real-world tasks.

    Visit Website

    At a Glance

    1Tool Listed
    4Products
    5Capabilities
    Discussions
    Pittsburgh, PAHeadquarters
    2023Est.
    15Employees
    Focus Areas
    Agent Harness
    Browser Automation
    LLM Evaluations
    Connect
    Latest News
    TheAgentCompany presented at ICML 2025May 1, 2025
    WebArena presented as Oral at NeurIPS 2024Dec 1, 2024
    Markets
    • AI Research Labs
    • Technology Companies developing AI Agents
    • Open Source AI Community

    AI Tools by web-arena-x

    (1)
    View WebArena
    WebArena tool icon

    WebArena

    Web Agent Benchmark Environment

    Agent HarnessBrowser AutomationLLM Evaluations

    Discussions

    No discussions yet

    Be the first to start a discussion about web-arena-x

    Latest News

    05/01/2025

    TheAgentCompany presented at ICML 2025

    the-agent-company.com
    12/01/2024

    WebArena presented as Oral at NeurIPS 2024

    webarena.dev
    08/01/2024

    VisualWebArena presented at ACL 2024

    jykoh.com
    11/01/2024

    WebArena-Infinity announced

    webarena.dev

    Products & Services

    4
    WebArena
    2023

    A standalone, self-hostable web environment with four popular categories (Shopping, Reddit, GitLab, etc.) for building autonomous agents.

    VisualWebArena
    2024

    A benchmark designed to assess the performance of multimodal web agents on realistic visual web tasks.

    WebArena-Infinity
    2024

    A framework for automatically generating browser environments with verifiable tasks and high authenticity.

    TheAgentCompany
    2025

    An extensible benchmark for evaluating AI agents on professional tasks within a simulated company environment.

    Market Position

    A pioneering realistic benchmark for web agents, focusing on functional correctness and high-authenticity environments rather than just text-based interactions.

    Leadership

    Founders

    SZ

    Shuyan Zhou

    Assistant Professor at Duke University (since 2024); PhD from Carnegie Mellon University; previously researcher at Google[x] and Microsoft Research. Lead contributor to WebArena.

    FF

    Frank F. Xu

    Researcher at Carnegie Mellon University focusing on code generation and autonomous agents. Lead developer and contributor to WebArena and TheAgentCompany.

    GN

    Graham Neubig

    Associate Professor at Carnegie Mellon University; Co-Founder of Inspired Cognition and All Hands AI. Principal investigator for the WebArena project.

    Executive Team

    SZ

    Shuyan Zhou

    Project Lead / Assistant Professor (Duke)

    Specializes in NLP and autonomous agents.

    FF

    Frank F. Xu

    Lead Developer / Researcher (CMU)

    Expert in machine learning and software engineering.

    Board of Directors

    DF
    Daniel Fried
    Faculty Advisor (Carnegie Mellon University)
    YB
    Yonatan Bisk
    Faculty Advisor (Carnegie Mellon University)
    RS
    Ruslan Salakhutdinov
    Advisor (VisualWebArena)

    Founding Story

    WebArena was created to move beyond toy benchmarks and provide a realistic end-to-end environment where agents must interact with complex websites and tools, mimicking human problem-solving workflows.

    Business Model

    Revenue Model

    Open-source research project; supported by academic research grants from CMU, Duke, and affiliated organizations.

    Not applicable (Research Project)

    Target Markets

    Industries & Segments
    • AI Research Labs
    • Technology Companies developing AI Agents
    • Open Source AI Community
    Use Cases
    • Benchmarking autonomous web agents
    • Training LLMs for web-based computer use
    • Research in multimodal AI perception and reasoning
    • Evaluating agent safety and reliability
    Notable Customers
    • Anthropic
    • OpenAI
    • Meta
    • Microsoft

    Quick Facts

    Headquarters
    Pittsburgh, PA
    Founded
    2023
    Entity Type
    Academic Research Collective / Open Source Organization
    Employees
    15
    Total Funding
    Not disclosed (primarily supported via academic research grants)
    Investors
    Carnegie Mellon University, Duke University
    Office Locations
    Carnegie Mellon University
    Duke University

    History & Milestones

    May 2025

    TheAgentCompany benchmark presented at ICML 2025.

    May 2024

    VisualWebArena (multimodal benchmark) presented at ACL 2024.

    December 2024

    WebArena presented as an Oral paper at NeurIPS 2024.

    November 2024

    WebArena-Infinity announced for automated environment generation.

    July 2023

    WebArena paper first released on arXiv, introducing the realistic web benchmark.

    Key Capabilities

    5
    Self-hostable sandboxed web environments
    Programmatic verification of functional correctness
    Diverse task categories (E-commerce, Social, Productivity, Maps)
    Multimodal (text + image) input support
    Automated task and environment generation

    Integrations & Partnerships

    Platform Integrations

    • Docker
    • GitHub
    • Hugging Face
    • arXiv

    Key Partnerships

    Carnegie Mellon University
    Duke University

    Connect

    Website
    webarena.dev
    GitHub
    web-arena-x

    AI Topics

    3

    web-arena-x focuses on these topics:

    Agent Harness(1)
    Browser Automation(1)
    LLM Evaluations(1)
    Back to all developers