Main Menu
  • Tools
  • Developers
  • Topics
  • Discussions
  • News
  • Blogs
  • Builds
  • Contests
  • Compare
Create
    EveryDev.ai
    Sign inSubscribe
    Home
    Developers

    1,764+ AI companies

    • Radar
    • Trending
    1. Home
    2. Developers
    3. Vals AI, Inc.

    Vals AI, Inc.

    Vals AI provides independent, standardized benchmarks for evaluating large language models and AI applications on real-world enterprise tasks. The company aims to bridge the gap between theoretical AI advancements and practical business applications by offering transparent, unbiased evaluations across industries like legal, finance, healthcare, and coding.

    Visit Website

    At a Glance

    1Tool Listed
    6Products
    50Tool Views
    15Capabilities
    Discussions
    San Francisco, CaliforniaHeadquarters
    2023Est.
    $5MRaised
    Focus Areas
    Automated Testing
    Performance Metrics
    Academic Research
    Connect
    Latest News
    The Winners (and Losers) of This New Vibe-Coding Benchmark Will Surprise YouNov 21, 2025
    Why Nvidia Keeps Backing Would-Be Competitors to OpenAINov 13, 2025
    Markets
    • Legal firms and legal service providers
    • Financial services and banking institutions
    • Healthcare organizations
    • AI labs and model developers (OpenAI, Anthropic, Google, etc.)
    • +6 more

    AI Tools by Vals AI, Inc.

    (1)
    View Vals AI
    Vals AI tool icon

    Vals AI

    LLM Evaluation Platform

    Automated TestingPerformance MetricsAcademic Research

    Discussions

    No discussions yet

    Be the first to start a discussion about Vals AI, Inc.

    Latest News

    11/21/2025

    The Winners (and Losers) of This New Vibe-Coding Benchmark Will Surprise You

    Inc - https://www.inc.com/ben-sherry/the-winners-and-losers-of-this-new-vibe-coding-benchmark-will-surprise-you/91266938
    11/13/2025

    Why Nvidia Keeps Backing Would-Be Competitors to OpenAI

    The Information - https://www.theinformation.com/articles/nvidia-keeps-backing-competitors-openai
    11/12/2025

    Readme: Human vs machine vs legal machine - A study of AI and legal research

    Dentons - https://www.dentons.com/en/insights/articles/2025/november/12/readme-human-vs-machine-vs-legal-machine-a-study-of-ai-and-legal-research
    10/26/2025

    OpenAI's Less-Flashy Rival Might Have a Better Business Model

    Wall Street Journal - https://www.wsj.com/tech/ai/anthropic-business-model-ai-9e26b4ef

    Products & Services

    6
    Vals Index

    Public enterprise LLM benchmarks that rank model performance across real-world business tasks in finance, legal, coding, and other domains

    Vals Enterprise Platform

    Comprehensive evaluation platform for testing LLMs and LLM applications with automated testing, expert review, CI/CD integration, and performance analytics

    Industry-Specific Benchmarks

    Specialized benchmarks for legal, finance, healthcare, tax, and other verticals to evaluate AI model performance on domain-specific tasks

    Legal AI Benchmark (VLAIR)
    February 27, 2025

    First-of-its-kind legal AI benchmarking study evaluating AI platforms against human lawyer baselines on real-world legal tasks

    Market Position

    Vals AI positions itself as the independent, neutral evaluator of AI models, distinguishing itself from self-reported benchmarks by AI companies. The company focuses on real-world, industry-specific tasks rather than academic benchmarks, and addresses data contamination issues in traditional evaluation methods. Competitors include WitnessAI, Modulos, Armilla AI, Credo AI, and others in the AI governance and evaluation space. Vals AI has established credibility through partnerships with top law firms, AI vendors, and academic institutions.

    Leadership

    Founders

    RK

    Rayan Krishnan

    Co-Founder & CEO. 24 years old (as of 2025), abandoned Ph.D. plans to start Vals AI after ChatGPT's release. Previous experience at Palantir, Microsoft, University of Washington, and SAP Concur. Based at Stanford University.

    LN

    Langston Nashold

    Co-Founder & CTO. Dropped out of AI-focused master's program at Stanford to pursue Vals AI. Stanford CS, Andrew Ng's AI + Climate Change Lab, previously worked at Hudson River Trading.

    Executive Team

    RK

    Rayan Krishnan

    Co-Founder & CEO

    24 years old, former experience at Palantir, Microsoft, University of Washington, and SAP Concur

    LN

    Langston Nashold

    Co-Founder & CTO

    Stanford CS, Andrew Ng's AI + Climate Change Lab, previously at Hudson River Trading

    Founding Story

    Founded in 2023 by Rayan Krishnan and Langston Nashold, who both dropped out of their AI-focused master's program at Stanford University to pursue their vision. Following ChatGPT's release, they recognized a critical gap in the tech industry: the lack of an independent, standardized test to evaluate AI services. They saw the need for a neutral, third-party review system for large language models, addressing issues like data contamination in existing benchmarks and the need for industry-specific evaluation rather than generic tests.

    Business Model

    Revenue Model

    Enterprise subscriptions and API access for evaluation platform. Provides both public free benchmarks and paid enterprise platform for companies to run custom evaluations. Revenue from AI labs, model developers, enterprise customers, and legal/financial firms needing evaluation services.

    Private company, no IPO plans announced

    Target Markets

    Industries & Segments
    • Legal firms and legal service providers
    • Financial services and banking institutions
    • Healthcare organizations
    • AI labs and model developers (OpenAI, Anthropic, Google, etc.)
    • Enterprise software companies building AI applications
    • Legal technology vendors
    Use Cases
    • Evaluating LLM suitability for enterprise applications before deployment
    • Benchmarking AI models on legal research, case analysis, and contract review
    • Testing AI performance on financial analysis and Excel-based tasks
    • Measuring accuracy of AI in healthcare and medical applications
    • Auditing LLM applications to replace manual review teams
    • Model selection and purchasing decisions for enterprises
    Notable Customers
    • Anthropic
    • Google
    • OpenAI
    • Everlaw

    Quick Facts

    Headquarters
    San Francisco, California
    Founded
    2023
    Entity Type
    Inc.
    Total Funding
    $5 million
    Investors
    Sequoia Capital, Bloomberg Beta
    Office Locations
    San Francisco

    Funding History

    Seed$5 million
    July 2024
    Sequoia Capital
    Bloomberg Beta
    Pear VC
    8VC
    J12 Ventures

    History & Milestones

    February 27, 2025

    Published first Legal AI Benchmark Study (VLAIR)

    October 16, 2025

    Released report showing Gen AI tools outperforming lawyers on legal research tasks

    December 2025

    Featured in The Atlantic article on young AI billionaires

    Early 2024

    Successful small preview launch of evaluation platform

    April 11, 2024

    Official public launch featured in Bloomberg

    Key Capabilities

    15
    Industry-specific LLM benchmarking across legal, finance, healthcare, and coding domains
    Automated evaluation framework for LLM applications
    Expert review and annotation capabilities
    CI/CD integration for continuous testing
    Model performance tracking and analytics
    Cost and token usage monitoring

    Integrations & Partnerships

    Platform Integrations

    • CI/CD pipeline integration
    • SDK for Python
    • CLI tools
    • API access
    • Excel integration (for Finance Agent benchmark)
    • Integration with major AI model providers (OpenAI, Anthropic, Google, etc.)

    Key Partnerships

    Legaltech Hubstrategic partnership for legal AI benchmarking
    Stanford Universitycollaboration with researchers
    Reed Smithlegal AI study partner

    Connect

    Website
    vals.ai
    GitHub
    vals-ai
    X / Twitter
    _valsai

    AI Topics

    3

    Vals AI, Inc. focuses on these topics:

    Automated Testing(1)
    Performance Metrics(1)
    Academic Research(1)
    Back to all developers
    Explore AI Tools
    • AI Coding Assistants
    • Agent Frameworks
    • MCP Servers
    • AI Prompt Tools
    • Vibe Coding Tools
    • AI Design Tools
    • AI Database Tools
    • AI Website Builders
    • AI Testing Tools
    • LLM Evaluations
    Follow Us
    • X / Twitter
    • LinkedIn
    • Reddit
    • Discord
    • Threads
    • Bluesky
    • Mastodon
    • YouTube
    • GitHub
    • Instagram
    Get Started
    • About
    • Editorial Standards
    • Corrections & Disclosures
    • Community Guidelines
    • Advertise
    • Contact Us
    • Newsletter
    • Submit a Tool
    • Start a Discussion
    • Write A Blog
    • Share A Build
    • Terms of Service
    • Privacy Policy
    Explore with AI
    • ChatGPT
    • Gemini
    • Claude
    • Grok
    • Perplexity
    Agent Experience
    • llms.txt
    Theme
    With AI, Everyone is a Dev. EveryDev.ai © 2026