Vals AI, Inc.
Vals AI provides independent, standardized benchmarks for evaluating large language models and AI applications on real-world enterprise tasks. The company aims to bridge the gap between AI research advances and practical business use by offering transparent, unbiased evaluations across domains such as legal, finance, healthcare, and coding.
AI Tools by Vals AI, Inc.
- Vals AI: LLM Evaluation Platform
Latest News
- The Winners (and Losers) of This New Vibe-Coding Benchmark Will Surprise You
- Why Nvidia Keeps Backing Would-Be Competitors to OpenAI
- Readme: Human vs machine vs legal machine - A study of AI and legal research
- OpenAI's Less-Flashy Rival Might Have a Better Business Model
Products & Services
- Public enterprise LLM benchmarks that rank model performance across real-world business tasks in finance, legal, coding, and other domains
- Comprehensive evaluation platform for testing LLMs and LLM applications with automated testing, expert review, CI/CD integration, and performance analytics (an illustrative evaluation-loop sketch follows this list)
- Specialized benchmarks for legal, finance, healthcare, tax, and other verticals to evaluate AI model performance on domain-specific tasks
- First-of-its-kind legal AI benchmarking study evaluating AI platforms against human lawyer baselines on real-world legal tasks
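The evaluation platform entry above mentions automated testing and CI/CD integration. As a rough illustration only, not Vals AI's actual API, the following minimal Python sketch shows the general shape of an automated benchmark loop: run each task through a model, grade the output against a reference answer, and report an aggregate score. All names (Task, grade_exact_match, run_benchmark, the sample tasks) are hypothetical.

```python
"""Illustrative sketch only: a minimal automated LLM benchmark harness.

Everything here is hypothetical and is NOT Vals AI's actual API; it simply
shows the shape of an evaluation loop with per-task grading and an
aggregate score.
"""

from dataclasses import dataclass
from typing import Callable


@dataclass
class Task:
    prompt: str    # input given to the model
    expected: str  # reference answer used for grading


def grade_exact_match(output: str, expected: str) -> float:
    """Return 1.0 for a normalized exact match, else 0.0."""
    return float(output.strip().lower() == expected.strip().lower())


def run_benchmark(model: Callable[[str], str], tasks: list[Task]) -> float:
    """Run every task through the model and return the mean score."""
    scores = [grade_exact_match(model(t.prompt), t.expected) for t in tasks]
    return sum(scores) / len(scores) if scores else 0.0


if __name__ == "__main__":
    # Stand-in model: in practice this would call a real LLM endpoint.
    def toy_model(prompt: str) -> str:
        return "yes" if "enforceable" in prompt else "unknown"

    tasks = [
        Task("Is a signed NDA enforceable? Answer yes or no.", "yes"),
        Task("Is an unsigned term sheet binding? Answer yes or no.", "no"),
    ]
    print(f"accuracy = {run_benchmark(toy_model, tasks):.2f}")
```

In a CI/CD setting, a step of this kind would typically run against a candidate model or application on every release and fail the build if the aggregate score regresses; that is the generic workflow the platform description refers to.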
Market Position
Vals AI positions itself as the independent, neutral evaluator of AI models, distinguishing itself from self-reported benchmarks by AI companies. The company focuses on real-world, industry-specific tasks rather than academic benchmarks, and addresses data contamination issues in traditional evaluation methods. Competitors include WitnessAI, Modulos, Armilla AI, Credo AI, and others in the AI governance and evaluation space. Vals AI has established credibility through partnerships with top law firms, AI vendors, and academic institutions.
Leadership
Founders
Rayan Krishnan
Co-Founder & CEO. 24 years old (as of 2025); abandoned Ph.D. plans to start Vals AI after ChatGPT's release. Previously at Palantir, Microsoft, the University of Washington, and SAP Concur. Based at Stanford University.
Langston Nashold
Co-Founder & CTO. Dropped out of an AI-focused master's program at Stanford to pursue Vals AI. Stanford CS; worked in Andrew Ng's AI + Climate Change Lab; previously at Hudson River Trading.
Executive Team
Rayan Krishnan
Co-Founder & CEO
24 years old; previously at Palantir, Microsoft, the University of Washington, and SAP Concur
Langston Nashold
Co-Founder & CTO
Stanford CS, Andrew Ng's AI + Climate Change Lab, previously at Hudson River Trading
Founding Story
Founded in 2023 by Rayan Krishnan and Langston Nashold, who both dropped out of their AI-focused master's program at Stanford University to pursue the idea. Following ChatGPT's release, they recognized a critical gap in the tech industry: the lack of an independent, standardized test for evaluating AI services. They saw the need for a neutral, third-party review system for large language models, one that would address data contamination in existing benchmarks and replace generic tests with industry-specific evaluation.
Business Model
Revenue Model
Enterprise subscriptions and API access to the evaluation platform. The company offers free public benchmarks alongside a paid enterprise platform for running custom evaluations. Revenue comes from AI labs, model developers, enterprise customers, and legal and financial firms that need evaluation services.
Target Markets
- Legal firms and legal service providers
- Financial services and banking institutions
- Healthcare organizations
- AI labs and model developers (OpenAI, Anthropic, Google, etc.)
- Enterprise software companies building AI applications
- Legal technology vendors
Use Cases
- Evaluating LLM suitability for enterprise applications before deployment
- Benchmarking AI models on legal research, case analysis, and contract review
- Testing AI performance on financial analysis and Excel-based tasks
- Measuring accuracy of AI in healthcare and medical applications
- Auditing LLM applications to replace manual review teams
- Model selection and purchasing decisions for enterprises
Notable Customers & Partners
- Anthropic
- OpenAI
- Everlaw