LLM Stats

LLM Stats publishes objective leaderboards and benchmark results that show measured model performance rather than marketing claims. The site runs benchmarks, then collects and displays the results across multiple arenas and datasets, and provides tools to compare models and explore detailed metrics. It also offers documentation and an API for programmatic access to results and benchmarks.

  • Leaderboards — Browse ranked model leaderboards and see comparative scores across benchmarks and arenas.
  • Benchmarks & Arenas — Access curated benchmark suites (MMLU, GPQA, AIME, etc.) and arena results that evaluate models on domain-specific tasks.
  • Model comparison — Use the compare tool to view side-by-side performance and metric breakdowns for selected models.
  • Playground & API — Use the public playground and consult API documentation to programmatically retrieve benchmark data and model metadata.
  • Community & resources — Read blog posts, community posts, and resources about benchmarks and evaluation methodology.

To get started, visit the website to view leaderboards or benchmark pages, use the compare tool to explore differences between models, and consult the documentation to access the API and playground for automated queries.

Developer

ZeroEval operates LLM Stats and publishes verifiable, high-quality benchmarks and leaderboards for AI models. The team builds evaluatio…

Pricing and Plans

Free

Public access to leaderboards, benchmarks, comparison tools, and API documentation; intended for researchers and practitioners.

  • Access to public leaderboards and benchmark results
  • Browse benchmark and arena pages
  • Model comparison tool and playground
  • API documentation for programmatic access

System Requirements

  • Operating System: Any OS with a modern web browser
  • Memory (RAM): 4 GB+ RAM
  • Processor: Any modern 64-bit CPU
  • Disk Space: No local storage required (cloud-based)

AI Capabilities

Benchmarking
Model-evaluation
Leaderboards
Metrics-visualization