LLM Stats
LLM Stats publishes objective leaderboards and benchmark results to show measured model performance rather than marketing claims. The site collects, runs, and displays benchmark results across multiple arenas and datasets, and provides tools to compare models and explore detailed metrics. LLM Stats also offers documentation and an API for programmatic access to results and benchmarks.
- Leaderboards — Browse ranked model leaderboards and see comparative scores across benchmarks and arenas.
- Benchmarks & Arenas — Access curated benchmark suites (MMLU, GPQA, AIME, etc.) and arena results that evaluate models on domain-specific tasks.
- Model comparison — Use the compare tool to view side-by-side performance and metric breakdowns for selected models.
- Playground & API — Use the public playground and consult API documentation to programmatically retrieve benchmark data and model metadata.
- Community & resources — Read blog posts, community posts, and resources about benchmarks and evaluation methodology.
To get started, visit the website to view leaderboards or benchmark pages, use the compare tool to explore differences between models, and consult the documentation to access the API and playground for automated queries.
No discussions yet
Be the first to start a discussion about LLM Stats
Developer
Pricing and Plans
Free
Public access to leaderboards, benchmarks, comparison tools, and API documentation; intended for researchers and practitioners.
- Access to public leaderboards and benchmark results
- Browse benchmark and arena pages
- Model comparison tool and playground
- API documentation for programmatic access