LM Arena

Name: LM Arena
Availability: OnlineOnly
Author: LM Arena

Performance Metrics

Web platform for comparing, running, and deploying large language models with hosted inference and API access.

Visit Website

At a Glance

Pricing

Open Source

Free platform for comparing AI models and contributing to crowdsourced leaderboards.

Engagement

78views

0saves

1discussions

Available On

Web

API

Resources

Website Docs GitHub llms.txt

Topics

Performance Metrics User Research LLM Evaluations

About LM Arena

LM Arena provides a web-based environment for running, comparing, and deploying large language models. It focuses on making model evaluation, hosted inference, and simple deployment workflows accessible from a browser and via an API. The platform supports uploading or connecting models, running evaluation workloads, and exposing inference endpoints for applications.

Model comparison run side-by-side evaluations and benchmarks across models.
Hosted inference deploy models to managed endpoints for production usage.
API access programmatically invoke models and integrate into applications.
Custom model uploads bring your own model artifacts for testing and deployment.
Usage monitoring track usage metrics and performance of deployed endpoints.

To get started, sign up on the web app, upload or connect a model, run a comparison job, and create an inference endpoint; use the provided API keys to integrate inference into your applications.

Community Discussions

Create a new discussion about LM Arena

Joe Seifi

13d·Apple, Disney, Adobe, Eventbrite,…

Video Arena is out on LM Arena and it's pretty wild

So LM Arena shipped their Video Arena feature and I've been messing around with it. You can now compare AI video models side by side, things like Sora, Hailuo, Veo 3.1, and a bunch of others. The cool part is it runs through their Discord server and you get to vote on which model output you like bet…

news