OLMo

Name: OLMo
Availability: OnlineOnly
Author: Allen Institute for AI

OLMo is Allen AI's fully open-source large language model framework for training, fine-tuning, evaluating, and running inference on state-of-the-art open language models.

Visit Website

At a Glance

Pricing

Open Source

Fully open-source under Apache 2.0. Free to use, modify, and distribute.

Engagement

Available On

macOS

API

SDK

CLI

Allen Institute for AISeattle, WAEst. 2014$40M raised

Updated May 2026

About OLMo

OLMo (Open Language Model) is a fully open-source large language model project from the Allen Institute for AI (Ai2), released under the Apache 2.0 license. It provides not just model weights but also training code, data, and evaluation tools — making it one of the most transparent LLM releases available. The project is designed by scientists, for scientists, and targets researchers and developers who need full access to the entire model development pipeline.

What It Is

OLMo is a repository and model family for training and using Ai2's open language models. Unlike many "open" models that release only weights, OLMo releases the full training codebase, pretraining data, intermediate checkpoints, and evaluation frameworks. The current generation is OLMo-2, which comes in 1B, 7B, 13B, and 32B parameter variants, all available on Hugging Face in both base and instruction-tuned forms.

Two-Stage Pretraining Architecture

OLMo-2 uses a two-stage training procedure:

Stage 1: Large-scale pretraining on 4–5 trillion tokens of mostly web-based data (OLMo-mix-1124 dataset)
Stage 2: Continued training on 50–300 billion tokens of high-quality, targeted data (Dolmino-mix-1124 dataset)

For the 7B and 13B models, Ai2 trains multiple runs with different random seeds and then averages ("soups") the resulting model weights to produce the final checkpoint. All intermediate checkpoints — at minimum every 1,000 training steps — are publicly released.

Update: OLMo-2 and the Move to OLMo-core

The original OLMo repository (github.com/allenai/OLMo) is now marked as out of date and no longer actively maintained. The latest releases, including the OLMo-2 32B model, are developed and maintained in the newer OLMo-core repository (github.com/allenai/OLMo-core). The most recent model variant is OLMo-2-0425-1B, released in April 2025. The legacy repository's latest release tag is v0.6.0 (December 2024).

Inference and Deployment

OLMo models integrate directly with the Hugging Face Transformers library, enabling standard AutoModelForCausalLM and pipeline-based inference. The repository also includes an example script for hosting OLMo-2 on Modal.com with an OpenAI-compatible API endpoint. Quantization via bitsandbytes (8-bit) is supported. Models can be run on CUDA GPUs or Apple Silicon (Mac) devices with PyTorch 2.5+.

Evaluation and Ecosystem

Ai2 maintains separate repositories for evaluation: OLMo-eval and olmes. Instruction-tuned variants (Instruct models) are available for all four size classes. The project also connects to Ai2's broader open model ecosystem, including Tülu 3 (post-training), Molmo (multimodal), and the Ai2 Playground for interactive access.

Why It Matters for Researchers

OLMo's distinguishing characteristic is its commitment to full openness — training configs, WandB training logs, data provenance CSVs, and every checkpoint are publicly accessible. This level of transparency is explicitly positioned by Ai2 as enabling reproducible science and community-driven advancement of language model research, in contrast to models that release only final weights.

Community Discussions

Be the first to start a conversation about OLMo

Share your experience with OLMo, ask questions, or help others learn from your insights.

Pricing

OPEN SOURCE

Open Source

Fully open-source under Apache 2.0. Free to use, modify, and distribute.

OLMo-2 model weights (1B, 7B, 13B, 32B)
Full training source code
Pretraining data access
All intermediate checkpoints
Instruction-tuned variants

Capabilities

Key Features

Fully open-source training, fine-tuning, and inference code
OLMo-2 models in 1B, 7B, 13B, and 32B parameter sizes
Two-stage pretraining on 4–5 trillion tokens
Model weight averaging (model souping) for improved performance
All intermediate checkpoints publicly released
Instruction-tuned variants for all model sizes
Hugging Face Transformers integration
8-bit quantization support via bitsandbytes
OpenAI-compatible API deployment example for Modal.com
Separate evaluation frameworks (OLMo-eval, olmes)
Full training configs and WandB logs published
Apache 2.0 license

Integrations

Hugging Face Transformers

PyTorch

bitsandbytes

Modal.com

WandB

Hugging Face Hub

OLMo-eval

olmes

API Available

View Docs

Back to all tools Suggest an edit

About OLMo

What It Is

Two-Stage Pretraining Architecture

OLMo-2 uses a two-stage training procedure:

Stage 1: Large-scale pretraining on 4–5 trillion tokens of mostly web-based data (OLMo-mix-1124 dataset)
Stage 2: Continued training on 50–300 billion tokens of high-quality, targeted data (Dolmino-mix-1124 dataset)

OLMo