SGLang Project

To accelerate AI model inference and reduce compute costs through high-performance optimization and open-source systems.

Visit Website

At a Glance

25Tool Views

San Francisco, CaliforniaHeadquarters

2026Est.

50Employees

AI Development Libraries

Connect

AI Tools by SGLang Project

(1)

SGLang

LLM and VLM Serving Framework

Local Inference AI Infrastructure AI Dev Libraries

Discussions

No discussions yet

Be the first to start a discussion about SGLang Project

Latest News

01/21/2026

Sources: Project SGLang spins out as RadixArk with $400M valuation

TechCrunch

11/19/2025

Introducing Miles — RL Framework To Fire Up Large-Scale MoE Training

LMSYS Blog

03/17/2026

ROCm Support for Miles: Large-Scale RL Post-Training on AMD

RadixArk / LMSYS

10/01/2025

SGLang reaches 400,000 GPUs deployment milestone

SGLang GitHub / Project Updates

Products & Services

SGLang Inference Engine

2023

A high-performance open-source serving framework for large language models (LLMs) and vision language models (VLMs).

Miles RL Framework

Nov 2025

An enterprise-facing reinforcement learning framework tailored for large-scale Mixture-of-Experts (MoE) training and production.

Managed SGLang Hosting

2026

A paid hosting service for enterprises to deploy SGLang with production-grade reliability and low latency.

Market Position

RadixArk positions itself as a higher-performance alternative to vLLM and HuggingFace TGI, focusing on memory efficiency (RadixAttention) and enterprise RL workflows (Miles).

Leadership

Founders

Ying Sheng

Co-founder and CEO of RadixArk. Previously an engineer at Elon Musk's AI startup xAI and a research scientist at Databricks. Key contributor to the SGLang project. Stanford University graduate/Ph.D. candidate.

Lianmin Zheng

Co-founder. Ph.D. from UC Berkeley advised by Ion Stoica. Lead of LMSYS Org projects including FastChat and Vicuna. Co-creator of SGLang.

Ion Stoica

Co-founder and Chairman/Advisor. Professor of Computer Science at UC Berkeley. Co-founder of Databricks, Anyscale, and Conviva.

Executive Team

Ying Sheng

Co-founder and CEO

Former engineer at xAI and research scientist at Databricks; SGLang maintainer.

Lianmin Zheng

Co-founder

Ph.D. from UC Berkeley; lead of LMSYS Org and creator of Vicuna.

Board of Directors

Ion Stoica

Co-founder and Executive Chairman

Lip-Bu Tan

Angel Investor and Advisor (Intel veteran)

Founding Story

RadixArk originated as SGLang in 2023 within the UC Berkeley lab of Databricks co-founder Ion Stoica. The project was created to address the inefficiencies in LLM inference. Following massive community adoption and deployment on over 400,000 GPUs, the core contributors spun the project out into a commercial entity to provide enterprise-grade infrastructure and managed services.

Business Model

Revenue

Not publicly disclosed (ARR in early stages post-spinout).

Revenue Model

Open-core model: Free open-source inference engine (SGLang) and RL framework (Miles) with revenue generated through managed hosting services and enterprise support.

Pricing Tiers

Open Source

Free

Access to core SGLang and Miles code via GitHub.

Managed Hosting

Usage-based / Subscription

Production-grade hosting for SGLang inference.

Private

Target Markets

Industries & Segments

AI Labs
Cloud Service Providers
Enterprise AI Developers
Open-source Community

Use Cases

High-throughput LLM serving
Real-time chat applications
Large-scale RL post-training
Multi-modal model inference

Notable Customers

xAI
Cursor
LMSYS Org

Quick Facts

Headquarters

San Francisco, California, US

Founded

2026

Entity Type

Inc.

Employees

Total Funding

Significant funding from Accel and prominent angels (Valuation ~$400M)

Investors

Accel, Lip-Bu Tan

Office Locations

San Francisco

Funding History

AngelUndisclosed

2025

Lip-Bu Tan

Venture RoundUndisclosed

Jan 2026

$400,000,000 valuation

Accel

History & Milestones

Jan 2026

Officially spun out from UC Berkeley as RadixArk with a $400 million valuation.

Mar 2026

Announced ROCm support for the Miles RL framework to enable large-scale post-training on AMD hardware.

Nov 2025

Introduced 'Miles', an enterprise-grade reinforcement learning (RL) framework for post-training.

2023

SGLang project originated as an open-source research project at Ion Stoica's UC Berkeley lab.

Key Capabilities

RadixAttention (Prefix Caching)

Speculative Decoding

Disaggregated Prefill and Decode

P2P Weight Transfer via RDMA

Multi-backend Support (CUDA, ROCm, TPU, Ascend)

Zero-overhead Scheduler

Integrations & Partnerships

Platform Integrations

OpenAI API compatible
Hugging Face
Docker
Kubernetes
Amazon AWS
Google Cloud

Key Partnerships

UC Berkeley (LMSYS)

NVIDIA (GTC Partner)

AMD (ROCm Integration)

Connect

Website

sglang.io

GitHub

sgl-project

AI Topics

SGLang Project focuses on these topics:

Local Inference(1)

AI Infrastructure(1)

AI Development Libraries(1)

Back to all developers Suggest an edit

SGLang Project

To accelerate AI model inference and reduce compute costs through high-performance optimization and open-source systems.

Visit Website

At a Glance

25Tool Views

San Francisco, CaliforniaHeadquarters

2026Est.

50Employees

AI Development Libraries

Connect

AI Tools by SGLang Project

(1)

SGLang

LLM and VLM Serving Framework

Local Inference AI Infrastructure AI Dev Libraries

Discussions

No discussions yet

Be the first to start a discussion about SGLang Project

Latest News

01/21/2026

Sources: Project SGLang spins out as RadixArk with $400M valuation

TechCrunch

11/19/2025

Introducing Miles — RL Framework To Fire Up Large-Scale MoE Training

LMSYS Blog

03/17/2026

ROCm Support for Miles: Large-Scale RL Post-Training on AMD

RadixArk / LMSYS

10/01/2025

SGLang reaches 400,000 GPUs deployment milestone

SGLang GitHub / Project Updates

Products & Services

SGLang Inference Engine

2023

A high-performance open-source serving framework for large language models (LLMs) and vision language models (VLMs).

Miles RL Framework

Nov 2025

An enterprise-facing reinforcement learning framework tailored for large-scale Mixture-of-Experts (MoE) training and production.

Managed SGLang Hosting

2026

A paid hosting service for enterprises to deploy SGLang with production-grade reliability and low latency.

Market Position

RadixArk positions itself as a higher-performance alternative to vLLM and HuggingFace TGI, focusing on memory efficiency (RadixAttention) and enterprise RL workflows (Miles).

Leadership

Founders

Ying Sheng

Lianmin Zheng

Co-founder. Ph.D. from UC Berkeley advised by Ion Stoica. Lead of LMSYS Org projects including FastChat and Vicuna. Co-creator of SGLang.

Ion Stoica

Co-founder and Chairman/Advisor. Professor of Computer Science at UC Berkeley. Co-founder of Databricks, Anyscale, and Conviva.

Executive Team

Ying Sheng

Co-founder and CEO

Former engineer at xAI and research scientist at Databricks; SGLang maintainer.

Lianmin Zheng

Co-founder

Ph.D. from UC Berkeley; lead of LMSYS Org and creator of Vicuna.

Board of Directors

Ion Stoica

Co-founder and Executive Chairman

Lip-Bu Tan

Angel Investor and Advisor (Intel veteran)

Founding Story

Business Model

Revenue

Not publicly disclosed (ARR in early stages post-spinout).

Revenue Model

Open-core model: Free open-source inference engine (SGLang) and RL framework (Miles) with revenue generated through managed hosting services and enterprise support.

Pricing Tiers

Open Source

Free

Access to core SGLang and Miles code via GitHub.

Managed Hosting

Usage-based / Subscription

Production-grade hosting for SGLang inference.

Private

Target Markets

Industries & Segments

AI Labs
Cloud Service Providers
Enterprise AI Developers
Open-source Community

Use Cases

High-throughput LLM serving
Real-time chat applications
Large-scale RL post-training
Multi-modal model inference

Notable Customers

xAI
Cursor
LMSYS Org

Quick Facts

Headquarters

San Francisco, California, US

Founded

2026

Entity Type

Inc.

Employees

Total Funding

Significant funding from Accel and prominent angels (Valuation ~$400M)

Investors

Accel, Lip-Bu Tan

Office Locations

San Francisco

Funding History

AngelUndisclosed

2025

Lip-Bu Tan

Venture RoundUndisclosed

Jan 2026

$400,000,000 valuation

Accel

History & Milestones

Jan 2026

Officially spun out from UC Berkeley as RadixArk with a $400 million valuation.

Mar 2026

Announced ROCm support for the Miles RL framework to enable large-scale post-training on AMD hardware.

Nov 2025

Introduced 'Miles', an enterprise-grade reinforcement learning (RL) framework for post-training.

2023

SGLang project originated as an open-source research project at Ion Stoica's UC Berkeley lab.

Key Capabilities

RadixAttention (Prefix Caching)

Speculative Decoding

Disaggregated Prefill and Decode

P2P Weight Transfer via RDMA

Multi-backend Support (CUDA, ROCm, TPU, Ascend)

Zero-overhead Scheduler

Integrations & Partnerships

Platform Integrations

OpenAI API compatible
Hugging Face
Docker
Kubernetes
Amazon AWS
Google Cloud

Key Partnerships

UC Berkeley (LMSYS)

NVIDIA (GTC Partner)

AMD (ROCm Integration)

Connect

Website

sglang.io

GitHub

sgl-project

AI Topics

SGLang Project focuses on these topics:

Local Inference(1)

AI Infrastructure(1)

AI Development Libraries(1)

Back to all developers Suggest an edit