Together AI
Together AI is a research-driven AI company empowering developers and researchers to train, fine-tune, and deploy open-source generative AI models at scale through a purpose-built AI acceleration cloud platform.
Founding Story
Together AI was founded in June 2022 by Vipul Ved Prakash, Ce Zhang, Chris Ré, and Percy Liang, driven by the belief that open and decentralized alternatives to closed AI systems would be essential. The founding team, which includes professors at Stanford and the University of Chicago, wanted to democratize AI by making open-source models accessible to developers and enterprises globally. Early on, the team released successful open-source projects including RedPajama, GPT-JT, and OpenChatKit, which garnered support from hundreds of thousands of AI developers.
Leadership
Founders
Vipul Ved Prakash
Co-founded Cloudmark, an anti-spam company that developed Vipul's Razor. Founded Topsy, a social media search and analytics company acquired by Apple for over $200 million in 2013. Background in large-scale distributed systems and information retrieval.
Ce Zhang
Associate Professor at University of Chicago. PhD from University of Wisconsin-Madison, postdoctoral researcher at Stanford under Chris Ré. Research focus on machine learning systems and decentralized computing.
Chris Ré
Professor at Stanford AI Lab, Stanford Center for Research on Foundation Models (CRFM), and Machine Learning Group. Research focus on foundation models and machine learning systems.
Percy Liang
Associate Professor of Computer Science at Stanford University. BS from MIT (2004), PhD from UC Berkeley (2011). Research focus on machine learning and natural language understanding. Co-founder of Stanford Center for Research on Foundation Models.
Executive Team
Vipul Ved Prakash
Founder & CEO
Previously founded Topsy (acquired by Apple for $200M+) and co-founded Cloudmark. Expert in large-scale distributed systems and information retrieval.
Ce Zhang
Founder & CTO
Associate Professor at University of Chicago. PhD from UW-Madison, postdoc at Stanford. Expert in machine learning systems and decentralized computing.
Business Model
Revenue Model
Consumption-based revenue model with per-token pricing for inference, GPU cluster rentals (hourly rates), fine-tuning services (per token), and custom model consulting. No subscription tiers or minimum commitments for serverless offerings. Options for instant clusters, reserved clusters, and Frontier AI Factory for large-scale deployments.
Pricing Tiers
- Serverless inference ranging from $0.05-$3.50 per 1M input tokens and $0.06-$7.00 per 1M output tokens, depending on the model (e.g., Llama 3.3 70B at $0.88/$0.88, DeepSeek-R1 at $3.00/$7.00, Llama 3.2 3B Turbo at $0.06/$0.06)
- Dedicated inference infrastructure with custom pricing based on requirements
- Standard and specialized pricing options for LoRA and full fine-tuning
- Self-service GPU clusters from 64 to 10,000+ GPUs with hourly billing
- Reserved GPU capacity for long-term deployments
- Large-scale GPU deployments for frontier AI development
- Code execution and interpretation services
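The per-token rates above translate directly into per-request costs. A minimal sketch of that arithmetic, using the example rates quoted in this section (actual prices vary by model and may change):

```python
# Per-1M-token rates (input $, output $) as quoted above.
PRICES_PER_1M = {
    "Llama 3.3 70B": (0.88, 0.88),
    "DeepSeek-R1": (3.00, 7.00),
    "Llama 3.2 3B Turbo": (0.06, 0.06),
}

def inference_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Dollar cost of one request: tokens times the per-token rate."""
    in_rate, out_rate = PRICES_PER_1M[model]
    return (input_tokens * in_rate + output_tokens * out_rate) / 1_000_000

# Example: a 2,000-token prompt with a 500-token completion on DeepSeek-R1
print(f"${inference_cost('DeepSeek-R1', 2_000, 500):.4f}")  # $0.0095
```

With no minimum commitments on the serverless tier, this per-request figure is the entire marginal cost of a call.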
Target Markets
- AI-native startups and scale-ups
- Enterprise software companies
- AI researchers and academics
- Open-source AI developers
- AI application developers
- Large enterprises deploying AI
- AI-powered code editors and IDEs
- Voice AI and real-time conversational agents
- AI video generation at scale
- Customer support automation
- AI agents and reasoning models
- Document parsing and information extraction
Notable Customers
- Cursor
- Decagon
- Salesforce
- Zoom
History & Milestones
- Raised $305M Series B led by General Catalyst and co-led by Prosperity7 at a $3.3B valuation
- Acquired CodeSandbox to add built-in code interpretation capabilities
- Became an NVIDIA Cloud Partner in the NVIDIA Partner Network
- Raised $106M in funding led by Salesforce Ventures at a $1.25B valuation
- Grew user base to over 450,000 AI developers
