Cartesia
Cartesia's mission is to build the next generation of AI: ubiquitous, interactive intelligence that runs wherever you are, with a focus on real-time multimodal models.
At a Glance
AI Tools by Cartesia
Cartesia Sonic
Low Latency Text to Speech
Latest News
Cartesia Raises $100 Million to Transform Real-Time Voice AI with Sonic-3
Introducing Line: The Modern Voice Agent Development Platform
Introducing Ink: speech-to-text models for real-time conversation
Cartesia, voice AI startup, raises $64 million Series A
Products & Services
Sonic: Flagship text-to-speech (TTS) model designed for ultra-low latency (90 ms) and high-quality voice generation. Supports voice cloning and 17+ locales.
Ink: Streaming speech-to-text (STT) model, billed as the fastest available, built for real-time conversation with multilingual support.
Line: Voice agent development platform providing SDKs, telephony integration, and analytics for building AI agents.
Market Position
Cartesia positions itself as a pioneer of State Space Models (SSMs) and claims the world's fastest and most natural real-time conversational AI models, with ultra-low latency.
Leadership
Founders
Karan Goel
CEO and Founder. Previously a Research Scientist at Snorkel AI and Salesforce AI Research. PhD from Stanford AI Lab where he co-invented State Space Models (SSMs).
Albert Gu
Chief Scientist and Cofounder. Assistant Professor at Carnegie Mellon University. PhD from Stanford AI Lab where he co-invented State Space Models (SSMs) and S4 models.
Arjun Desai
Cofounder. PhD from Stanford AI Lab specializing in AI and medical imaging.
Brandon Yang
Cofounder. Previously at Google and a PhD candidate at Stanford.
Chris Ré
Cofounder. Associate Professor at Stanford University. Founder of Snorkel AI and Lattice Data (acquired by Apple). MacArthur Fellow.
Executive Team
Karan Goel
CEO and Founder
PhD from Stanford AI Lab, formerly Research Scientist at Snorkel AI and Salesforce AI Research.
Albert Gu
Chief Scientist and Cofounder
Assistant Professor at CMU, co-inventor of S4 and State Space Models.
Founding Story
The founding team met at Stanford AI Lab, where they invented State Space Models (SSMs), a new architecture for large-scale foundation models that achieves state-of-the-art results in audio, text, and video.
Business Model
Revenue Model
Credit-based API usage, sold through tiered monthly or annual subscriptions.
Pricing Tiers
20K credits for models, $1 prepaid for agents, personal use.
100K credits for models, $5 prepaid for agents, instant voice cloning.
1.25M credits for models, $49 prepaid for agents, pro voice cloning.
8M credits for models, $299 prepaid for agents, priority support.
Custom usage, enterprise-grade security and compliance (SOC-2, HIPAA).
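As a rough illustration of how the credit allowances above could be compared against an expected workload, the sketch below picks the smallest listed tier that covers a given number of model credits. The tier labels (`tier_1` to `tier_4`) and the function name are hypothetical placeholders, since the listing does not name the tiers; the credit figures are the ones listed. How credits map to characters or audio seconds is not stated here, so the sketch works in credits only.

```python
from typing import Optional

# Model credits per tier, copied from the pricing list.
# Tier labels are hypothetical placeholders; the listing does not name the tiers.
TIER_CREDITS = [
    ("tier_1", 20_000),
    ("tier_2", 100_000),
    ("tier_3", 1_250_000),
    ("tier_4", 8_000_000),
]


def smallest_tier_for(credits_needed: int) -> Optional[str]:
    """Return the first tier whose allowance covers credits_needed.

    Returns None when no listed tier is large enough, which is where the
    custom-usage (enterprise) plan would apply.
    """
    if credits_needed < 0:
        raise ValueError("credits_needed must be non-negative")
    for label, credits in TIER_CREDITS:
        if credits_needed <= credits:
            return label
    return None


if __name__ == "__main__":
    for need in (15_000, 500_000, 9_000_000):
        print(need, smallest_tier_for(need))
```

A workload that exceeds the largest listed allowance returns `None`, which corresponds to the custom-usage plan rather than one of the fixed tiers.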
Target Markets
- Developers
- Enterprise
- AI Agent builders
- Voice agents
- Customer service
- Gaming
- Localization
- Healthcare
- Sales
Notable Customers
- 11x
- Tencent Cloud