Fish Audio
Fish Audio provides studio-grade AI text-to-speech and voice cloning tools with emotion control, bridging the gap between synthetic and natural speech.
Founding Story
Founded by a Gen Z team led by former NVIDIA researcher Leng Yue, who turned a personal focus on human-level AI voice synthesis (reportedly motivated by 'heartbreak') into a high-growth startup that scaled from $400k to $5M ARR in early 2025.
Discussions
No discussions yet
Be the first to start a discussion about Fish Audio
Leadership
Founders
Leng Yue (冷月)
Founder of Fish Audio and former NVIDIA researcher. A prolific open-source developer who turned a personal interest in human-level AI voice synthesis into a business scaling to $5M ARR.
Shijia Liao
Chief Scientist at Fish Audio. Former researcher at NVIDIA and University of Maryland. Expert in vision foundation models and multi-modality models.
Executive Team
Leng Yue
CEO & Founder
Former NVIDIA researcher and open-source developer.
Shijia Liao
Chief Scientist
Former NVIDIA and UMD researcher specializing in multi-modality models.
Business Model
Revenue Model
Subscription-based tiered plans and pay-as-you-go API usage.
Pricing Tiers
8,000 credits monthly, up to 7 mins of S1 generation, 3 public voice slots.
250,000 credits monthly, up to 200 mins S1 generation, unlimited public + 10 private slots, commercial use.
2,000,000 credits monthly, up to 27 hours S1 generation, unlimited voice slots, commercial use.
Target Markets
- AI Developers
- Content Creators
- Gaming Industry
- Enterprise Solutions
- AI companions
- Content creation
- Game development
- Voice-overs
- Real-time avatars
- 20,000+ active developers
- Over 1.2M creators
History & Milestones
Open-sourced Fish Audio S2, featuring fine-grained control and production streaming capabilities.
Reached $400,000 in annualized recurring revenue (ARR).
Scaled revenue to over $5 million ARR within four months.
Launched OpenAudio S1, featuring real-time emotional control for AI voice acting.
Publicly launched Fish Audio S1, a frontier text-to-speech audio foundation model.
1 AI Tool by Fish Audio
Fish Audio
14hFish Audio is an AI-powered text-to-speech and voice cloning platform that lets users generate realistic voices and create custom voice models.
