Fish Audio

Fish Audio provides studio-grade AI text-to-speech and voice cloning tools with emotion control, bridging the gap between synthetic and natural speech.

Visit Website

At a Glance

109Tool Views

Mountain View, CAHeadquarters

2024Est.

21Employees

AI Tools by Fish Audio

(1)

Fish Audio

AI Voice Cloning and TTS API

Voice Synthesis Image Audio

Discussions

No discussions yet

Be the first to start a discussion about Fish Audio

Latest News

03/09/2026

Products & Services

Fish Audio S1

November 20, 2025

A frontier text-to-speech audio foundation model touted as the most expressive and natural TTS model on the market.

Fish Audio S2

March 9, 2026

An open-source version of the model featuring fine-grained control and support for production streaming.

Fish Speech (v1.5 / v1.6)

November 2025

Next-generation multilingual text-to-speech and realistic voice cloning engine.

Voice Cloning

Studio-grade voice cloning that sounds like the user with support for emotion control.

Market Position

Challenges incumbents like ElevenLabs by offering superior emotion control, real-time directing capabilities, and open-source models (S2).

Leadership

Founders

Leng Yue (冷月)

Founder of Fish Audio and former NVIDIA researcher. A prolific open-source developer who turned a personal interest in human-level AI voice synthesis into a business scaling to $5M ARR.

Shijia Liao

Chief Scientist at Fish Audio. Former researcher at NVIDIA and University of Maryland. Expert in vision foundation models and multi-modality models.

Executive Team

Leng Yue

CEO & Founder

Former NVIDIA researcher and open-source developer.

Shijia Liao

Chief Scientist

Former NVIDIA and UMD researcher specializing in multi-modality models.

Founding Story

Founded by a Gen Z team led by former NVIDIA researcher Leng Yue, who turned a personal focus on human-level AI voice synthesis (reportedly motivated by 'heartbreak') into a high-growth startup that scaled from $400k to $5M ARR in early 2025.

Business Model

Revenue

$5M+ ARR (as of April 2025)

Revenue Model

Subscription-based tiered plans and pay-as-you-go API usage.

Pricing Tiers

Free

8,000 credits monthly, up to 7 mins of S1 generation, 3 public voice slots.

Plus

$20/mo (or $5.5/mo billed annually at $66)

250,000 credits monthly, up to 200 mins S1 generation, unlimited public + 10 private slots, commercial use.

Pro

$150/mo (or $37.5/mo billed annually at $450)

2,000,000 credits monthly, up to 27 hours S1 generation, unlimited voice slots, commercial use.

Privately held

Target Markets

Industries & Segments

AI Developers
Content Creators
Gaming Industry
Enterprise Solutions

Use Cases

AI companions
Content creation
Game development
Voice-overs
Real-time avatars

Notable Customers

20,000+ active developers
Over 1.2M creators

Quick Facts

Headquarters

Mountain View, CA

Founded

2024

Entity Type

Inc.

Employees

Total Funding

No public venture funding rounds reported; high growth from early revenue.

Office Locations

Mountain View

History & Milestones

March 9, 2026

Open-sourced Fish Audio S2, featuring fine-grained control and production streaming capabilities.

January 2025

Reached $400,000 in annualized recurring revenue (ARR).

April 2025

Scaled revenue to over $5 million ARR within four months.

June 3, 2025

Launched OpenAudio S1, featuring real-time emotional control for AI voice acting.

November 20, 2025

Publicly launched Fish Audio S1, a frontier text-to-speech audio foundation model.

Key Capabilities

Emotion control

Real-time streaming

Voice cloning

2m+ voice library

Multilingual (30+ languages)

Audio separation

Integrations & Partnerships

Platform Integrations

API
GitHub
Product Hunt

Key Partnerships

Nvidia Inception Program

Google Cloud

AWS

Connect

Website

fish.audio

GitHub

fishaudio

AI Topics

Fish Audio focuses on these topics:

Voice Synthesis(1)

Image(1)

Audio(1)

Back to all developers Suggest an edit

Fish Audio

Fish Audio provides studio-grade AI text-to-speech and voice cloning tools with emotion control, bridging the gap between synthetic and natural speech.

Visit Website

At a Glance

109Tool Views

Mountain View, CAHeadquarters

2024Est.

21Employees

AI Tools by Fish Audio

(1)

Fish Audio

AI Voice Cloning and TTS API

Voice Synthesis Image Audio

Discussions

No discussions yet

Be the first to start a discussion about Fish Audio

Latest News

03/09/2026

Fish Audio Open-Sources S2: Fine-Grained Control Meets Production Streaming

fish.audio

02/04/2026

Best Speech to Text APIs 2026: Technical Comparison & Integration Guide

fish.audio

01/29/2026

How to Use SAM Audio for Audio Separation Step by Step

fish.audio

11/20/2025

Launching Fish Audio S1: A Frontier Text-to-Speech Audio Foundation Model

fish.audio

Products & Services

Fish Audio S1

November 20, 2025

A frontier text-to-speech audio foundation model touted as the most expressive and natural TTS model on the market.

Fish Audio S2

March 9, 2026

An open-source version of the model featuring fine-grained control and support for production streaming.

Fish Speech (v1.5 / v1.6)

November 2025

Next-generation multilingual text-to-speech and realistic voice cloning engine.

Voice Cloning

Studio-grade voice cloning that sounds like the user with support for emotion control.

Market Position

Challenges incumbents like ElevenLabs by offering superior emotion control, real-time directing capabilities, and open-source models (S2).

Leadership

Founders

Leng Yue (冷月)

Founder of Fish Audio and former NVIDIA researcher. A prolific open-source developer who turned a personal interest in human-level AI voice synthesis into a business scaling to $5M ARR.

Shijia Liao

Chief Scientist at Fish Audio. Former researcher at NVIDIA and University of Maryland. Expert in vision foundation models and multi-modality models.

Executive Team

Leng Yue

CEO & Founder

Former NVIDIA researcher and open-source developer.

Shijia Liao

Chief Scientist

Former NVIDIA and UMD researcher specializing in multi-modality models.

Founding Story

Business Model

Revenue

$5M+ ARR (as of April 2025)

Revenue Model

Subscription-based tiered plans and pay-as-you-go API usage.

Pricing Tiers

Free

8,000 credits monthly, up to 7 mins of S1 generation, 3 public voice slots.

Plus

$20/mo (or $5.5/mo billed annually at $66)

250,000 credits monthly, up to 200 mins S1 generation, unlimited public + 10 private slots, commercial use.

Pro

$150/mo (or $37.5/mo billed annually at $450)

2,000,000 credits monthly, up to 27 hours S1 generation, unlimited voice slots, commercial use.

Privately held

Target Markets

Industries & Segments

AI Developers
Content Creators
Gaming Industry
Enterprise Solutions

Use Cases

AI companions
Content creation
Game development
Voice-overs
Real-time avatars

Notable Customers

20,000+ active developers
Over 1.2M creators

Quick Facts

Headquarters

Mountain View, CA

Founded

2024

Entity Type

Inc.

Employees

Total Funding

No public venture funding rounds reported; high growth from early revenue.

Office Locations

Mountain View

History & Milestones

March 9, 2026

Open-sourced Fish Audio S2, featuring fine-grained control and production streaming capabilities.

January 2025

Reached $400,000 in annualized recurring revenue (ARR).

April 2025

Scaled revenue to over $5 million ARR within four months.

June 3, 2025

Launched OpenAudio S1, featuring real-time emotional control for AI voice acting.

November 20, 2025

Publicly launched Fish Audio S1, a frontier text-to-speech audio foundation model.

Key Capabilities

Emotion control

Real-time streaming

Voice cloning

2m+ voice library

Multilingual (30+ languages)

Audio separation

Integrations & Partnerships

Platform Integrations

API
GitHub
Product Hunt

Key Partnerships

Nvidia Inception Program

Google Cloud

AWS

Connect

Website

fish.audio

GitHub

fishaudio

AI Topics

Fish Audio focuses on these topics:

Voice Synthesis(1)

Image(1)

Audio(1)

Back to all developers Suggest an edit