Deepgram
Deepgram provides a suite of developer APIs for voice and audio processing, including Speech-to-Text, Text-to-Speech, Voice Agent API, and Audio Intelligence. Designed for low-latency, high-accuracy applications, these APIs are used in real-time agent systems, transcription tools, and audio analytics. Deepgram supports streaming and batch input, with features like speaker diarization, redaction, sentiment detection, and summarization. The platform also powers Deepgram Saga, a developer-focused Voice OS for integrating intelligent voice experiences.
No discussions yet
Be the first to start a discussion about Deepgram
Demo Video for Deepgram
Developer
Deepgram builds speech and audio intelligence APIs that help developers add high-quality voice features to their apps.
Pricing and Plans
(Freemium)
Pay As You Go
Contact for pricing
- Free $200 credit
- No minimums, no expiration
- Access to all endpoints in public models
- Concurrency: Speech-to-text (100 REST / 50 WSS / 5 Whisper), TTS (5), Voice Agent (5), Audio Intelligence (10)
- Discord and community support
Growth
$4000/year
- Prepaid credits with usage-based redemption
- Same concurrency as Pay As You Go
- Discord and community support
Enterprise
Contact for pricing
- Best discounts
- Custom-trained models
- Priority access to new features
- Highest concurrency levels
- Self-hosted deployment options
- Paid support options
- Community access
Free
Free
- Free version available
System Requirements
Operating System
Any (API-based)
Memory (RAM)
No specific requirement (cloud-hosted)
Processor
No specific requirement (cloud-hosted)
Disk Space
No local storage needed
AI Capabilities
Speech-to-text transcription
Text-to-speech generation
Voice agent orchestration
Sentiment and topic detection
Speaker identification
Audio summarization
Keyword boosting
Content redaction