Deepgram icon

Deepgram

Deepgram provides a suite of developer APIs for voice and audio processing, including Speech-to-Text, Text-to-Speech, Voice Agent API, and Audio Intelligence. Designed for low-latency, high-accuracy applications, these APIs are used in real-time agent systems, transcription tools, and audio analytics.…

Demo Video for Deepgram

Developer

Deepgram builds speech and audio intelligence APIs that help developers add high-quality voice features to their apps.

Pricing and Plans

PlanPriceFeatures
Pay As You GoContact us
  • Free $200 credit
  • No minimums, no expiration
  • Access to all endpoints in public models
  • Concurrency: Speech-to-text (100 REST / 50 WSS / 5 Whisper), TTS (5), Voice Agent (5), Audio Intelligence (10)
  • Discord and community support
Growth$4000/yearly
  • Prepaid credits with usage-based redemption
  • Same concurrency as Pay As You Go
  • Discord and community support
EnterpriseContact us
  • Best discounts
  • Custom-trained models
  • Priority access to new features
  • Highest concurrency levels
  • Self-hosted deployment options
  • Paid support options
  • Community access

System Requirements

Operating System
Any (API-based)
Memory (RAM)
No specific requirement (cloud-hosted)
Processor
No specific requirement (cloud-hosted)
Disk Space
No local storage needed

AI Capabilities

Speech-to-text transcription
Text-to-speech generation
Voice agent orchestration
Sentiment and topic detection
Speaker identification
Audio summarization
Keyword boosting
Content redaction