D-ID
D-ID is a digital human platform that generates AI avatar videos and deploys real-time conversational visual agents for marketing, training, sales, and customer experience.
At a Glance
Free trial plan to get started with D-ID Studio and Agents.
Engagement
Available On
Alternatives
Listed Jun 2026
About D-ID
D-ID is a generative AI platform founded in 2017 that lets organizations create multilingual avatar videos and deploy real-time interactive visual agents. The platform is accessible via a self-service web studio, a mobile app, and a developer API, making it usable by both non-technical creators and engineering teams building custom integrations.
What It Is
D-ID sits in the digital human and AI video category. Its core job is to turn static images, scripts, documents, or presentations into polished talking-avatar videos, and to power live, face-to-face conversational agents that can be embedded on websites or in applications. The platform combines deep-learning face animation, LLM text generation, text-to-speech, and voice cloning into a single workflow. According to D-ID's About page, the platform has powered more than 200 million avatar videos and attracted a global community of over 280,000 developers.
Core Product Suite
D-ID ships several distinct products under one platform:
- Creative Reality™ Studio — A self-service web and mobile studio for generating avatar videos from scripts, briefs, decks, or documents. Outputs MP4 at up to 1080p. Supports 119+ languages and a variety of accents.
- Visual AI Agents — Real-time conversational avatars that respond to voice or text input, execute tasks, and can be embedded on any digital touchpoint. The V4 Expressive release adds emotionally intelligent responses.
- AI Avatars — Custom digital humans built from uploaded images or video, with voice cloning and multilingual output.
- Video Translate — Uploads a video in one language and returns a lip-synced, voice-cloned translation in another language.
- Video Campaigns — Personalized video generation at scale for marketing outreach.
- Agentic Videos — A recently launched format described by D-ID as interactive videos that "talk back," combining passive video content with conversational AI.
Deployment Model and API
D-ID describes itself as an API-first company. The streaming API supports real-time avatar animation and is documented at docs.d-id.com. A live-streaming demo repository is publicly available on GitHub under the MIT license, showing how to integrate the streaming API using Node.js and WebSockets. The API generates at what D-ID claims is an industry-leading 120 FPS for real-time streaming. Credits used via the API draw from the same balance as the Studio, with streaming API credits priced at half the standard rate per the FAQ.
Enterprise Positioning and Compliance
D-ID holds ISO 27001, ISO 27017, ISO 27018, ISO 42001, and SOC 2 certifications. Data in transit uses TLS 1.3 and data at rest is encrypted via Transparent Data Encryption S3 Storage. The platform includes built-in content moderation and AI watermarking on all generated videos; Enterprise customers can customize but not remove the watermark. D-ID's About page states that organizations including PepsiCo, Fidelity, J.P. Morgan, SoftBank, NTT, Deutsche Telekom, PwC, Deloitte, Burda Media, AXA Insurance, and Gameloft rely on the platform.
Update: Acquisition of simpleshow and Agentic Videos Launch
In September 2025, D-ID acquired Berlin-based video startup simpleshow, as reported by TechCrunch. The acquisition brought simpleshow's CEO Karsten Boehrs into the D-ID leadership team as President & COO, along with several other simpleshow executives. D-ID describes the deal as uniting enterprise-grade video creation with real-time humanlike interaction into a unified Digital Human platform. Separately, D-ID launched Agentic Videos on Product Hunt, positioning the format as interactive videos that respond to viewer input. The V4 Expressive Visual Agents release, also recent, adds emotionally expressive real-time avatar responses. D-ID Agents received a Special Mention in TIME's Best Inventions of 2024.
Community Discussions
Be the first to start a conversation about D-ID
Share your experience with D-ID, ask questions, or help others learn from your insights.
Pricing
Trial
Free trial plan to get started with D-ID Studio and Agents.
- 200 free agent conversation sessions
- D-ID logo watermark (full-screen watermark on videos)
- Standard AI Presenter up to 1280×1280px
- Premium AI Presenters at 1080p
- Access to Creative Reality Studio on desktop and mobile
Lite
Entry-level paid plan for individuals and small teams.
- Standard AI Presenter output up to 1280×1280px
- Premium AI Presenters not supported
- D-ID logo watermark
- Access to Creative Reality Studio
Pro
Professional plan with premium presenters, voice cloning, and reduced watermark.
- Premium AI Presenter at 1080p
- Generic AI watermark (not D-ID logo)
- 1 cloned voice
- Access to Creative Reality Studio on desktop and mobile
- API access
Advanced
Advanced plan with more voices, moderation bypass option, and manual review.
- Premium AI Presenter at 1080p
- Generic AI watermark
- 3 cloned voices
- Option to bypass built-in moderation with own solution
- Manual review option for rejected images
- API access
Enterprise
Custom enterprise plan with dedicated support, customizable watermark, and volume pricing.
- Customizable AI watermark
- Customizable number of cloned voices
- Dedicated account manager
- 24/7 priority support
- ISO 27001, 27017, 27018, 42001, SOC 2 compliance
- Custom integrations and SLA
- API access
Capabilities
Key Features
- AI avatar video generation from scripts, documents, or presentations
- Real-time conversational visual agents with voice and text input
- Video translation with lip-sync and voice cloning in 120+ languages
- Text-to-image portrait generation powered by Stable Diffusion
- Voice cloning from uploaded audio recordings
- Multilingual support for 119+ languages and accents
- Customizable avatar expressions (happy, serious, surprised, neutral)
- Layered canvas editor with backgrounds, text, and media
- MP4 video output up to 1080p resolution
- Real-time streaming API at up to 120 FPS
- RAG-powered agent knowledge base from PDF, TXT, PPTX, and URLs
- Embeddable agents for websites and applications
- Video Campaigns for personalized outreach at scale
- Agentic Videos — interactive videos that respond to viewer input
- Mobile app for iOS and Android
- ISO 27001, 27017, 27018, 42001, and SOC 2 compliance
- AI watermarking on all generated content
- Built-in content moderation
