# D-ID

> D-ID is a digital human platform that generates AI avatar videos and deploys real-time conversational visual agents for marketing, training, sales, and customer experience.

D-ID is a generative AI platform founded in 2017 that lets organizations create multilingual avatar videos and deploy real-time interactive visual agents. The platform is accessible via a self-service web studio, a mobile app, and a developer API, making it usable by both non-technical creators and engineering teams building custom integrations.

## What It Is

D-ID sits in the digital human and AI video category. Its core job is to turn static images, scripts, documents, or presentations into polished talking-avatar videos, and to power live, face-to-face conversational agents that can be embedded on websites or in applications. The platform combines deep-learning face animation, LLM text generation, text-to-speech, and voice cloning into a single workflow. According to D-ID's About page, the platform has powered more than 200 million avatar videos and attracted a global community of over 280,000 developers.

## Core Product Suite

D-ID ships several distinct products under one platform:

- **Creative Reality™ Studio** — A self-service web and mobile studio for generating avatar videos from scripts, briefs, decks, or documents. Outputs MP4 at up to 1080p. Supports 119+ languages and a variety of accents.
- **Visual AI Agents** — Real-time conversational avatars that respond to voice or text input, execute tasks, and can be embedded on any digital touchpoint. The V4 Expressive release adds emotionally intelligent responses.
- **AI Avatars** — Custom digital humans built from uploaded images or video, with voice cloning and multilingual output.
- **Video Translate** — Uploads a video in one language and returns a lip-synced, voice-cloned translation in another language.
- **Video Campaigns** — Personalized video generation at scale for marketing outreach.
- **Agentic Videos** — A recently launched format described by D-ID as interactive videos that "talk back," combining passive video content with conversational AI.

## Deployment Model and API

D-ID describes itself as an API-first company. The streaming API supports real-time avatar animation and is documented at docs.d-id.com. A live-streaming demo repository, publicly available on GitHub under the MIT license, shows how to integrate the streaming API using Node.js and WebSockets. D-ID claims the API generates video at an industry-leading 120 FPS for real-time streaming. API usage draws credits from the same balance as the Studio, and per the FAQ, streaming API usage is charged at half the standard credit rate.
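The shared-balance credit model described above can be illustrated with a small sketch. This is not D-ID code: the `CreditBalance` class, the `charge` method, and the starting balance are hypothetical names and numbers chosen for illustration. The only figure taken from the text is the half-rate multiplier for streaming usage; how a given video or session maps to a credit amount depends on the plan and is not modeled here.

```python
from dataclasses import dataclass
from fractions import Fraction

# From the text above: streaming API usage is charged at half the standard rate.
STREAMING_RATE_MULTIPLIER = Fraction(1, 2)


@dataclass
class CreditBalance:
    """Hypothetical model of one credit pool shared by Studio and API usage."""

    remaining: Fraction

    def charge(self, standard_credits: Fraction, streaming: bool = False) -> Fraction:
        """Deduct usage from the shared balance and return the credits charged.

        `standard_credits` is what the usage would cost at the standard rate.
        """
        multiplier = STREAMING_RATE_MULTIPLIER if streaming else 1
        cost = standard_credits * multiplier
        if cost > self.remaining:
            raise ValueError("insufficient credits")
        self.remaining -= cost
        return cost


balance = CreditBalance(remaining=Fraction(20))  # hypothetical starting balance
balance.charge(Fraction(4))                      # standard usage: charges 4
balance.charge(Fraction(4), streaming=True)      # same usage via streaming: charges 2
print(balance.remaining)                         # prints 14
```

Using `Fraction` rather than floats keeps half-credit amounts exact, which matters if many small streaming charges accumulate against one balance.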

## Enterprise Positioning and Compliance

D-ID holds ISO 27001, ISO 27017, ISO 27018, ISO 42001, and SOC 2 certifications. Data in transit is protected with TLS 1.3, and data at rest is encrypted using Transparent Data Encryption and S3 storage encryption. The platform includes built-in content moderation and AI watermarking on all generated videos; Enterprise customers can customize but not remove the watermark. D-ID's About page states that organizations including PepsiCo, Fidelity, J.P. Morgan, SoftBank, NTT, Deutsche Telekom, PwC, Deloitte, Burda Media, AXA Insurance, and Gameloft rely on the platform.

## Update: Acquisition of simpleshow and Agentic Videos Launch

In September 2025, D-ID acquired Berlin-based video startup simpleshow, as reported by TechCrunch. The acquisition brought simpleshow's CEO Karsten Boehrs into the D-ID leadership team as President & COO, along with several other simpleshow executives. D-ID describes the deal as uniting enterprise-grade video creation with real-time humanlike interaction in a single Digital Human platform. Separately, D-ID launched Agentic Videos on Product Hunt, positioning the format as interactive videos that respond to viewer input. The recent V4 Expressive Visual Agents release adds emotionally expressive real-time avatar responses. D-ID Agents received a Special Mention in TIME's Best Inventions of 2024.

## Features
- AI avatar video generation from scripts, documents, or presentations
- Real-time conversational visual agents with voice and text input
- Video translation with lip-sync and voice cloning in 120+ languages
- Text-to-image portrait generation powered by Stable Diffusion
- Voice cloning from uploaded audio recordings
- Multilingual support for 119+ languages and accents
- Customizable avatar expressions (happy, serious, surprised, neutral)
- Layered canvas editor with backgrounds, text, and media
- MP4 video output up to 1080p resolution
- Real-time streaming API at up to 120 FPS
- RAG-powered agent knowledge base from PDF, TXT, PPTX, and URLs
- Embeddable agents for websites and applications
- Video Campaigns for personalized outreach at scale
- Agentic Videos — interactive videos that respond to viewer input
- Mobile app for iOS and Android
- ISO 27001, 27017, 27018, 42001, and SOC 2 compliance
- AI watermarking on all generated content
- Built-in content moderation

## Integrations
Microsoft PowerPoint, Canva, Google Slides, ElevenLabs (Pro voices), LMS platforms, email marketing platforms, AWS (infrastructure), Microsoft (partner)

## Platforms
Linux, Android, iOS, Web, API, Developer SDK

## Pricing
Freemium — Free tier available with paid upgrades

## Links
- Website: https://www.d-id.com
- Documentation: https://docs.d-id.com/reference/get-started
- Repository: https://github.com/de-id/live-streaming-demo
- EveryDev.ai: https://www.everydev.ai/tools/d-id
