Getting Started

With the D-ID API, you can create realistic talking avatars from videos or photos, and build fully interactive agents capable of real-time conversation. This guide explains how to authenticate, send requests, handle responses, and integrate these capabilities directly into your application.
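As a rough illustration of the authenticate, send, and handle-response cycle described above, here is a minimal Python sketch. This guide does not state the base URL, the `/talks` path, the `Basic` authorization scheme, or any of the JSON field names (`source_url`, `script`, `status`, `result_url`), so treat all of them as assumptions to check against the API reference. The helpers only build a request and read a response body; they never touch the network.

```python
import json
import urllib.request
from typing import Optional

# Assumed value: the base URL is not given in this guide.
API_BASE = "https://api.d-id.com"


def build_talk_request(api_key: str, source_url: str, text: str) -> urllib.request.Request:
    """Build (but do not send) a POST request asking for a talking-avatar video."""
    payload = {
        "source_url": source_url,                   # photo to animate (assumed field name)
        "script": {"type": "text", "input": text},  # what the avatar says (assumed shape)
    }
    return urllib.request.Request(
        url=f"{API_BASE}/talks",                    # assumed endpoint path
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Basic {api_key}",    # assumed authorization scheme
            "Content-Type": "application/json",
            "Accept": "application/json",
        },
        method="POST",
    )


def result_url_if_done(response_body: str) -> Optional[str]:
    """Return the video URL once the job reports completion, otherwise None."""
    body = json.loads(response_body)
    if body.get("status") == "done":                # assumed status value
        return body.get("result_url")               # assumed field name
    return None
```

To actually send the request, you could pass the returned object to `urllib.request.urlopen` and feed the decoded response body to `result_url_if_done`, retrying until it returns a URL.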

New Updates

NEW: Agents API is here! ⭐️
By blending the smarts of advanced language models with the warmth of face-to-face communication, D-ID Agents redefine digital connections, making them more personal, engaging, and human. All you have to do is select your Agent’s appearance, choose its voice, describe how you want it to interact, and provide it with documents to augment and personalize its knowledge base. Within minutes, you’ll have a digital person you can speak with just as you would with a real human. Check it out!

NEW: Discover dozens of new HQ Presenters now ready to use! ⭐️
HQ Presenters are Full-HD, photorealistic, medium-shot avatars with body and hand movements, generated from just text or audio input. You can also create a custom HQ Presenter in Full-HD resolution based on your own video footage. Check it out!

NEW: HQ Presenters (Clips) are now streamable in real-time! ⭐️
D-ID's Clips Live Streaming API lets you generate videos of our high-quality digital humans in real time. This powerful functionality opens up various use cases, such as virtual assistants, interactive broadcasting, online education and training, and more. Check it out!

NEW: Tailored API plans for developers! ⭐️
D-ID's tailored API plans are specifically designed to cater to your product's lifecycle. Whether you're in the build phase, scaling up, or ready to launch, we have a plan for you, ensuring that as your needs change, your costs remain predictable and manageable. Check it out!




Live Streaming

D-ID’s API now supports real-time generation of talking-head videos from an image and a text or audio file. Integrate it with your AI chatbot to create face-to-face customer experience (CX) conversations, use it to create real-time video call avatars, or add it to your character-based online game. The possibilities are endless!


Giving a Face to Conversational AI
chat.D-ID is a web app that uses real-time face animation and advanced text-to-speech to create an immersive and human-like conversational AI experience. The free app lets you speak face-to-face with ChatGPT. Try it out live.
Adding a Human Touch to AI
The technology's real-time capabilities can be integrated with both open-domain and closed-domain AI models, enabling businesses of all sizes to create a more personal connection with their clients, employees, and communities.
Real-time video streaming opens up a new world of possibilities
D-ID’s API is robust, massively scalable, and simple to use: you can integrate it in just four lines of code. It now also supports streaming generation of talking-head videos from an image and an audio file, so you can build a whole ecosystem around our platform.

See the Streams endpoint for more details.


Superfast Performance

D-ID renders video at 100 FPS, which is 4x faster than real time, making it the fastest text-to-video solution in the world. Generate your videos at scale: D-ID's API handles tens of thousands of requests in parallel, with unbeatable service and robust performance. Over 150 million videos have been generated to date.
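To make the speed claim concrete: rendering at 100 FPS and being 4x faster than real time together imply a playback rate of 100 / 4 = 25 FPS. That playback rate is derived from the two figures above rather than stated in this guide, so the sketch below treats it as an assumption. The function name `render_seconds` is hypothetical, and the estimate ignores queueing and network time.

```python
def render_seconds(video_seconds: float,
                   render_fps: float = 100.0,
                   playback_fps: float = 25.0) -> float:
    """Estimate pure rendering time for a video of the given length.

    playback_fps=25 is an assumption derived from "100 FPS is 4x real time".
    """
    total_frames = video_seconds * playback_fps  # frames the finished video contains
    return total_frames / render_fps             # seconds needed to render them


# A one-minute video has 1,500 frames at 25 FPS and renders in about 15 seconds.
one_minute = render_seconds(60)
```

Under these assumptions, rendering time is always one quarter of the video's duration.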



Facial Expressions


NEW: Creating engaging visuals is all about capturing the viewer's attention. With D-ID's API, you can take your visuals to the next level by controlling your avatar's facial expressions. Adding expressions can make your avatar more engaging, fun, and lifelike, which helps boost viewer engagement and overall enjoyment of your visuals.

Standard result: neutral expression.
Results with expressions: neutral, happy, surprise, serious.
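As a sketch of how an expression choice might be attached to a request, the Python helper below builds a small configuration dictionary and accepts only the four expressions named above. The field names (`driver_expressions`, `expressions`, `start_frame`, `intensity`) and the 0.0 to 1.0 intensity range are assumptions that do not appear in this guide, and `expression_config` is a hypothetical helper, not part of any D-ID library.

```python
# The four expressions named in this section.
ALLOWED_EXPRESSIONS = ("neutral", "happy", "surprise", "serious")


def expression_config(expression: str, intensity: float = 1.0, start_frame: int = 0) -> dict:
    """Build a config fragment that requests one facial expression.

    All key names and the intensity range are assumptions, not documented here.
    """
    if expression not in ALLOWED_EXPRESSIONS:
        raise ValueError(f"unsupported expression: {expression!r}")
    if not 0.0 <= intensity <= 1.0:
        raise ValueError("intensity must be between 0.0 and 1.0")
    return {
        "driver_expressions": {
            "expressions": [
                {
                    "start_frame": start_frame,  # frame at which the expression begins
                    "expression": expression,
                    "intensity": intensity,
                }
            ]
        }
    }


happy_config = expression_config("happy", intensity=0.8)
```

Validating the expression name locally gives a clear error before any request is sent.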


Learn more about D-ID

Meet the Natural User Interface (NUI) by D-ID, the interface that humanizes interactions with everything digital. Build interfaces that users can talk to and that understand them: a face-to-face conversation with AI.



Support

Have any questions? We're here to help! Go to the Help Center or send us a message.
