Discussions
Embedded Custom Instructions?
Hi, I want to create agents that do not have a knowledge base - so working purely with the LLM instructions for my prompt. However, I'm seeing some behavior that makes me wonder whether there's extra things added to the prompt (under the hood) for the agents service. If so, is it possible for you to list what's the appended system prompt or, even better, is there a way to disable it? Make sure the agent can be fully guided exclusively based on my prompt?
error 500, ERR_BAD_RESPONSE when requesting in node.js env
Previously I had success in the Python environment but when I changed to the node.js environment I always got error 500, even though the header and payload were more or less the same, what do you think is wrong?
API requirements
Hey, I'm interested in using your APIs, but I couldn't find the following features in your API reference section.
Voice clone limit
I am using the you APIs to clone the voice. I noticed that there is an limit to the audios you can clone. If I purchase your Enterprise Plan can this limit be customised according to the need.
Custom ElevenLabs voice not working for agents
I added a custom voice for elevenlabs together with my request to create an agent by adding
Image cropping in the Agent API - stitch setting doesn't work
While setting up an agent, I tried doing
How do I change the model used by Elevenlabs to generate my voice
I have got my custom voice working through elevenlabs but it does not sound correct unless I use the eleven_turbo_v2 model. I have tried implementing this in my code but to no avail. How can I use the eleven_turbo_v2 model for my voice?
Facial Expressions
How do I incorporate facial expressions using D-iD?
Inquiry on Real-Time Text Streaming Capabilities
I'm utilizing a Large Language Model (LLM) and I'm interested in streaming text inputs to enhance real-time processing with D-ID. Can D-ID support streaming inputs where it begins processing as soon as a word is submitted?
poor video resolution on streams
i've seen this posted multiple times, but i haven't noticed a satisfactory answer. i would like to use the talks/streams/{id} end point to stream videos using text and an eleven labs voice. i've copied the code from https://github.com/de-id/live-streaming-demo and modified it for my use case and it works, but the video quality is poor (low resolution, blurry). the audio is fine. I know that my network is not the problem. at the time i was running the test i was getting 81Mbps download speeds. I turned on chrome://webrtc-internals/ to capture statistics for multiple streaming runs and i was not seeing any packets dropped and the incoming video bitrate was between 250k and 320k bps, so it does not seem that the stream coming from D-ID is taking advantage of all the bandwidth available. What are my options for improving the quality of the video?