Back to Autogpt

Talking Head

docs/integrations/block-integrations/talking_head.md

0.6.441.8 KB
Original Source

Create Talking Avatar Video

What it is

This block is an AI-powered tool that creates video clips featuring a talking avatar using the D-ID service.

What it does

It generates a video of a digital avatar speaking a given script, with customizable voice, presenter, and visual settings.

How it works

The block sends a request to the D-ID API with your specified parameters. It then regularly checks the status of the video creation process until it's complete or an error occurs.

Inputs

InputDescription
API KeyYour D-ID API key for authentication
Script InputThe text you want the avatar to speak
ProviderThe voice provider to use (options: microsoft, elevenlabs, amazon)
Voice IDThe specific voice to use for the avatar
Presenter IDThe visual appearance of the avatar
Driver IDThe animation style for the avatar
Result FormatThe file format of the final video (options: mp4, gif, wav)
Crop TypeHow the video should be cropped (options: wide, square, vertical)
SubtitlesWhether to include subtitles in the video
SSMLWhether the input script uses Speech Synthesis Markup Language
Max Polling AttemptsMaximum number of times to check for video completion
Polling IntervalTime to wait between each status check (in seconds)

Outputs

OutputDescription
Video URLThe web address where you can access the completed video
ErrorA message explaining what went wrong if the video creation failed

Possible use case

A marketing team could use this block to create engaging video content for social media. They could input a script promoting a new product, select a friendly-looking avatar, and generate a video that explains the product's features in an appealing way.