VLOGGER - AI Does Talking Heads
Written by Sue Gee   
Sunday, 24 March 2024

Developed by Google researchers, VLOGGER AI is a system that can create realistic videos of people talking and moving, using a single still image and an audio clip as input.

As outlined by Enric Corona and his co-authors in the paper "VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis", the technology relies on a pair of machine learning models to synthesize realistic video footage. The first model predicts body movements, facial expressions, and even blinks from the audio. The second takes the predicted body controls from the first stage and uses a temporal diffusion model to iteratively refine each frame, generating a smooth, realistic video of the person talking.
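
To make the two-stage idea concrete, here is a toy sketch in Python. It is not the authors' code and it uses no trained models: the names MotionControls, predict_motion and render_video are hypothetical, an "image" is just a short list of brightness values, and a fixed formula stands in for what a learned diffusion network would estimate. The coupling between neighbouring frames that a temporal model provides is left out for brevity. It shows only the data flow, audio to motion controls to iteratively refined frames.

```python
import random
from dataclasses import dataclass
from typing import List


@dataclass
class MotionControls:
    """Hypothetical per-frame controls; the paper's representation is richer."""
    head_turn: float
    mouth_open: float
    blink: bool


def predict_motion(audio_energy: List[float]) -> List[MotionControls]:
    """Stage 1 stand-in: map one audio value per frame to motion controls."""
    controls = []
    for i, energy in enumerate(audio_energy):
        controls.append(MotionControls(
            head_turn=0.1 * (energy - 0.5),         # louder audio, larger turn
            mouth_open=max(0.0, min(1.0, energy)),  # clamp to [0, 1]
            blink=(i % 4 == 3),                     # arbitrary: every 4th frame
        ))
    return controls


def render_video(reference_image: List[float],
                 controls: List[MotionControls],
                 steps: int = 20,
                 seed: int = 0) -> List[List[float]]:
    """Stage 2 stand-in: refine each frame from noise over several steps."""
    rng = random.Random(seed)
    video = []
    for c in controls:
        # A trained network would estimate the clean frame from the reference
        # image and the controls; this fixed formula is an assumption.
        target = [p + 0.1 * c.mouth_open + c.head_turn for p in reference_image]
        # Start from pure noise, as diffusion sampling does.
        frame = [rng.gauss(0.0, 1.0) for _ in reference_image]
        for _ in range(steps):
            # Each step moves the frame halfway towards the estimate.
            frame = [x + 0.5 * (t - x) for x, t in zip(frame, target)]
        video.append(frame)
    return video


if __name__ == "__main__":
    motion = predict_motion([0.0, 0.5, 1.0, 0.2])   # four frames of "audio"
    frames = render_video([0.2, 0.4], motion)       # two-pixel "still image"
    print(len(frames), "frames generated")
```

The point of the split is that the first stage works only with audio and a compact description of motion, while the second stage is the only part that has to deal with pixels.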

Combined with generative AI and foreign language translation, VLOGGER has obvious uses in content creation and communication. There are, of course, concerns about misuse of this technology, such as the creation of deepfakes.


More Information

VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis

Related Articles

Google Adds Gemini To Bard

Generate 3D Flythroughs from Still Photos

Last Updated ( Sunday, 24 March 2024 )