Hunyuan Sonic Turns Images with Audio Into Speeches, Songs

In this day and age, you don’t need a whole lot to generate stunning videos with AI. Hunyuan Sonic is a nifty approach to breathing life into static images. It uses temporal audio learning for accurate lip-sync and natural expressions. By using a motion-decoupled controller, motion of the head and expression movement “are disentangled and independently controlled by intra-audio clips.”

Sonic can generate stunning videos with an image and audio input. It can generate long videos up to 10 minutes. As the above video shows, Sonic can create more dynamic, natural videos. Sonic works well with images that are not real humans.

[HT: Zhejiang University,Tencent ]

What's Hot

InstantCharacter Personalize Image Characters with a Scalable Diffusion Transformer

Leonardo’s AI Video Tool Gets Motion Control

HunyuanPortrait for Controllable Animation from Images

Leonardo’s AI Video Tool Gets Motion Control

ChatGPT Gets an Image Library for Organization

Grok Studio Released, Can Now Run Python, C++ Code

Where to Find HunyuanVideo I2V Open Source Text to Video Model

How to Create an App with Grok 3 and Replit with No Coding

OpenAI to Announce o3-mini?

Helix: AI That Enables Robots to Reason Like Humans

AiNova Python/Scratch Programmable Robot Car

Can DeepSeek R1 Play Chess? Tested Against LC0

Most Popular

How to Run DeepSeek in Cursor

GPTARS: GPT Powered TARS Robot

How to Run DeepSeek R1 on Android

Our Picks

InstantCharacter Personalize Image Characters with a Scalable Diffusion Transformer

Leonardo’s AI Video Tool Gets Motion Control

HunyuanPortrait for Controllable Animation from Images

What's Hot

Hunyuan Sonic Turns Images with Audio Into Speeches, Songs

Related Posts