AI video models are improving rapidly, and the latest models can produce strikingly realistic footage. Transformers still struggle to generate one-minute videos, however, largely because self-attention scales poorly over such long contexts. This paper explores using Test-Time Training (TTT) with a pre-trained Transformer to generate one-minute videos from text storyboards.
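To make the idea concrete, here is a minimal sketch of a test-time-training layer, under the common formulation in which the layer's hidden state is itself the weight matrix of a small linear model, updated by gradient descent on a self-supervised reconstruction loss as each token is processed. The class name `TTTLinear`, the key/value/query projections, and the single inner gradient step with learning rate `inner_lr` are illustrative assumptions, not the paper's exact implementation.

```python
# Sketch of a TTT layer: the "memory" is a per-sequence matrix W that is
# trained online, one gradient step per token, on a reconstruction loss.
import torch


class TTTLinear(torch.nn.Module):
    def __init__(self, dim: int, inner_lr: float = 0.1):
        super().__init__()
        # Projections producing the inner model's input (k), its
        # reconstruction target (v), and the read-time query (q).
        self.to_k = torch.nn.Linear(dim, dim, bias=False)
        self.to_v = torch.nn.Linear(dim, dim, bias=False)
        self.to_q = torch.nn.Linear(dim, dim, bias=False)
        self.inner_lr = inner_lr

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, seq_len, dim). W is the hidden state, one matrix per sequence.
        B, T, D = x.shape
        W = torch.zeros(B, D, D, device=x.device, dtype=x.dtype)
        k, v, q = self.to_k(x), self.to_v(x), self.to_q(x)
        outputs = []
        for t in range(T):
            kt, vt, qt = k[:, t], v[:, t], q[:, t]               # (B, D) each
            # Inner-loop self-supervised loss: reconstruct v_t from k_t via W.
            pred = torch.bmm(kt.unsqueeze(1), W).squeeze(1)      # (B, D)
            err = pred - vt
            # One gradient step on 0.5 * ||k_t W - v_t||^2 w.r.t. W.
            grad = torch.bmm(kt.unsqueeze(2), err.unsqueeze(1))  # (B, D, D)
            W = W - self.inner_lr * grad
            # Read out with the freshly updated state.
            outputs.append(torch.bmm(qt.unsqueeze(1), W).squeeze(1))
        return torch.stack(outputs, dim=1)                       # (B, T, D)
```

In the paper's setting, layers like this are combined with a pre-trained Transformer so the model can carry context across a full minute of video rather than relying on self-attention alone.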
With this approach, it is possible to generate one-minute videos with smooth motion and strong temporal consistency. Here are two consecutive scenes from a sample storyboard:
The kitchen has soft yellow walls, white cabinets, and a window with red-and-white checkered curtains letting in gentle sunlight. In the middle, there’s a round wooden table with matching chairs, sitting on a clean white-tiled floor. Tom, the blue-gray cat, walks in from the left holding a warm, golden-brown pie on a shiny silver tray. He moves calmly across the room toward the table, carefully places the pie down, pulls out a chair, and sits comfortably. The camera smoothly follows Tom from left to right, clearly showing each of his movements.
The kitchen has soft yellow walls, white cabinets, and a window with red-and-white checkered curtains letting in gentle sunlight. In the middle, there’s a round wooden table with matching chairs, sitting on a clean white-tiled floor. Tom, the blue-gray cat, sits comfortably at the table with the golden-brown pie resting on its shiny silver tray directly in front of him. He carefully uses his paw to pick up a slice from the tray, lifts it toward his mouth, and takes a large bite. The camera slowly moves closer, clearly showing Tom enjoying his pie as crumbs lightly fall onto the table.