CineMaster: 3D-Aware Controllable Text to Video Generation

Video models are getting better all the time. The latest models give you plenty of control over camera movement. CineMaster lets you manipulate objects and camera in 3D space. With this approach, it is possible to make videos of men walking in front of another object, cars passing one another, a hot balloon circling a tower, and a lot more. As the researchers explain:

To achieve this, CineMaster operates in two stages. In the first stage, we design an interactive workflow that allows users to intuitively construct 3D-aware conditional signals by positioning object bounding boxes and defining camera movements within the 3D space. In the second stage, these control signals—comprising rendered depth maps, camera trajectories and object class labels—serve as the guidance for a text-to-video diffusion model, ensuring to generate the user-intended video content.

[HT]

What's Hot

Top Black Friday Deals for AI: Higgsfield, Suno, Freepik

Mureka O2 & V7.6 Music Models Debut

SOUYIE SW-9 GPT Powered Smartwatch

Top Black Friday Deals for AI: Higgsfield, Suno, Freepik

Manus Browser Operator: Agent Works In your Browser

SAM 3D Can Turn Any Object In Images Into a 3D Model

GPT 4.5 Leads in Elimination Game That Tests Reasoning, Strategy & Deception

Merryking AI GPT-4o Smartwatch for Translation & Fitness

Boston Dynamics’ Atlas Using Machine Learning for Autonomous Task Completion

SAM 3D Can Turn Any Object In Images Into a 3D Model

15+ Things To Do with Nano Banana Pro

Nano Banana Pro Hits Higgsfield & Others

Most Popular

Prompt Cannon: Run Prompts Across Multiple Models

Dipal D1 2.5K Curved Screen 3D AI Character

GPTARS: GPT Powered TARS Robot

Our Picks

Top Black Friday Deals for AI: Higgsfield, Suno, Freepik

Mureka O2 & V7.6 Music Models Debut

SOUYIE SW-9 GPT Powered Smartwatch

What's Hot

CineMaster: 3D-Aware Controllable Text to Video Generation

Related Posts