We propose Frame In-N-Out, a controllable image-to-video generation framework in which objects can enter or exit the scene along user-defined motion trajectories. Our method introduces a new dataset curation procedure, an evaluation protocol, and a motion-controllable, identity-preserving video Diffusion Transformer to achieve Frame In and Frame Out in the cinematic domain.
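As a rough illustration of the "frame in / frame out" idea, a user-defined motion trajectory can be thought of as a sequence of (x, y) control points on the canvas, where points lying outside the frame bounds mark timesteps at which the object is off-screen. The function and variable names below are purely hypothetical and are not the project's actual API:

```python
# Hypothetical sketch: a trajectory is a list of (x, y) control points;
# a point outside the canvas bounds means the object is off-screen
# (frame out) at that timestep.

def classify_trajectory(points, width, height):
    """Return a per-point list: True if the point lies inside the frame."""
    return [0 <= x < width and 0 <= y < height for x, y in points]

# An object that enters from the left edge and exits past the right edge
# of a 512x288 canvas:
traj = [(-40, 120), (60, 128), (300, 140), (560, 150)]
inside = classify_trajectory(traj, width=512, height=288)
# First and last points are off-canvas: the object frames in, then frames out.
```
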
- Release the paper
- Release the model weights (CogVideoX)
- Release an online Gradio demo
- Release the training code
- Release the processed training dataset