This is the official PyTorch implementation of FlexWorld: Progressively Expanding 3D Scenes for Flexible-View Synthesis.
- [2025-5-21]: Add training code and data preparation.
For complete installation instructions, please see INSTALL.md.
Static scene video generation given an image and a camera trajectory:

```bash
python video_generate.py --input_image_path ./assets/room.png --output_dir ./results-single-traj
```
You can pass the `--traj` argument to specify camera movements; the basic movements are defined in `ops/utils/all_traj.py`. The supported camera movements include `["up", "down", "left", "right", "forward", "backward", "rotate_left", "rotate_right"]`.

```bash
python video_generate.py --input_image_path ./assets/room.png --output_dir ./results-single-traj --traj backward
```
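For intuition, here is a minimal sketch of how a basic movement such as `backward` could be represented as a sequence of camera poses. The function name, frame count, and pose convention are illustrative assumptions; the actual definitions live in `ops/utils/all_traj.py`.

```python
import numpy as np

def make_backward_traj(num_frames: int = 49, step: float = 0.05) -> np.ndarray:
    """Hypothetical sketch: a 'backward' movement as (num_frames, 4, 4)
    camera-to-world matrices that translate the camera along -z."""
    poses = []
    for i in range(num_frames):
        c2w = np.eye(4)
        c2w[2, 3] = -step * i  # assumed convention: -z moves the camera backward
        poses.append(c2w)
    return np.stack(poses)

print(make_backward_traj().shape)  # (49, 4, 4)
```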
You can also generate videos that share the same camera trajectories as those in DL3DV and Re10K; just pass the video path to the `--traj` argument.

```bash
python video_generate.py --input_image_path ./assets/room.png --output_dir ./results-single-traj --traj ./path_to_dl3dv/1.mp4
```
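If you have a folder of such trajectory videos, a simple driver can reuse the CLI above for each of them. This is a sketch: the folder path is a placeholder, and only the flags shown above are used.

```python
import subprocess
from pathlib import Path

traj_dir = Path("./path_to_dl3dv")  # placeholder folder of trajectory videos
for traj_video in sorted(traj_dir.glob("*.mp4")):
    # One output directory per trajectory, named after the video file.
    subprocess.run(
        [
            "python", "video_generate.py",
            "--input_image_path", "./assets/room.png",
            "--output_dir", f"./results-single-traj/{traj_video.stem}",
            "--traj", str(traj_video),
        ],
        check=True,
    )
```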
Flexible-view 360° scene generation given an image:

```bash
# You are free to modify the corresponding YAML configuration file by name in ./configs/examples.
python main_3dgs.py --name room2
```
To freely explore a generated scene, first run:

```bash
python 3dgs_viewer.py
```

then visit `127.0.0.1:8000`. The script recursively scans the current directory for `.ply` files, so run it after generation has finished.
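The recursive scan amounts to something like the following minimal sketch (not the viewer's actual code):

```python
from pathlib import Path

# Recursively collect every .ply file under the current directory,
# mirroring what the viewer does when it indexes generated scenes.
for ply in sorted(Path(".").rglob("*.ply")):
    print(ply)
```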
- Download the dataset to a local directory following the DL3DV repo. You may download only part of it, such as the 1K subset.
- Prepare 3DGS from the DL3DV dataset: first download the COLMAP annotations from DL3DV colmap annotation, then run reconstruction following the Gaussian Splatting repo. The final output will be laid out like:
  ```
  output/
  ├── 001dccbc1f78146a9f03861026613d8e73f39f372b545b26118e37a23c740d5f/
  │   └── point_cloud/
  │       └── iteration_7000/
  │           └── point_cloud.ply
  └── 0003dc82473fd52c53dcbdc2deb4e6e9c3548d6f8c9b03f9ea8d3c7d3ce33546/
      └── point_cloud/
          └── iteration_7000/
              └── point_cloud.ply
  ```
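  To sanity-check this layout before the next step, a minimal sketch (the output path is an example):

  ```python
  from pathlib import Path

  # Check that each reconstructed scene exposes the expected point cloud.
  output_root = Path("./gaussian-splatting/output")  # example path
  for scene in sorted(p for p in output_root.iterdir() if p.is_dir()):
      ply = scene / "point_cloud" / "iteration_7000" / "point_cloud.ply"
      print(scene.name, "ok" if ply.exists() else "MISSING")
  ```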
- Run the following to generate the broken videos rendered with 3DGS:

  ```bash
  # The paths here are examples.
  python gen_dataset.py --dataset_path ./DL3DV/DL3DV-10K/1K --output_path ./DL3DV/processed --gs_path ./gaussian-splatting/output
  ```
- Run the following to label the constructed videos:

  ```bash
  # The paths here are examples.
  python label_dataset.py --input_path ./DL3DV/processed --output_path ./train_data_v2v
  ```
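  A quick check of the labeled output before training; the assumption that videos and text labels sit side by side under the output directory is illustrative, not a guarantee of the exact layout:

  ```python
  from pathlib import Path

  # Count the produced videos and labels (layout assumed for illustration).
  root = Path("./train_data_v2v")
  print(len(list(root.rglob("*.mp4"))), "videos,", len(list(root.rglob("*.txt"))), "labels")
  ```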
- Change the following lines in `./tools/CogVideo/configs/sft_v2v.yaml`:

  ```yaml
  args:
    checkpoint_activations: True
    experiment_name: lora-disney  # your save folder name
    mode: finetune
    load: "xxx/CogVideoX-5B-I2V-SAT/transformer"  # path to the original transformer checkpoints
    save: "./ckpts_5b"  # path to the save directory
    train_data: ["train_data_v2v"]  # training data path
    valid_data: ["train_data_v2v"]  # validation data path; can be the same as train_data (not recommended)
  ```
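  Before launching, you can verify the configured paths with a small sketch (assumes PyYAML and the `args` keys shown above):

  ```python
  import yaml  # PyYAML
  from pathlib import Path

  # Report whether each configured path exists before starting a long run.
  args = yaml.safe_load(Path("./tools/CogVideo/configs/sft_v2v.yaml").read_text())["args"]
  for key in ("load", "save"):
      print(key, args[key], "->", "exists" if Path(args[key]).exists() else "missing")
  for split in ("train_data", "valid_data"):
      for p in args[split]:
          print(split, p, "->", "exists" if Path(p).exists() else "missing")
  ```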
- Run the training script:

  ```bash
  cd ./tools/CogVideo/
  bash train_video_v2v.py
  ```
- A user manual for our camera trajectories, offering support for more flexible trajectory inputs and accommodating a wider variety of trajectory types (such as RealEstate10K camera input and DL3DV-10K camera input).
- A 3DGS viewer for generated results.
- Training code for video diffusion model.
This work is built on many amazing open-source projects; thanks to all the authors!
```bibtex
@misc{chen2025flexworldprogressivelyexpanding3d,
      title={FlexWorld: Progressively Expanding 3D Scenes for Flexible-View Synthesis},
      author={Luxi Chen and Zihan Zhou and Min Zhao and Yikai Wang and Ge Zhang and Wenhao Huang and Hao Sun and Ji-Rong Wen and Chongxuan Li},
      year={2025},
      eprint={2503.13265},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2503.13265},
}
```