By Bowen Zhang*, Yiji Cheng*, Chunyu Wang†, Ting Zhang, Jiaolong Yang, Yansong Tang, Feng Zhao, Dong Chen, and Baining Guo.
Paper | Project Page | Code
Demo video: RodinHD_realworld.mp4
We recommend using Anaconda to create a new environment and install the dependencies. Our code is tested with Python 3.8 on Linux. The model is trained, and inference is run, on NVIDIA V100 GPUs.
conda create -n rodinhd python=3.8
conda activate rodinhd
pip install -r requirements.txt
cd Renderer
pip install -r requirements.txt
conda install mpi4py
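As an optional sanity check (this snippet is our suggestion, not part of the released code), the following verifies that PyTorch can see a GPU and that mpi4py imports correctly, assuming both were installed by the steps above:

# check_env.py -- optional environment sanity check
import torch

print("PyTorch:", torch.__version__)
print("CUDA available:", torch.cuda.is_available())
if torch.cuda.is_available():
    print("GPU:", torch.cuda.get_device_name(0))

try:
    import mpi4py  # installed above; used for multi-process training
    print("mpi4py:", mpi4py.__version__)
except ImportError:
    print("mpi4py missing; re-run `conda install mpi4py`")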
Due to organization policy, the training data is not publicly available. You can prepare your own data following the instructions below. Your 3D dataset should be organized as follows:
data
├── obj_00
│   ├── img_proc_fg_000000.png
│   ├── img_proc_fg_000001.png
│   ├── ...
│   ├── metadata_000000.json
│   ├── metadata_000001.json
│   ├── ...
├── obj_01
│   ├── ...
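Before encoding, it can help to confirm that every object folder follows this layout. A minimal sketch (the glob patterns simply mirror the example tree above and may need adjusting to your data):

# check_dataset.py -- minimal layout check based on the example tree above
import glob
import os

root = "/path/to/data"
for obj in sorted(os.listdir(root)):
    obj_dir = os.path.join(root, obj)
    if not os.path.isdir(obj_dir):
        continue
    imgs = glob.glob(os.path.join(obj_dir, "img_proc_fg_*.png"))
    metas = glob.glob(os.path.join(obj_dir, "metadata_*.json"))
    if not imgs or len(imgs) != len(metas):
        print(f"[warn] {obj}: {len(imgs)} images, {len(metas)} metadata files")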
Then encode the multi-scale VAE features of the frontal images of each object using the following command:
cd scripts
python encode_multiscale_feature.py --root /path/to/data --output_dir /path/to/feature --txt_file /path/to/txt_file --start_idx 0 --end_idx 1000
Here --txt_file is a text file listing the objects to be encoded; it can be generated with ls /path/to/data > /path/to/txt_file.
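Because the script exposes --start_idx and --end_idx, encoding can be split into index chunks, for example to distribute the work across several jobs or GPUs. A minimal sketch that launches the chunks sequentially from the scripts directory (chunk size and paths are placeholders):

# run_encoding_chunks.py -- split encoding over index ranges via --start_idx/--end_idx
# (run from the scripts/ directory; paths and chunk size are placeholders)
import subprocess

txt_file = "/path/to/txt_file"
with open(txt_file) as f:
    num_objects = sum(1 for _ in f)

chunk = 250  # objects per job; adjust to your setup
for start in range(0, num_objects, chunk):
    end = min(start + chunk, num_objects)
    subprocess.run([
        "python", "encode_multiscale_feature.py",
        "--root", "/path/to/data",
        "--output_dir", "/path/to/feature",
        "--txt_file", txt_file,
        "--start_idx", str(start),
        "--end_idx", str(end),
    ], check=True)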
Run inference with the base diffusion model:
cd scripts
sh base_sample.sh
Then run inference with the upsample diffusion model:
cd scripts
sh upsample_sample.sh
You need to modify the arguments in the scripts to fit your own data path.
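If you prefer a single entry point, the two sampling stages can be chained with a small wrapper; this sketch only calls the scripts above in the order given, and assumes their arguments have already been adapted and that it is run from the repository root:

# sample_pipeline.py -- runs the two sampling stages in order (assumes the
# arguments inside the scripts have already been adapted to your paths)
import subprocess

for script in ["base_sample.sh", "upsample_sample.sh"]:
    subprocess.run(["sh", script], cwd="scripts", check=True)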
We first fit the shared feature decoder with the proposed task-replay and identity-aware weight consolidation strategies using:
cd Renderer
sh fit_stage1.sh
To enable distributed single-machine multi-GPU training for stage 1, run the following script:
CUDA_VISIBLE_DEVICES=2,3,4 sh fit_stage1_dist.sh
Then we fix the shared feature decoder and fit each triplane per object using:
sh fit_stage2.sh
You need to modify the arguments in the scripts to fit your own data path.
After fitting the triplanes, we train the base diffusion model using:
sh ../scripts/base_train.sh
Then we train the upsample diffusion model using:
sh ../scripts/upsample_train.sh
You need to modify the arguments in the scripts to fit your own data path.
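The full training pipeline (shared decoder fitting, per-object triplane fitting, base diffusion, upsample diffusion) can likewise be chained. The sketch below simply invokes the four scripts above in order, assuming their arguments have already been set up and that it is run from the repository root:

# train_pipeline.py -- chains the four training stages described above
# (assumes script arguments are already adapted; run from the repo root)
import subprocess

stages = [
    ("Renderer", "fit_stage1.sh"),     # shared feature decoder
    ("Renderer", "fit_stage2.sh"),     # per-object triplane fitting
    ("scripts", "base_train.sh"),      # base diffusion model
    ("scripts", "upsample_train.sh"),  # upsample diffusion model
]
for cwd, script in stages:
    subprocess.run(["sh", script], cwd=cwd, check=True)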
This repository is built upon improved-diffusion, torch-ngp and Rodin. Thanks for their great work!
If you find our work useful in your research, please consider citing:
@article{zhang2024rodinhd,
title={RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models},
author={Zhang, Bowen and Cheng, Yiji and Wang, Chunyu and Zhang, Ting and Yang, Jiaolong and Tang, Yansong and Zhao, Feng and Chen, Dong and Guo, Baining},
journal={arXiv preprint arXiv:2407.06938},
year={2024}
}