This repository is the official implementation of Training Class-Imbalanced Diffusion Model Via Overlap Optimization (arXiv 2024, in submission).
[Project Page] [Arxiv] [OpenReview] [Slides] [Poster]
Authors: Liang Yan, Lu Qi, Vincent Tao Hu, Ming-Hsuan Yang, Meng Tang
Diffusion models have recently made significant advances in high-quality image synthesis and related tasks. However, diffusion models trained on real-world datasets, which often follow long-tailed distributions, yield inferior fidelity for tail classes: deep generative models, including diffusion models, are biased towards classes with abundant training images. To address the observed appearance overlap between images synthesized for tail (rare) classes and those of head classes, we propose a contrastive-learning-based method that minimizes the overlap between the distributions of synthetic images for different classes. Variants of our probabilistic contrastive learning method can be applied to any class-conditional diffusion model, and our loss yields significant improvements in image synthesis on multiple datasets with long-tailed distributions. Extensive experimental results demonstrate that the proposed method effectively handles imbalanced data for both diffusion-based generation and classification models.
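As a toy illustration of what "overlap between distributions" means (this is not the paper's actual loss), the closed-form Bhattacharyya coefficient between two 1-D Gaussians is a standard overlap measure: it is 1 for identical distributions and approaches 0 as they separate, so a contrastive-style objective could penalize it. All names below are illustrative assumptions:

```python
import math

def bhattacharyya_coefficient(mu1, var1, mu2, var2):
    """Closed-form Bhattacharyya coefficient between two 1-D Gaussians.

    BC = 1 means the two distributions coincide (maximal overlap);
    BC -> 0 means they are well separated. A contrastive-style
    objective could push BC between different classes toward 0.
    """
    # Bhattacharyya distance for Gaussians N(mu1, var1), N(mu2, var2)
    bd = 0.25 * (mu1 - mu2) ** 2 / (var1 + var2) \
        + 0.5 * math.log((var1 + var2) / (2.0 * math.sqrt(var1 * var2)))
    return math.exp(-bd)

# Identical Gaussians fully overlap; distant means barely overlap.
print(bhattacharyya_coefficient(0.0, 1.0, 0.0, 1.0))  # 1.0
print(bhattacharyya_coefficient(0.0, 1.0, 5.0, 1.0))  # ~0.044
```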
The repo is implemented based on https://github.com/w86763777/pytorch-ddpm. It currently supports training on four datasets: CIFAR10, CIFAR10-LT, CIFAR100, and CIFAR100-LT.
- Regular (conditional or unconditional) diffusion model training
- Class-balancing model training
- Class-balancing model finetuning based on a regular diffusion model
We mainly provide scripts for training and evaluating on the CIFAR100-LT dataset. To run the code, change the 'root' argument to the path where the dataset is downloaded.
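For example (the script name and flag spellings below are assumptions for illustration, not the repo's exact CLI):

```shell
# Illustrative invocation; point --root at your local dataset directory.
python main.py \
  --root /path/to/datasets \
  --dataset cifar100lt \
  --train
```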
Please find the features for CIFAR100 and CIFAR10 used in the precision/recall/F_beta metrics, and put them in the stats folder; the code will then be ready to run. Note that these two metrics are evaluated only when the number of samples is 50k; otherwise they return 0.
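A minimal sketch of that sample-count guard (the function name and stats path are assumptions, not the repo's actual API):

```python
from pathlib import Path

STATS_DIR = Path("stats")  # put the downloaded feature files here

def should_eval_prf(num_samples: int) -> bool:
    """Precision/recall/F_beta are evaluated only at exactly 50k samples;
    for any other sample count the metrics are reported as 0."""
    return num_samples == 50_000

print(should_eval_prf(50_000))  # True
print(should_eval_prf(10_000))  # False
```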
All algorithms and models are implemented in Python and PyTorch. Experiments are conducted on a server with 8 NVIDIA V100 GPUs (32 GB memory each) and an Intel(R) Xeon(R) Platinum 8255C CPU @ 2.50GHz.
This implementation is based on / inspired by:
- https://github.com/w86763777/pytorch-ddpm
- https://github.com/crowsonkb/k-diffusion/blob/master/train.py (we refer to the implementation of ADA augmentation in the k-diffusion model).
Feel free to cite this work if you find it useful!
@article{yan2024training,
  title={Training Class-Imbalanced Diffusion Model Via Overlap Optimization},
  author={Yan, Divin and Qi, Lu and Hu, Vincent Tao and Yang, Ming-Hsuan and Tang, Meng},
  journal={arXiv preprint arXiv:2402.10821},
  year={2024}
}