Ultra Lowrate Image Compression with Semantic Residual Coding and Compression-aware Diffusion

Anle Ke¹ · Xu Zhang¹ · Tong Chen¹ · Ming Lu¹ · Chao Zhou² · Jiawen Gu² · Zhan Ma¹

¹ Nanjing University · ² Kuaishou Technology

📖 Table Of Contents

✨ Visual Results
⏳ Train
😀 Inference
🌊 TODO
❤ Acknowledgement
🙇‍ Citation

⚙️ Environment Setup

- conda create -n ResULIC python=3.10
- conda activate ResULIC
- pip install -r requirements.txt

✨ Visual Results

⏳ Train

Note: The numbers in the yaml filenames (e.g., 1_1_1) represent $\lambda_{\text{diffusion}}$, $\lambda_{\text{mse}}$, and $\lambda_{\text{bpp}}$ respectively.
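As a rough illustration of this naming convention, the sketch below splits a config filename into its three weights. The helper `parse_loss_weights`, the `LossWeights` tuple, and the example path are hypothetical and not part of this repository. It also assumes the weights are the last three underscore-separated fields of the filename.

```python
from pathlib import Path
from typing import NamedTuple


class LossWeights(NamedTuple):
    """Loss weights encoded in a config filename (hypothetical container)."""
    diffusion: float  # lambda_diffusion
    mse: float        # lambda_mse
    bpp: float        # lambda_bpp


def parse_loss_weights(config_path: str) -> LossWeights:
    """Read the three weights from a filename such as '1_1_1.yaml'.

    Assumption: the last three underscore-separated fields of the
    filename (without the .yaml extension) are the weights, in the
    order diffusion, mse, bpp.
    """
    fields = Path(config_path).stem.split("_")
    if len(fields) < 3:
        raise ValueError(f"expected three weights in {config_path!r}")
    diffusion, mse, bpp = (float(f) for f in fields[-3:])
    return LossWeights(diffusion, mse, bpp)


if __name__ == "__main__":
    # Example path only; substitute the yaml file you actually use.
    print(parse_loss_weights("./configs/model/stage1/1_1_1.yaml"))
```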

Stage 1: Initial Training

Download Pretrained Model
Download the pretrained Stable Diffusion v2.1 model into the ./weight directory:

wget https://huggingface.co/stabilityai/stable-diffusion-2-1-base/resolve/main/v2-1_512-ema-pruned.ckpt --no-check-certificate -P ./weight

Modify the configuration files ./configs/train_zc_eps.yaml and ./configs/model/stage1/xx.yaml accordingly.
Start training.
```
bash stage1.sh 
```

Stage 2:

Modify the configuration files ./configs/train_stage2.yaml and ./configs/model/stage2/xx.yaml accordingly.
Start training.
```
bash stage2.sh
```

😀 Inference

Note: It is recommended to set "ddim_steps" to a number that divides "add_steps" evenly. For example, when add_steps=600, ddim_steps could be 2, 3, 5, and so on.
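Going by the example in the note (add_steps=600 with ddim_steps of 2, 3, 5), the recommended values are the divisors of add_steps. A minimal sketch for listing them follows. The function name `valid_ddim_steps` is hypothetical and not part of the inference scripts.

```python
def valid_ddim_steps(add_steps: int) -> list[int]:
    """Return every ddim_steps value that divides add_steps evenly."""
    if add_steps <= 0:
        raise ValueError("add_steps must be a positive integer")
    return [n for n in range(1, add_steps + 1) if add_steps % n == 0]


if __name__ == "__main__":
    # For add_steps=600 the list includes 2, 3 and 5 from the note above.
    print(valid_ddim_steps(600))
```

Values outside this list, such as 7 for add_steps=600, are the ones the note advises against.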

W/o Srr, W/o Pfo.

CUDA_VISIBLE_DEVICES=2 python inference_win.py \
 --ckpt xx \
 --config /xx/xx.yaml \
 --output xx/ \
 --ddim_steps 3 \
 --ddim_eta 0 \
 --Q x.0 \
 --add_steps x00

W/ Srr, W/o Pfo.

CUDA_VISIBLE_DEVICES=2 python inference_res.py \
 --ckpt xx \
 --config /xx/xx.yaml \
 --output xx/ \
 --ddim_steps 3 \
 --ddim_eta 0 \
 --Q x.0 \
 --add_steps x00
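The two commands above differ only in the script name, so they can be assembled by one small helper. The sketch below is an assumption-laden convenience wrapper, not part of the repository: `build_inference_cmd` and its `use_srr` switch are hypothetical names, and the checkpoint, config, output, Q, and add_steps values remain placeholders for you to fill in. It uses only the flags shown in the commands above and checks that ddim_steps divides add_steps before building the argument list.

```python
def build_inference_cmd(ckpt: str, config: str, output: str, q: float,
                        add_steps: int, ddim_steps: int = 3,
                        ddim_eta: float = 0, use_srr: bool = False) -> list[str]:
    """Build the argument list for one of the two inference scripts.

    use_srr=False selects inference_win.py (W/o Srr, W/o Pfo);
    use_srr=True selects inference_res.py (W/ Srr, W/o Pfo).
    """
    if add_steps % ddim_steps != 0:
        raise ValueError("ddim_steps should divide add_steps evenly")
    script = "inference_res.py" if use_srr else "inference_win.py"
    return [
        "python", script,
        "--ckpt", ckpt,
        "--config", config,
        "--output", output,
        "--ddim_steps", str(ddim_steps),
        "--ddim_eta", str(ddim_eta),
        "--Q", str(q),
        "--add_steps", str(add_steps),
    ]


if __name__ == "__main__":
    # Placeholder values; replace them with your own paths and settings.
    cmd = build_inference_cmd("xx.ckpt", "xx.yaml", "xx/", q=1.0,
                              add_steps=600, use_srr=True)
    print(" ".join(cmd))
```

The returned list can be passed to `subprocess.run`, with `CUDA_VISIBLE_DEVICES` set through its `env` argument if a specific GPU is needed.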

🌊 TODO

Release code
Release quantitative metrics (👾 The quantitative metrics for ResULIC presented in our paper can be found in the indicator directory.)
Release pretrained models (Coming soon)

❤ Acknowledgement

This work is based on ControlNet, ControlNet-XS, DiffEIC, and ELIC. We thank the authors for their invaluable contributions.

🙇‍ Citation

If you find our work useful, please consider citing:

@inproceedings{Ke2025resulic,
  author = {Ke, Anle and Zhang, Xu and Chen, Tong and Lu, Ming and Zhou, Chao and Gu, Jiawen and Ma, Zhan},
  title = {Ultra Lowrate Image Compression with Semantic Residual Coding and Compression-aware Diffusion},
  booktitle = {International Conference on Machine Learning},
  year = {2025}
}
