Katz is a high-performance serving system designed specifically for diffusion model workflows with many adapters (ControlNets and LoRAs). It substantially improves inference efficiency while preserving image quality.
- ControlNet-as-a-Service: Decouples ControlNets from the base model for independent scaling.
- Bounded Asynchronous LoRA Loading: Overlaps LoRA loading with base model execution to reduce latency (see the sketch after this list).
- Latent Parallelism: Accelerates base model execution across multiple GPUs.
- Performance Gains: Up to $7.8\times$ latency reduction and $1.7\times$ throughput improvement.
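To give a feel for the bounded asynchronous loading idea, here is a minimal Python sketch (not Katz's actual implementation; all function names are hypothetical stand-ins). It starts fetching LoRA weights on a background thread, lets up to `bound` denoising steps run without the adapter, and then blocks until the weights arrive:

```python
import time
from concurrent.futures import ThreadPoolExecutor

def load_lora_weights(path):
    # Stand-in for the expensive disk/network fetch of LoRA weights.
    time.sleep(0.5)
    return {"path": path}

def apply_lora(unet, weights):
    # Stand-in for merging LoRA deltas into the base model's layers.
    pass

def denoise_step(unet, latents, t):
    # Stand-in for one denoising step of the base diffusion model.
    return latents

def generate(unet, latents, timesteps, lora_path, bound=4):
    """Overlap the LoRA load with the first denoising steps.

    `bound` caps how many steps may run without the adapter, so the
    output still reflects the LoRA style even if loading is slow.
    """
    with ThreadPoolExecutor(max_workers=1) as pool:
        future = pool.submit(load_lora_weights, lora_path)
        applied = False
        for i, t in enumerate(timesteps):
            if not applied and (future.done() or i >= bound):
                apply_lora(unet, future.result())  # blocks if still loading
                applied = True
            latents = denoise_step(unet, latents, t)
    return latents
```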
Prompt: papercut -subject/scene- a shiba inu wearing a beret and black turtleneck, 4k, clean background
Negative prompt: low quality, bad quality, sketches, numbers, letters
This image was generated with 1 ControlNet with depth guidance and 1 LoRA for the papercut style. The depth reference image used for guidance is available here.
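For reference, an image like this can be reproduced with the upstream 🤗 Diffusers API along the following lines. This is an illustrative sketch, not Katz's serving path, and the model and LoRA repo IDs are assumptions:

```python
import torch
from diffusers import ControlNetModel, StableDiffusionXLControlNetPipeline
from diffusers.utils import load_image

# Illustrative sketch with upstream 🤗 Diffusers (repo IDs are assumptions):
# 1 depth ControlNet + 1 papercut-style LoRA, as in the example above.
controlnet = ControlNetModel.from_pretrained(
    "diffusers/controlnet-depth-sdxl-1.0", torch_dtype=torch.float16
)
pipe = StableDiffusionXLControlNetPipeline.from_pretrained(
    "stabilityai/stable-diffusion-xl-base-1.0",
    controlnet=controlnet,
    torch_dtype=torch.float16,
).to("cuda")
pipe.load_lora_weights("TheLastBen/Papercut_SDXL", weight_name="papercut.safetensors")

depth_image = load_image("path/to/depth_reference.png")  # depth guidance image
image = pipe(
    prompt="papercut -subject/scene- a shiba inu wearing a beret and black turtleneck, 4k, clean background",
    negative_prompt="low quality, bad quality, sketches, numbers, letters",
    image=depth_image,
).images[0]
image.save("shiba_papercut.png")
```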
- NVIDIA GPUs (H800 recommended for best performance)
- CUDA 11.8+
- Python 3.10+
$ conda create -n katz python=3.10
$ conda activate katz
$ pip install -r requirements.txt
# Install our customized diffusers package
$ pushd ./diffusers-hf && pip install -e . && popd
# Install fast-kernel
$ pushd ./diffusers-hf/src/fast_kernel/ && git submodule update --init --recursive && pip install . && popd
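As an optional sanity check (our assumption of what a correct editable install looks like), Python should resolve `diffusers` to the local checkout rather than a PyPI install:

```python
# Optional sanity check: the customized diffusers should resolve to the
# editable ./diffusers-hf checkout rather than a PyPI install.
import diffusers
print(diffusers.__version__)
print(diffusers.__file__)  # expected to point inside ./diffusers-hf
```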
Coming soon.
For detailed benchmarking instructions and reproducing our results, see the artifact evaluation guide.
We provide tools and datasets for analyzing real-world production traces in the trace directory.
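As a starting point, a trace could be inspected with pandas along these lines. The file name and column names below are purely hypothetical; consult the trace directory's own documentation for the actual schema:

```python
import pandas as pd

# Hypothetical sketch: the file name and columns are assumptions, not
# the actual trace schema; see the trace directory for the real format.
df = pd.read_csv("trace/example_trace.csv")
print(df.head())

# e.g., requests per second, assuming an arrival-timestamp column exists
if "timestamp" in df.columns:
    arrivals = pd.to_datetime(df["timestamp"], unit="s")
    print(arrivals.dt.floor("s").value_counts().sort_index().head())
```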
Please cite our paper if you find it helpful in your research.
@inproceedings{Katz2025,
title = {Katz: Efficient Workflow Serving for Diffusion Models with Many Adapters},
author = {Li, Suyi and Yang, Lingyun and Jiang, Xiaoxiao and Lu, Hanfeng and An, Dakai and Di, Zhipeng and Lu, Weiyi and Chen, Jiawei and Liu, Kan and Yu, Yinghao and Lan, Tao and Yang, Guodong and Qu, Lin and Zhang, Liping and Wang, Wei},
booktitle = {Proc. USENIX ATC},
year = {2025}
}
We thank the contributors of 🤗 Diffusers for their foundational work.
For questions and support, please open an issue or contact the authors.