HiDream ModelSpec #371

Open: wants to merge 2 commits into main

@a-r-r-o-w (Member) commented Apr 12, 2025

Fixes #366

This PR adds support for training HiDream-Image using the Diffusers adaptation. Additionally, take a look at the Diffusers Trainer: huggingface/diffusers#11281

Currently, only the Full and Dev checkpoints are supported.

Training "Fast" requires extensive testing with timestep schedules (possibly LADD), so it is not covered in this PR.

Full fine-tuning may fail even on an 80 GB GPU until expert parallelism or better load balancing with gradient checkpointing is implemented.
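
To give a rough sense of why a single 80 GB GPU may not be enough for full fine-tuning, here is a back-of-the-envelope estimate of the memory needed for weights, gradients, and optimizer state alone. This is only a sketch: the parameter count (about 17 billion), the bf16 weights and gradients, and the two fp32 Adam moments are assumptions for illustration, not values taken from this PR, and `training_memory_gib` is a hypothetical helper. Activations, text encoders, and the VAE are ignored.

```python
def training_memory_gib(num_params, weight_bytes=2, grad_bytes=2, optimizer_bytes=8):
    """Rough lower bound in GiB: weights + gradients + optimizer state.

    Defaults assume bf16 weights and gradients (2 bytes each) and two fp32
    Adam moments (8 bytes per parameter). Activations are not counted.
    """
    total_bytes = num_params * (weight_bytes + grad_bytes + optimizer_bytes)
    return total_bytes / 1024**3


# Assumption: roughly 17e9 trainable parameters in the transformer.
estimate = training_memory_gib(17e9)
print(f"Estimated minimum: {estimate:.0f} GiB (before activations)")
print("Fits on one 80 GB GPU:", estimate < 80)
```

Under these assumptions the estimate exceeds 80 GB before any activations are stored, which is consistent with the caveat above; LoRA-style training changes the picture because only a small fraction of parameters carries gradients and optimizer state.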

Note: This PR is a work-in-progress and may not work as intended yet!

Successfully merging this pull request may close these issues.

Add support for HiDream