-
Notifications
You must be signed in to change notification settings - Fork 125
Open
Description
Thank you for sharing the great work!
I would like to ask how the weight of the transformer of ControlNet can be initialized from the Stable Diffusion 3.5 model for training?
Particularly, when I check ControlNet for Canny and Depth, the transformer uses SD3SingleTransformerBlock
, while SD3.5's transformer uses JointTransformerBlock
.
Is it correct that ControlNet's transformer just copies the query, key, value, output projection weights, and the FFN from the SD3.5 transformer?
I am looking forward to receiving your answer.
Much appreciate!
Metadata
Metadata
Assignees
Labels
No labels