
Implement Frequency-Decoupled Guidance (FDG) as a Guider #11976


Open · wants to merge 8 commits into main
Conversation

dg845
Contributor

@dg845 dg845 commented Jul 23, 2025

What does this PR do?

This PR implements frequency-decoupled guidance (FDG) (paper), a new guidance strategy, as a guider. The idea behind FDG is to decompose the CFG prediction into low- and high-frequency components and apply guidance to each component separately via a CFG-style update (with separate guidance scales $w_{low}$ and $w_{high}$). The authors find that low guidance scales work better for the low-frequency components and high guidance scales work better for the high-frequency components (i.e., one should typically set $w_{low} < w_{high}$).

Fixes #11956.
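The core update described above can be sketched as follows. This is a simplified two-level illustration, not the PR's code: `low_pass` (a blur/downsample/upsample round trip) stands in for one level of the Laplacian-pyramid frequency transform $\psi$.

```python
import torch
import torch.nn.functional as F


def fdg_update(pred_cond, pred_uncond, w_low=5.0, w_high=10.0):
    """Frequency-decoupled guidance, two-level sketch.

    Decompose both predictions into low- and high-frequency parts,
    apply a CFG-style update per band, then recombine.
    """

    def low_pass(x):
        # Simple stand-in for one Laplacian pyramid level.
        down = F.avg_pool2d(x, kernel_size=2)
        return F.interpolate(down, scale_factor=2, mode="bilinear", align_corners=False)

    low_c, low_u = low_pass(pred_cond), low_pass(pred_uncond)
    high_c, high_u = pred_cond - low_c, pred_uncond - low_u

    # CFG-style update per frequency band, typically with w_low < w_high.
    guided_low = low_u + w_low * (low_c - low_u)
    guided_high = high_u + w_high * (high_c - high_u)
    return guided_low + guided_high
```

Note that when `w_low == w_high == w`, the bands recombine exactly into the standard CFG update `pred_uncond + w * (pred_cond - pred_uncond)`, so FDG strictly generalizes CFG.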

Before submitting

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@a-r-r-o-w
@yiyixuxu
@Msadat97

@dg845
Contributor Author

dg845 commented Jul 23, 2025

Some notes on the initial implementation:

  1. I have followed the paper implementation in Algorithm 2, which uses the kornia library to build a Laplacian pyramid as the frequency transform $\psi$. I'm not sure whether kornia is already a dependency of diffusers; it happens to be installed in the dev environment I'm using, but it doesn't appear to be explicitly pinned in setup.py.
  2. Right now, the FrequencyDecoupledGuidance class accepts guidance_scale_low and guidance_scale_high arguments in __init__ for $w_{low}$ and $w_{high}$, and similarly for other parameters such as parallel_weights_low/parallel_weights_high. Alternatively, we could accept a single argument such as guidance_scales: Tuple[float, ...] = (10.0, 5.0) for $w_{high} = 10$ and $w_{low} = 5$, and have all similar parameters (e.g. parallel_weights, guidance_rescale, etc.) be tuples of the same length. The latter approach is nice because it supports multiple frequency levels and makes it easy to decouple every parameter per frequency level, but it might be less usable if using only 2 levels (low and high frequency) is the dominant use case.
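A sketch of what the tuple-based signature from option 2 could look like (the parameter names mirror the ones above, but this is an illustration, not the PR's actual `__init__`):

```python
from typing import Optional, Tuple


class FrequencyDecoupledGuidance:
    """Sketch of a tuple-based __init__ (option 2 above).

    Scales are ordered from highest to lowest frequency level,
    e.g. (w_high, w_low) = (10.0, 5.0).
    """

    def __init__(
        self,
        guidance_scales: Tuple[float, ...] = (10.0, 5.0),
        guidance_rescale: Tuple[float, ...] = (0.0, 0.0),
        parallel_weights: Optional[Tuple[float, ...]] = None,
    ):
        # Every per-level tuple must match the number of frequency levels.
        if len(guidance_rescale) != len(guidance_scales):
            raise ValueError("All per-level tuples must have the same length.")
        if parallel_weights is not None and len(parallel_weights) != len(guidance_scales):
            raise ValueError("All per-level tuples must have the same length.")
        self.guidance_scales = guidance_scales
        self.guidance_rescale = guidance_rescale
        self.parallel_weights = parallel_weights
        # The number of Laplacian pyramid levels follows the tuple length.
        self.levels = len(guidance_scales)
```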

@Msadat97

Thank you for the quick implementation. Regarding your question, I believe it's cleaner to use tuples for the weights, as it allows users to seamlessly apply multiple levels when finer control over the generation is needed.

Member

@a-r-r-o-w a-r-r-o-w left a comment


@dg845 Thanks for taking it up, implementation looks great!

What you suggested about tuples sounds good, let's do that. We can always simplify the implementation later if needed, since modular guiders are experimental at the moment. (Plus, users can pass their own guider implementations, so anyone who wants a simpler version can easily take your implementation and make the necessary modifications.)

Let's not add kornia as a dependency. Instead, we can do the same thing done in the attention dispatcher (import only if the package is available):

```python
if _CAN_USE_FLASH_ATTN_3:
```

```python
import math
from typing import TYPE_CHECKING, Dict, List, Optional, Tuple, Union

import kornia
```
Member


Could we add an is_kornia_available function to diffusers.utils.import_utils and import kornia only if the user already has it installed? A check could exist in __init__ as well, so that instantiating the FDG guider fails if kornia isn't available.

Contributor Author


I have added an is_kornia_available function to utils and added logic in the FDG guider to import from kornia only if it is available, following the Flash Attention 3 example above.
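The resulting pattern looks roughly like the following. This is a sketch of the guarded-import idea, not the exact code: the real `is_kornia_available` helper lives in `diffusers.utils.import_utils` and follows the conventions of the other `is_*_available` helpers there, while this standalone version uses `importlib` directly.

```python
import importlib.util


def is_kornia_available() -> bool:
    # True if the optional kornia package is installed.
    return importlib.util.find_spec("kornia") is not None


# Module level: import the optional dependency only when present.
if is_kornia_available():
    import kornia


class FrequencyDecoupledGuidance:
    def __init__(self, guidance_scales=(10.0, 5.0)):
        # Fail fast at instantiation if the optional dependency is missing.
        if not is_kornia_available():
            raise ImportError(
                "FrequencyDecoupledGuidance requires the `kornia` library. "
                "Install it with `pip install kornia`."
            )
        self.guidance_scales = guidance_scales
```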

@dg845 dg845 marked this pull request as ready for review July 24, 2025 00:36
@dg845 dg845 changed the title [WIP] Implement Frequency-Decoupled Guidance (FDG) as a Guider Implement Frequency-Decoupled Guidance (FDG) as a Guider Jul 24, 2025
@dg845
Contributor Author

dg845 commented Jul 24, 2025

Hi @Msadat97, quick question: how should FDG interact with guidance rescaling (from https://arxiv.org/pdf/2305.08891)? Currently, I'm rescaling in frequency space for each frequency level, with different guidance_rescale values allowed for different levels, but would it make more sense to rescale after the FDG prediction is mapped back to data space (in which case there would only be one guidance_rescale value for all frequency levels)?

@Msadat97

It seems more natural to perform a single rescaling at the end (after the FDG prediction) since FDG is meant to replace the CFG output. Rescaling in the frequency domain is also possible, but I can’t comment further as we haven’t tested FDG with guidance rescaling. Do you have any output comparisons for this?

@SahilCarterr
Contributor

Can you share a code snippet showing how to use FDG? @dg845

@Msadat97

@dg845 I noticed a mistake in the implementation. pred_cond and pred_uncond in the for loop should come from the Laplacian pyramid, but the current code uses the values in the data space. Could you please fix this? The correct approach is given in the paper:

[Screenshot of the relevant algorithm from the paper]
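The fix amounts to indexing the pyramid levels inside the loop rather than reusing the data-space tensors. A minimal sketch (with `build_laplacian_pyramid` as a simple stand-in for kornia's implementation, levels ordered from highest to lowest frequency to match the `(w_high, w_low)` tuple convention above):

```python
import torch
import torch.nn.functional as F


def build_laplacian_pyramid(x, levels=2):
    """Minimal Laplacian pyramid: high-frequency residuals first,
    coarsest (low-frequency) level last."""
    pyramid, current = [], x
    for _ in range(levels - 1):
        down = F.avg_pool2d(current, kernel_size=2)
        up = F.interpolate(down, size=current.shape[-2:], mode="bilinear", align_corners=False)
        pyramid.append(current - up)  # high-frequency residual
        current = down
    pyramid.append(current)  # coarsest level
    return pyramid


def fdg(pred_cond, pred_uncond, guidance_scales=(10.0, 5.0)):
    cond_pyr = build_laplacian_pyramid(pred_cond, len(guidance_scales))
    uncond_pyr = build_laplacian_pyramid(pred_uncond, len(guidance_scales))
    guided_levels = []
    for lvl, w in enumerate(guidance_scales):
        # The per-level predictions must come from the pyramid,
        # not from the data-space tensors.
        lvl_cond, lvl_uncond = cond_pyr[lvl], uncond_pyr[lvl]
        guided_levels.append(lvl_uncond + w * (lvl_cond - lvl_uncond))
    # Reconstruct: upsample the coarse level and add the residuals back.
    out = guided_levels[-1]
    for residual in reversed(guided_levels[:-1]):
        out = F.interpolate(out, size=residual.shape[-2:], mode="bilinear", align_corners=False)
        out = out + residual
    return out
```

As a sanity check, setting every guidance scale to 1.0 should reproduce `pred_cond` exactly, since the guided pyramid then equals the conditional pyramid.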

Successfully merging this pull request may close these issues.

Frequency-Decoupled Guidance (FDG) for diffusion models