Hey 👋
I'm Aryan from the HuggingFace Diffusers team. I am working on integrating FasterCache into the library to make it available for all the video models we support. I had some questions regarding the implementation and was hoping to get some help.
In the paper, the section describing CFG Cache has the following:
> These biases ensure that both high- and low-frequency differences are accurately captured and compensated during the reuse process. In the subsequent n timesteps (from t − 1 to t − n), we infer only the outputs of the conditional branches and compute the unconditional outputs using the cached ∆HF and ∆LF as follows:
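For concreteness, here is a minimal sketch of how I am reading that reuse step. The function names, the FFT-based frequency split, and the cutoff are my own assumptions for illustration, not taken from the paper's code or this repository:

```python
import torch

def split_freq(x, cutoff=0.25):
    """Split a latent into low/high-frequency parts with a centered 2D FFT mask."""
    freq = torch.fft.fftshift(torch.fft.fft2(x.float()), dim=(-2, -1))
    h, w = x.shape[-2:]
    cy, cx = h // 2, w // 2
    ry, rx = int(h * cutoff), int(w * cutoff)
    mask = torch.zeros_like(freq, dtype=torch.bool)
    mask[..., cy - ry : cy + ry, cx - rx : cx + rx] = True
    return freq * mask, freq * ~mask  # (low, high)

def cache_deltas(uncond_out, cond_out):
    # At timestep t: run both branches and cache the frequency-domain deltas.
    lf_u, hf_u = split_freq(uncond_out)
    lf_c, hf_c = split_freq(cond_out)
    return lf_u - lf_c, hf_u - hf_c  # (∆LF, ∆HF)

def approx_uncond(cond_out, lf_delta, hf_delta):
    # At timesteps t-1 ... t-n: run only the conditional branch and
    # reconstruct the unconditional output from the cached deltas.
    lf_c, hf_c = split_freq(cond_out)
    freq = (lf_c + lf_delta) + (hf_c + hf_delta)
    return torch.fft.ifft2(torch.fft.ifftshift(freq, dim=(-2, -1))).real
```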
The quote says that inference is run for the conditional branch, and the outputs of the unconditional branch are computed with the given equations. This is the relevant line of code that seems to do what is described:
```python
single_output = self.fastercache_model_single_forward(
    hidden_states[:1],
    timestep[:1],
    encoder_hidden_states[:1],
    added_cond_kwargs,
    class_labels,
    cross_attention_kwargs,
    attention_mask,
    encoder_attention_mask,
    use_image_num,
    enable_temporal_attentions,
    return_dict,
)[0]
```
However, the inputs are indexed as `hidden_states[:1]`, `timestep[:1]`, `encoder_hidden_states[:1]`. Doesn't this correspond to the unconditional inputs rather than the conditional ones? I believe these are the unconditional inputs because the prompt embeddings are concatenated in the order `(negative_prompt_embeds, prompt_embeds)` here.
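To make the ordering concrete, here is a small illustration (the tensor shapes are hypothetical; only the concatenation order matters):

```python
import torch

# Hypothetical shapes, just to illustrate the CFG batch layout.
negative_prompt_embeds = torch.zeros(1, 77, 768)  # unconditional embeddings
prompt_embeds = torch.ones(1, 77, 768)            # conditional embeddings

# With this concatenation order (as in the linked pipeline code),
# batch index 0 is the unconditional sample:
encoder_hidden_states = torch.cat([negative_prompt_embeds, prompt_embeds], dim=0)

uncond_inputs = encoder_hidden_states[:1]  # <- what the snippet above selects
cond_inputs = encoder_hidden_states[1:]    # <- what the paper says to run
```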
Is this incorrect by any chance? Or is the unconditional branch being used to approximate the output of the conditional branch?
Thank you for your time! 🤗