@@ -529,15 +529,14 @@ def __call__(
529
529
their `set_timesteps` method. If not defined, the default behavior when `num_inference_steps` is passed
530
530
will be used.
531
531
true_cfg_scale (`float`, *optional*, defaults to 1.0):
532
- When > 1.0 and a provided `negative_prompt`, enables true classifier-free guidance.
532
+ True classifier-free guidance (guidance scale) is enabled when `true_cfg_scale` > 1 and
533
+ `negative_prompt` is provided.
533
534
guidance_scale (`float`, defaults to `6.0`):
534
- Guidance scale as defined in [Classifier-Free Diffusion
535
- Guidance](https://huggingface.co/papers/2207.12598). `guidance_scale` is defined as `w` of equation 2.
536
- of [Imagen Paper](https://huggingface.co/papers/2205.11487). Guidance scale is enabled by setting
537
- `guidance_scale > 1`. Higher guidance scale encourages to generate images that are closely linked to
538
- the text `prompt`, usually at the expense of lower image quality. Note that the only available
539
- HunyuanVideo model is CFG-distilled, which means that traditional guidance between unconditional and
540
- conditional latent is not applied.
535
+ Embedded guiddance scale is enabled by setting `guidance_scale` > 1. Higher `guidance_scale` encourages
536
+ a model to generate images more aligned with `prompt` at the expense of lower image quality.
537
+
538
+ Guidance-distilled models approximates true classifer-free guidance for `guidance_scale` > 1. Refer to
539
+ the [paper](https://huggingface.co/papers/2210.03142) to learn more.
541
540
num_videos_per_prompt (`int`, *optional*, defaults to 1):
542
541
The number of images to generate per prompt.
543
542
generator (`torch.Generator` or `List[torch.Generator]`, *optional*):
0 commit comments