A way to make AI gen less RNG perhaps? #8898

KindaSavageD3 · 2025-07-14T02:42:10Z


I think we've all been at the point where we keep getting close to the image we want, just in different ways. Maybe you want the hands to be palm up, and no matter what engine you use, it's just not getting it. Sure, you can try a ControlNet to help make sure the pose is right. Or maybe you're going for a specific character, so you use IPAdapter and/or FaceID, and it's almost there, but the specific facial expression you're going for isn't easy to describe. You finally get that facial expression, but now the eyes are completely off, so you throw in a Her Eyes LoRA, and that shifts the expression away even though you're on the same seed.

What if there was a way to sort of 'upvote' pictures that were heading in the general direction and 'downvote' the ones that were moving further away from your goal? I have a limited understanding of how AI image gen works and have only been messing around with it for a couple of months, but what if you could apply some sort of weight to a specific feature to lock it in place when the words in your text prompt just aren't translating well, even in Flux? It could really cut down the 'RNG' element if it were possible to favor certain vectors during the generation process, similar to how I think an embedding works. Maybe it could even be more general than that: "I liked elements from this one but not this." It could also give meaning to those hundreds of images you produced that just didn't hit the mark.

Just a thought. Love your work so far guys, keep it up!

ltdrdata · 2025-07-14T22:15:22Z

Collaborator

The concept you're referring to with "upvote" and "downvote" is actually known as positive conditioning and negative conditioning. This technique uses a method called CFG, where the positive conditioning steers the image generation in the desired direction, and the negative conditioning helps suppress unwanted elements.

At each step of generation, the model makes two predictions for the latent image, one under the positive conditioning and one under the negative, and the final image is guided by widening the gap between these two predictions according to the CFG scale.
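As a rough illustration of the step described above, here is a minimal sketch of the usual CFG blend. It is plain Python on toy lists, not code from ComfyUI or any sampler; the function name `cfg_combine` and the sample values are made up for the example. Real samplers apply the same arithmetic to full latent tensors.

```python
def cfg_combine(cond, uncond, scale):
    """Blend two model predictions with classifier-free guidance.

    cond   -- prediction under the positive conditioning
    uncond -- prediction under the negative (or empty) conditioning
    scale  -- CFG scale; 1.0 returns the positive prediction unchanged
    """
    # Start from the negative prediction and move along the direction
    # (positive - negative), stretched by the CFG scale.
    return [u + scale * (c - u) for c, u in zip(cond, uncond)]


# Toy 4-element "predictions" standing in for latent tensors (hypothetical values).
positive = [0.5, -0.25, 1.0, 0.0]
negative = [0.25, -0.25, 0.5, 0.5]

guided = cfg_combine(positive, negative, scale=3.0)
print(guided)
```

With a scale above 1.0 the result overshoots the positive prediction in whatever direction it differs from the negative one, and elements where the two agree are left alone.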

However, this approach effectively doubles the computation per step, because the model has to run once for each conditioning, which makes it extremely slow for large models like FLUX. Moreover, the released FLUX models that are CFG-distilled can produce significantly degraded images under real CFG unless additional techniques like dynamic thresholding are applied.
