GitHub - shaitanzx/Fooocus_extend: Focus on prompting and generating

I would like to introduce a new fork of the popular generative neural network Fooocus - Fooocus extend. I would like to point out that this fork can be run both locally on your computer and via Google Colab. Let's look at everything in order.

Full list of differences from the original Fooocus

Modules running during generation
- OneButtonPrompt
- Prompt Translate
- PhotoMaker
- InstantID
- Inswapper
- CodeFormer
- Vector
Additional modules
- Image Batch
- Prompt Batch
- X/Y/Z Plot
- Inswapper
- CodeFormer
- Remove Background
- Vector
Tools
- Civitai Helper
- TextMask
- SVGcode
- Roller
- OpenPoseEditor
- Logo
- Photopea
Select the resolution and aspect ratio of the generated image
Wildcard
OpenPose ControlNet
Recolor ControlNet
Scribble ControlNet
Manga Recolor ControlNet
Save Image Grid for Each Batch
Filename Prefix
Paths and Presets
Load file of style
View LoRA trigger words and view the models page on civitai.com
Seamless tiling
Transparency

Launch. If you will run it on a local machine, you can safely skip this item.

Before launching you are offered to choose the following settings Fooocus Profile - select which profile will be loaded at startup (default, anime, realistic). Fooocus Theme - select a theme - light or dark. Tunnel - select launch tunnel. When it happens that gradio stops working for some reason, you can choose cloudflared tunnel. However, the generation is a bit slower Memory patch - adds a few keys to the launch bar that allow you to optimise your graphics card if you are using the free version of Google Colab. If you have paid access, this item can be disabled GoogleDrive output - connects your GoogleDisk and save all of your generation directly to it.

The “Extensions” panel is divided into three groups:

in generation - modules working during generation
modules - generation modules
tools - auxiliary tools

“in generation” panel

OneButtonPrompt - allows you to generate prompts, generate variations of your prompt and runs to generate images from them. I will not dwell on the detailed description of this module - I will only point out the main functions

On the "Main" tab, you can select a preset for the hint generation theme, as well as specify additional hint prefixes and suffixes. You can also create and save your own preset in this tab.

On the "Prompt assist" tab, you can generate 5 prompts and move each of them to the Fooocus prompt field. When you select the "Prompt assist" mode, during image generation, not a new prompt will be generated, but variations of the prompt specified in the prompt field. The strength of variation is selected by the slider located directly below it.

In this tab you can select the hint syntax for different generation models, the hint length, enable the hint generation enhancement tool, and specify the number of iterations of the prompt generation for one image.

This is where you can control the generation of negative prompts

Prompt Translate

Allows to translate both positive and negative prompts from any language into English, both before generation and directly during generation.

PhotoMaker (works only in the local version)

The module allows you to generate an image with a reference face Upload a photo of your face - your reference face should be uploaded here. Unlike normal generation, this 30-step generation on Nvidia 3060 12GB takes about 4 minutes.

InstantID (works only in the local version)

The module allows you to generate an image with a reference face Upload a photo of your face - your reference face should be uploaded here.

Upload a reference pose image (Optional) - a reference pose image will be uploaded here. Optional.

Pregeneration image - if this mode is enabled, then a new image will be generated as a reference pose image based on your Prompt.

If you have not specified a pose reference image, an image will be generated from your prompt based only on the face image and the same resolution.

If a pose reference image is present, the dimensions will be the same as the pose reference image.

IdentityNet strength (for fidelity) - responsible for the accuracy of face repetition.

Image adapter strength (for detail) - responsible for details

ContrloNet

canny - used to define image contours
depth - used to create a depth map of the image Schedulers - select the Schedulers that will be used when generating the image with our face. Enhance non-face region - enhance non-face parts of the image

Unlike normal generation, this 30-step generation on Nvidia 3060 12GB takes about 7 minutes.

Inswapper

This module is also intended for face replacement.

Source Image Index - index of the face in the reference image. Faces are numbered from left to right from top to bottom starting from zero. If you specify -1, the average mixed face of all available faces will be taken as the reference face

Target Image Index - index of the face in the output image. This is the index of the face to be replaced in the output image. If you specify -1, all faces will be replaced.

Source Face Image - image with face

CodeFormer

Face enhancement module with upscale capability

Pre_Face_Align - aligns the face if it is tilted

Background Enchanced - improve background quality

Face Upsample - adjust reference face to the size of the input image face

Upscale - image enlargement

Codeformer_Fidelity - signability coefficient, inversely proportional to quality

Vector

This module allows you to get images in svg format in b/w mode.

Transparent PNG - Pre-Create Transparency

Noise Tolerance - setting for cutting off noise in the original image

Quantize - the level of image quantization

Keep temp images - do not save the intermediate transparency file

Threshold - line curvature threshold

“modules” panel

Image Batch (batch image processing)

In a nutshell, this module allows you to perform group scaling of images, as well as create images based on a group of existing images using ImagePromt (ControlNet). To better understand this module, I suggest you conduct some experiments yourself. But I want to note that using it allows you to use your images as references and change their style depending on the prompt and model you choose. First, depending on the "Uplode ZIP-file" selector, you need to upload several reference images or a zip archive with them. The archive should not contain subfolders, file names should not contain non-Latin characters and spaces.

Next, select the mode of changing the image resolution

NOT scale - the source image resolution will not be taken into account during generation
to ORIGINAL - this means that immediately before generation the resolution equal to the resolution of the source image will be selected.
to OUTPUT - in this case, before generation the resolution of the source image will be changed to the generated one with preserving the proportions

Depending on what you want to do with the source images, select Action - Upscale or ImagePrompt. From the Method drop-down list, select the appropriate image processing method. If you are using ImagePrompt, you must also select the "Stop at" and "Weight" options.

Start batch - starts the process for execution.

When the process is finished, click on Output->Zip button to create an archive with all previously created images from the output folder. The archive itself will appear in the "Download a Zip file" window. You can download it from there.

Clear Output - clear the output folder. It should be noted that not only the folder for the current date is cleared, but also the whole folder.

Prompt Batch (batch processing of prompts)

This module allows you to run generation of several hints sequentially one after another. To do this, you should fill in the table. Enter a positive hint in the hint column and a negative hint in the negative hint column respectively. Clicking the New Row button will add an empty row to the end of the table. Delete Last Row deletes the last row of the table. Start batch starts the execution of the list of prompts to generate. You can also choose to add basic positive and negative hints. None - no base hints will be added. Prefix - base hints will be added before the table hints. Suffix - basic hints will be added after hints from the table.

only positive prompts - if this item is active, all prompts in the file will be considered positive, otherwise, both positive and negative prompts will be loaded Load prompts from file - allows to load the list of positive and negative prompts from a text file into the table. The file can have any extension

As an example, consider a file with the following contents

---- start of file -------

------ end of file --------

If the item ‘only positive prompts’ is active, then the table with prompts will have the following form

Otherwise

This means that in the first case all prompts in the file are treated as positive, and empty lines are ignored. In the second case, the file first contains a line with a positive prompt, followed by a line with a negative prompt. If you don't need to specify a negative hint, leave this line blank, but the positive hint line must always be there.

X/Y/Z Plot

This extension allows you to make image grids to make it easier to see the difference between different generation settings and choose the best option. You can change the following parameters - Styles, Steps, Aspect Ratio, Seed, Sharpness, CFG (Guidance) Scale, Checkpoint, Refiner, Clip skip, Sampler, Scheduler, VAE, Refiner swap method, Softness of ControlNet, and also replace words in the prompt and change their order

Inswapper

The full analog of this module in the “in generation” panel, unlike which you need to load an additional input image

CodeFormer

The full analog of this module in the “in generation” panel, unlike which you need to load an additional input image

Remove Background

This extension is designed to add background removal, image/video processing, and blending to your projects. It provides precise background removal with support for multiple models, chroma keying, foreground adjustments, and advanced effects. Whether you’re working with images or videos, this extension provides everything you need to efficiently process visual content.

Key Features:

Multi-model background removal: Supports u2net, isnet-general-use, and other models.

Chroma keying support: Removes specific colors (green, blue, or red) from the background.

Blending modes: 10 powerful blending modes for image compositing.

Foreground adjustments: Scale, rotate, flip, and position elements precisely.

Video and image support: Easily process images and videos.

Multi-threaded processing: Efficiently process large files with streaming and GPU support. Customizable output formats: Export to PNG, JPEG, MP4, AVI, and more.

All processing results are automatically saved to the output folder without saving to History Log

Vector

The full analog of this module in the “in generation” panel, unlike which you need to load an additional input image

“tools” panel

Civitai Helper

This extension allows you to download models for generation from the civitai website. To download a model you first need to specify your Civitai_API_key. In the Download Model section in the Civitai URL field you need to specify a link to the required model from the browser address bar and click Get Model Info by Civitai URL. After analysing the link you will be given information about the model. You will also be able to select the version of the model before downloading. This extension also allows you to find duplicates of downloaded models and check for updates. In addition, there is a group download option.

TextMask

Fast text editor with mask creation for ControlNet and Inpaint!

What this tool can do:

Adds up to 5 text blocks to any image
Supports fonts of any size up to 300px
Works with Cyrillic, hieroglyphics and other non-Latin characters
Generates black and white mask with one click
Lets you forget Photoshop for 90% of simple text tasks
Supports a sufficient number of fonts

How to use:

Upload an image
Add and edit the text as you like
Generate a mask in two clicks
Download and use in ControlNet (CPDS or PyraCanny) or as a mask for Inpaint to stylise text and blend into an image.

SVGcode

Module for processing raster images for saving from to svg

Roller

This module is designed for rolling tiles to give it a more aesthetic appearance without losing the seamless effect.

Y_shift - vertical image shift.

Х_shift - horizontal image shift.

Save to OUTPUT FOLDER - saving the image to the output folder

OpenPoseEditor

This module allows you to create skeletons for subsequent image creation using OpenPose ControlNet. You can also create a skeleton from an existing image.

Logo - Logo insertion tool with automatic placement selection based on image content

Size ratio - the relative size of the logo to the image. Determines what percentage of the smaller side of the image the logo will occupy.

Margin Ratio - logo offset from image edges. Determines how far from the corner the logo will be placed.

Minimal complexity for background - the background complexity threshold at which a background is automatically added under the logo.

Сorner priority - select the priority of choosing the angle when overlaying the logo.

Photopea - a free online analogue of Photoshop

In addition to these extensions, there are several other add-ons available

Select the resolution and aspect ratio of the generated image

This setting is located in the generation resolution selection tab. Here you can select the number of horizontal and vertical points, aspect ratio. To apply the settings, click the Set button and select this resolution from the list of proposed resolutions. Your resolution will be the second to last. You can also select a random aspect ratio for each generation from the available list of aspect ratios

Wildcard

This module allows you not to memorise existing files with wildcard words, but to select them directly from a list of dictionaries. You can also select directly the item you need from the list.

OpenPose ControlNet

Allows you to create an image based on the pose skeleton.

Recolor ControlNet

Allows you to colorize an image based on a black and white image.

Scribble ControlNet

Allows you to color an image based on a sketch.

Manga Recolor ControlNet

This is a specially trained ControlNet model designed to automatically colorize grayscale images in anime style.

The model takes grayscale anime images as input and generates a colorized version.

An anime model is required for proper generation. Works with or without a clarifying prompt.

Save Image Grid for Each Batch

Filename Prefix

This setting may be useful when working on several projects to separate one from another.

Paths and Presets

Here you can change the paths to your models if they are already in other folders on the disk. If your Сheckponts or LORAs are in different folders, then the paths to them can be specified separated by commas (,). After changing the path, it is best to restart FooocusExtend.

Also here you can create a new preset based on the existing settings, and delete any of the existing ones, except for default and initial. The preset saves the following parameters: base model, refiner, refiner_switch, loras settings, cfg scale, sharpness, CFG Mimicking from TSNR, clip_skip, sampler, scheduler, Forced Overwrite of Sampling Step, Forced Overwrite of Refiner Switch Step, performance, image number, prompt negative, styles selections, aspect ratio, vae, inpaint_engine_version

Load file of style

Allows you to upload a file (in *.json format) with custom styles

View LoRA trigger words and view the models page on civitai.com

If trigger words or links to model pages are not displayed, you will need to scan the models in the Civitai Helper module in the "Scan Models" section.

Seamless tiling

Settings for creating seamless tiles. Located in Advanced - Developer Debug Mode - Control - Tiled. Sometimes a little edge refinement is required in any Photo Editor.

Transparency

Settings for creating images on a transparent background and a mask for it are located in the "Advanced" - "Developer Debug Mode" - "Controls" - "Transparency" section.

None - the normal generation mode

Attention Injection - This mode uses LoRA rank 256, turning SDXL into a transparent image generator. It transforms the model's latent distribution into a "transparent latent space" that can be decoded by a dedicated VAE pipeline.

Conv Injection - This method uses an alternative model to transform SDXL into a transparent image generator. It uses biases on all convolutional layers (and, in fact, on all layers that are not q, k, v in any of the attention layers). These biases can be combined with any XL model to change the latent distribution to transparent images. Since learning the biases on all q, k, v layers was eliminated, the understanding of SDXL should be fully preserved. However, in practice, this first method has proven to yield better results. This method is used for some special cases that require special understanding. This method can have a strong impact on the style of the underlying model. This extension is based on layerdiffuse by lllyasviel (https://github.com/lllyasviel/sd-forge-layerdiffuse)

	Fooocus_extend. Base version 2.5.5
	Original Fooocus Colab

All suggestions and questions can be voiced in the Telegram-group

Change log

v9.2.3

View the models page on civitai.com

v9.2.2

Settings for Transparency generation

v9.2.1

Add Logo in tools

v9.2.0

Add Vector Style
Add SVGcode
Add Vector in generate mode
Add Vector in module
Add batch in Codeformer in module
Add batch in Inswapper in module
Optimized ImageBatch
The x/y/z-plot module has been updated with the following parameters: Lora Name, Lora weight, Codeforeme, refiner settings, aspect ratio selection
Final preview of ImageBatch
Final preview of PromptBatch

v9.1.7

Add Roller in tools

v9.1.6

Settings for Seamless tiling

v9.1.5

A deeper integration of the OneButtonPrompt module has been carried out and it has been moved to the 'in generation' group

v9.1.4

Some bug fix

v9.1.3

Add Manga Recolor ControlNet

v9.1.2

Added selector for choosing to load individual files or zip archive in Image Batch module

v9.1.1

Add view trigger words of LoRA
Bug fix in wildcards
Add Civitai_API_key saving
Add preview files for InstantId and Photomaker styles

v9.1.0

Add PhotoMaker module
Add PhotoMaker styles
Add Random Aspect Ratio

v9.0.0

Dividing extensions into groups
Add InstantID module
Add Inswapper module
Add CodeFormer module
Add InstanID styles

v8.1.0

Some bug fix
Add TextMask - Fast text editor with mask creation for ControlNet and Inpaint

v8.0.4

User style upload
Fixed maximum height of PromptBatch
Option to load only positive prompts

v8.0.3

Some bug fix
Add load prompt from files in PromptBatch
The ImageBatch extension interface has been simplified

v8.0.2

Some bug fix

v8.0.1

Add Filename Prefix
Add Paths and Presets Settings

V8

Save Image Grid for Each Batch
Add X/Y/Z Plot Extention
Prompt Batch is now in the extensions panel
Images Batch has become easier to manage while retaining its functionality
Images Batch is now in the extensions panel
Add support VAE and LyCoris in Civitai_Helper
The extension Remove Background has been changed
Add auto update on startup

V7

Add OpenPoseEditor
Fix bug in Image Batch Mode
Added cell selection in Image Batch Mode
Added selection of adding base prompts in Prompt Batch Mode
Add OpenPose ControlNet
Add Recolor ControlNet
Add Scribble ControlNet

V6

Add Prompt Batch Mode
Rename Batch Mode to Images Batch Mode
Fixed an incorrect start random number in Batch Mode
Add visual management of Wildcard and Words/phrases of wildcard
Added the ability to set any resolution for the generated image
Add OneButtonPrompt

V5

Model Downloader replaced with Civitai Helper

V4

Add VAE download
Add Batch mode

V3

Add Photopea
Add Remove Background
Add Extention Panel
All extensions are available in Extention Panel

V2

Added a Model Downloader to Fooocus webui instead of colab

V1

added the ability to download models from the civitai.com
saving the generated image to Google Drive
added prompt translator
added a patch for the ability to work in free colab mode

Name		Name	Last commit message	Last commit date
Latest commit History 2,197 Commits
.github		.github
css		css
extentions		extentions
extras		extras
javascript		javascript
language		language
ldm_patched		ldm_patched
models		models
modules		modules
presets		presets
sdxl_styles		sdxl_styles
tests		tests
wheel		wheel
wildcards		wildcards
.dockerignore		.dockerignore
.gitattributes		.gitattributes
.gitignore		.gitignore
Dockerfile		Dockerfile
Fooocus_extend.ipynb		Fooocus_extend.ipynb
Fooocus_extend_wo_update.ipynb		Fooocus_extend_wo_update.ipynb
LICENSE		LICENSE
README.md		README.md
args_manager.py		args_manager.py
auth-example.json		auth-example.json
build_launcher.py		build_launcher.py
development.md		development.md
docker-compose.yml		docker-compose.yml
docker.md		docker.md
entry_with_update.py		entry_with_update.py
entrypoint.sh		entrypoint.sh
environment.yaml		environment.yaml
experiments_expansion.py		experiments_expansion.py
experiments_face.py		experiments_face.py
experiments_interrogate.py		experiments_interrogate.py
experiments_mask_generation.py		experiments_mask_generation.py
fooocus_version.py		fooocus_version.py
launch.py		launch.py
notification-example.mp3		notification-example.mp3
patcher_tunel.py		patcher_tunel.py
readme.md		readme.md
requirements_docker.txt		requirements_docker.txt
requirements_versions.txt		requirements_versions.txt
shared.py		shared.py
troubleshoot.md		troubleshoot.md
update_FooocusExtend_log.txt		update_FooocusExtend_log.txt
update_log.md		update_log.md
webui.py		webui.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

About

Uh oh!

Releases 6

Packages

Uh oh!

Languages

License

shaitanzx/Fooocus_extend

Folders and files

Latest commit

History

Repository files navigation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 6

Packages 0

Uh oh!

Languages

Packages