- Batch Caption – generates automatic image descriptions with MiaoshouAI / Florence‑2 PromptGen v2.0.
- Manual Viewer – displays images 20 at a time, lets you review and correct each prompt manually, and saves instantly.
⚡ Optimized for CUDA with fp16 and SDPA; also works on CPU if needed.
Windows or Linux. An NVIDIA graphics card is recommended.
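As a rough illustration of the GPU-first, CPU-fallback behaviour described above, the sketch below picks runtime settings depending on whether CUDA is available. The function names `cuda_available` and `pick_runtime` and the returned keys are hypothetical and not taken from this project's code; using float32 on CPU is an assumption.

```python
def cuda_available() -> bool:
    """Return True if PyTorch is installed and can see a CUDA device."""
    try:
        import torch  # imported lazily so the sketch also runs without PyTorch
    except ImportError:
        return False
    return torch.cuda.is_available()


def pick_runtime(has_cuda: bool) -> dict:
    """Choose device and precision (hypothetical helper, not the project's API)."""
    if has_cuda:
        # fp16 on the GPU, with scaled dot-product attention (SDPA)
        return {"device": "cuda", "dtype": "float16", "attention": "sdpa"}
    # Assumption: fall back to full precision when running on CPU
    return {"device": "cpu", "dtype": "float32", "attention": "sdpa"}


if __name__ == "__main__":
    print(pick_runtime(cuda_available()))
```

Passing the CUDA check in as an argument keeps the choice easy to test on machines without a GPU.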
- Python 3.10 64-bit | https://www.python.org/downloads/release/python-31011/ (✔ check "Add to PATH", then restart)
- Run start.bat (Windows) or start.sh (Linux)
Images in the selected input/ folder are processed automatically and sent to a timestamped folder under /Florence Caption/output/.
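To make the timestamped output folder concrete, here is a minimal sketch of how such a folder could be created. The helper name `make_output_dir` and the timestamp format are assumptions for illustration; the tool's actual naming scheme may differ.

```python
from datetime import datetime
from pathlib import Path
from typing import Optional


def make_output_dir(base: Path, now: Optional[datetime] = None) -> Path:
    """Create and return base/<timestamp>/ (the timestamp format is an assumption)."""
    stamp = (now or datetime.now()).strftime("%Y-%m-%d_%H-%M-%S")
    out = base / stamp
    out.mkdir(parents=True, exist_ok=True)  # also creates base if it is missing
    return out


if __name__ == "__main__":
    print(make_output_dir(Path("output")))
```

Each run gets its own folder, so earlier results are not overwritten.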
You can then switch to the Manual Viewer tab to edit any prompts that don't suit you. Changes are saved when you click outside the text area.
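The viewer's paging and save-on-edit behaviour can be sketched roughly as follows. The names `page_of` and `save_caption` are hypothetical, and storing each caption as a `.txt` file next to its image is an assumption, not something this README confirms.

```python
from pathlib import Path

PAGE_SIZE = 20  # the Manual Viewer shows images 20 at a time


def page_of(items: list, page: int, size: int = PAGE_SIZE) -> list:
    """Return the items belonging to a zero-based page index."""
    start = page * size
    return items[start:start + size]


def save_caption(image_path: Path, text: str) -> Path:
    """Write the caption to <image name>.txt beside the image (assumed layout)."""
    caption_path = image_path.with_suffix(".txt")
    caption_path.write_text(text.strip(), encoding="utf-8")
    return caption_path
```

In a UI, `save_caption` would typically be called from the text area's "lost focus" event, which matches saving when you click outside it.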