A web app that generates AI-powered image captions, ideal for preparing LoRA training datasets on platforms like fal LoRA Trainer and Replicate LoRA Trainer.
- Dual Model Support: OpenAI API (GPT-4.1 series) or Ollama (local models)
- Batch Processing: Upload and caption multiple images at once
- Customization: Add prefix/suffix to captions
- Export: Download all captions as a ZIP file
- API Key Management: Securely store OpenAI keys in-app
- OpenAI: GPT-4.1 (high-quality), GPT-4.1-mini (balanced), and GPT-4.1-nano (faster, cheaper)
- Ollama: Local vision models (LLaVA, moondream, bakLLaVA), no API key needed
Note: When using the deployed web app with Ollama, you have two options:
- Use ngrok to create a secure tunnel to your local Ollama server.
- Configure Ollama to allow additional web origins via the `OLLAMA_ORIGINS` environment variable (see the Ollama FAQ and LobeHub's Ollama provider documentation). Both approaches are sketched below.
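A minimal sketch of both approaches, assuming Ollama's default port of 11434 and a hypothetical deployment URL:

```bash
# Option 1: tunnel the local Ollama server through ngrok
# (--host-header rewrites the Host header so Ollama accepts the request)
ngrok http 11434 --host-header="localhost:11434"

# Option 2: allow the deployed app's origin, then start the server
# (replace the URL with your actual deployment)
OLLAMA_ORIGINS=https://your-app.example.com ollama serve
```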
Next.js 14, Tailwind CSS, shadcn/ui, Lucide React, Vercel AI SDK
- Node.js (v16+)
- Yarn
- OpenAI API key (if using OpenAI)
- Ollama installed locally (if using local models)
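To confirm the prerequisites are in place, you can check the installed versions from a terminal:

```bash
node --version    # should print v16.x or newer
yarn --version
ollama --version  # only needed when using local models
```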
```bash
# Clone repo
git clone https://github.com/aleksa-codes/gpt-flux-img-captioner.git
cd gpt-flux-img-captioner

# Install dependencies
yarn install

# Start development server
yarn dev
```
Open http://localhost:3000 in your browser.
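For a production build, the standard Next.js scripts apply (assuming the default `package.json` setup):

```bash
# Build an optimized production bundle, then serve it
yarn build
yarn start
```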
- Choose between OpenAI or Ollama
- Upload one or more images
- Add an optional prefix/suffix (see the example after this list)
- Generate captions
- Download as ZIP
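A prefix is handy for prepending a LoRA trigger word to every caption. A hypothetical example of how a prefix and suffix are applied:

```bash
# Prefix: "TOK style, "    Suffix: ", high quality"
# Generated caption:  a cat sleeping on a sofa
# Exported caption:   TOK style, a cat sleeping on a sofa, high quality
```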
- Install Ollama
- Pull a vision model: `ollama pull llava`
- Start the Ollama server (verification commands below)
- Select "Ollama" in the app and choose your model
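A quick way to verify the local setup, assuming the Ollama CLI is on your PATH and listening on its default port:

```bash
# Confirm the vision model was downloaded
ollama list

# Start the server if it is not already running
ollama serve

# Sanity-check that the API responds on the default port
curl http://localhost:11434/api/tags
```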
Contributions welcome! Fork the repo, create a feature branch, and submit a pull request.
MIT License - see the LICENSE file for details.
Made with ❤️ by aleksa.codes