Image Editing Assistant

A modular Python application that uses Google's Gemini API to edit images through three kinds of operation: image analysis, global adjustments, and local object manipulation.

Features

🤖 Intelligent Routing: Automatically routes each request to the info, global edit, local edit, or advanced agent
📊 Image Analysis: Detailed image information and content analysis
🌈 Global Edits: Brightness, contrast, saturation, and color temperature adjustments
🎯 Local Edits: Object detection and inpainting for targeted modifications
💬 Chat-based Editing: Interactive conversational editing sessions

Quick Start

1. Installation

# Clone or download the project files, then change into the project directory
# Install dependencies with uv (a Python package manager)
uv sync

2. Configuration

# Copy the environment template
cp .env.example .env

# Edit .env and add your API key
# .env file:
GEMINI_API_KEY="your_actual_api_key_here"
# Gemini API information: https://gemini.readthedocs.io/en/latest/
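
For illustration, the key could be read from Python roughly as follows. This is a minimal standard-library sketch, not necessarily how the project loads its configuration; `parse_env` and `load_api_key` are hypothetical helper names.

```python
import os
from pathlib import Path


def parse_env(text: str) -> dict[str, str]:
    """Parse KEY=value lines, skipping blank lines and # comments."""
    values = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding quotes such as GEMINI_API_KEY="..."
        values[key.strip()] = value.strip().strip("\"'")
    return values


def load_api_key(env_path: str = ".env") -> str:
    """Return GEMINI_API_KEY from the environment, falling back to a .env file."""
    key = os.environ.get("GEMINI_API_KEY")
    path = Path(env_path)
    if not key and path.exists():
        key = parse_env(path.read_text()).get("GEMINI_API_KEY")
    if not key:
        raise RuntimeError("GEMINI_API_KEY is not set; add it to .env")
    return key
```

Failing early with a clear error keeps a missing key from surfacing later as an opaque API failure.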

Running Scripts

This project uses uv for Python package management. Run Python scripts with uv run so that they execute inside the project's environment:

# Run the command-line interface (no GUI)
uv run main.py

Gradio Web Interface

Usage

Launch the Gradio web interface:

# Launch the web UI
uv run launch_ui.py [--use-gemini-local-edit]

Then open your browser to http://localhost:7860 to access the interface.

Web UI Features:

📤 Image Upload: Upload images via drag-and-drop, file browser, or webcam
🖌️ Mask Drawing: Built-in brush and eraser tools for precise inpainting masks
💬 Chat Interface: Real-time conversation with the AI assistant
🔄 Live Updates: See edits applied in real time
📥 Download: Save your edited images instantly
🎛️ Image Tools: Crop, flip, rotate, and transform images

Agent Architecture

Agent Responsibilities

Router Agent

Analyzes user prompts to determine the appropriate editing action
Routes requests to the appropriate specialized agent: info, global_edit, local_edit, or advanced
Uses the Gemini API to infer user intent and select the agent best suited to the task
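
The routing step described above can be sketched as follows. This is an illustrative outline only: `build_router_prompt`, `parse_route`, and `route` are hypothetical names, `ask_model` stands in for the actual Gemini API call, and falling back to `info` on an unrecognized reply is an assumption rather than documented behavior.

```python
from typing import Callable

# The four routes named in this section.
ROUTES = ("info", "global_edit", "local_edit", "advanced")


def build_router_prompt(user_prompt: str) -> str:
    """Ask the model to answer with exactly one route name."""
    return (
        "Classify the image-editing request into one of: "
        + ", ".join(ROUTES)
        + ". Reply with the route name only.\n"
        + f"Request: {user_prompt}"
    )


def parse_route(reply: str, default: str = "info") -> str:
    """Normalize the model reply; fall back to `default` if unrecognized."""
    cleaned = reply.strip().lower().strip("`'\". ")
    return cleaned if cleaned in ROUTES else default


def route(user_prompt: str, ask_model: Callable[[str], str]) -> str:
    """`ask_model` is a hypothetical stand-in for the Gemini API call."""
    return parse_route(ask_model(build_router_prompt(user_prompt)))
```

Passing the model call in as a function keeps the parsing logic testable without network access, for example `route("make it brighter", lambda p: "global_edit")`.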

Info Agent

Analyzes images to provide detailed information about content and characteristics
Generates concise descriptions (under 150 words) in a single paragraph format
Identifies objects, scenes, colors, lighting conditions, and other visual elements
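
One way to enforce the length and format constraints above is to post-process the model's reply. The sketch below is an assumption about how that could be done, not the project's code; `to_single_paragraph` is a hypothetical helper, and truncating at a word boundary is a crude fallback compared with asking the model to respect the limit.

```python
MAX_WORDS = 150  # descriptions should stay under this many words


def to_single_paragraph(text: str, max_words: int = MAX_WORDS) -> str:
    """Collapse all whitespace into one paragraph with fewer than max_words words."""
    words = text.split()
    if len(words) >= max_words:
        # "Under 150 words" means at most max_words - 1 words survive.
        words = words[: max_words - 1]
    return " ".join(words)
```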

Global Edit Agent

Performs whole-image adjustments such as brightness, contrast, and saturation
Handles color temperature, sharpness, and other global parameters
Applies filters and overall image enhancements
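
The arithmetic behind these global adjustments can be illustrated per pixel in plain Python. This is a simplified sketch under stated assumptions: the function names are hypothetical, contrast pivots on a fixed mid-gray of 128, and color temperature is modeled as a simple red/blue shift. A real implementation would more likely use an imaging library over whole arrays.

```python
def _clamp(value: float) -> int:
    """Round and clamp a channel value to the 0-255 range."""
    return max(0, min(255, round(value)))


def adjust_brightness(pixel, factor):
    """Scale every channel; factor 1.0 leaves the pixel unchanged."""
    return tuple(_clamp(c * factor) for c in pixel)


def adjust_contrast(pixel, factor):
    """Scale each channel's distance from mid-gray (128)."""
    return tuple(_clamp(128 + (c - 128) * factor) for c in pixel)


def adjust_saturation(pixel, factor):
    """Blend between the pixel's gray value (factor 0) and its color (factor 1)."""
    r, g, b = pixel
    gray = 0.299 * r + 0.587 * g + 0.114 * b  # Rec. 601 luma weights
    return tuple(_clamp(gray + (c - gray) * factor) for c in pixel)


def adjust_temperature(pixel, shift):
    """Positive shift warms (more red, less blue); negative shift cools."""
    r, g, b = pixel
    return (_clamp(r + shift), g, _clamp(b - shift))
```

Each function takes an `(r, g, b)` tuple of 0-255 integers, so the adjustments can be chained, for example `adjust_contrast(adjust_brightness(pixel, 1.2), 1.1)`.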

Local Edit Agent

Performs targeted edits on specific objects or regions
Handles object detection, selection, and manipulation
Supports inpainting for object removal and replacement
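
The core of a targeted edit is restricting changes to a masked region. The sketch below shows that idea on nested lists of pixels; it assumes the detector returns a `(left, top, right, bottom)` bounding box and that an edited version of the image is already available. `box_to_mask` and `composite` are hypothetical names, and the inpainting model itself is out of scope here.

```python
def box_to_mask(width, height, box):
    """Return a height x width grid of booleans that is True inside `box`.

    `box` is (left, top, right, bottom) with right and bottom exclusive.
    """
    left, top, right, bottom = box
    return [
        [left <= x < right and top <= y < bottom for x in range(width)]
        for y in range(height)
    ]


def composite(original, edited, mask):
    """Take pixels from `edited` where the mask is True, else from `original`."""
    return [
        [e if m else o for o, e, m in zip(o_row, e_row, m_row)]
        for o_row, e_row, m_row in zip(original, edited, mask)
    ]
```

Compositing through the mask guarantees that pixels outside the selected region are left exactly as they were in the original image.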
