OpenAI Image Generation & Extraction

This repository demonstrates how to use OpenAI's Response API (with GPT-4o, and gpt-image-1 via Azure OpenAI or OpenAI) to:

Generate images from a text prompt
Generate images from an existing image and a prompt (image-to-image)
Extract the main product image URL from an e-commerce product page

It provides both Python and TypeScript implementations, returning structured outputs for easy integration.

Features

Generate images from a prompt using GPT-4o, or gpt-image-1
Generate images from an image and a prompt (image-to-image)
Extracts the main product image URL from a given e-commerce product page
Uses OpenAI tool-calling to orchestrate the extraction and generation process
Returns a structured response with the image URL, source URL, and extraction/generation method
Supports both Python and TypeScript (Node.js) environments

How it Works

For generation: The script sends a prompt (and optionally an image) to the OpenAI/Azure OpenAI API to generate a new image.
The result is returned in a structured format, including the extraction or generation method used.

Flow Diagram

flowchart TD
    A[User provides product URL or prompt (+ optional image)] --> B(OpenAI Response API: tool calling)
    B --> C{Tool: extract_image_url or generate_image}
    C --> D[Fetch product page HTML or send prompt/image to API]
    D --> E{Extraction or Generation strategies}
    E -->|og:image| F[Extract from Open Graph meta tag]
    E -->|product__media| G[Extract from product__media class]
    E -->|first_img| H[Extract first <img> tag]
    E -->|prompt| L[Generate from prompt]
    E -->|image+prompt| M[Generate from image and prompt]
    F & G & H & L & M --> I[Return structured output]
    I --> J[OpenAI returns structured JSON]
    J --> K[User receives image_url, source_url, found_method]

Example Output

{
  "image_url": "http://minecraftshop.com/cdn/shop/files/MINE-PLU1_R_MF_1200x1200.jpg?v=1729532536",
  "source_url": "https://minecraftshop.com/collections/plush/products/minecraft-goat-8-plush",
  "found_method": "og:image"
}

Requirements

Python 3.8+ (for Python version)
Node.js 18+ (for TypeScript version)
Access to Azure OpenAI or OpenAI API (with appropriate keys and endpoint)

Setup

1. Clone the repository

git clone <repo-url>
cd get-item-image

2. Environment Variables

Copy .env.sample to .env and fill in your API keys and endpoints:

cp .env.sample .env

Edit .env and set:

AZURE_OPENAI_V1_API_ENDPOINT
AZURE_OPENAI_API_KEY
AZURE_OPENAI_API_MODEL

3. Install Dependencies

Python/.py

pip install -r requirements.txt
# or manually:
pip install openai python-dotenv beautifulsoup4 requests

TypeScript/Node.js

npm install

Usage

Python

python get_item_image.py "https://minecraftshop.com/collections/plush/products/minecraft-goat-8-plush"
# or for image generation:
python get_item_image.py --generate --prompt "A cat in a spacesuit"
python get_item_image.py --generate --prompt "A cat in a spacesuit" --image path/to/input.jpg

TypeScript

npx ts-node get_item_image.ts "https://minecraftshop.com/collections/plush/products/minecraft-goat-8-plush"
# or for image generation:
npx ts-node get_item_image.ts --generate --prompt "A cat in a spacesuit"
npx ts-node get_item_image.ts --generate --prompt "A cat in a spacesuit" --image path/to/input.jpg

File Overview

get_item_image.py — Python implementation
get_item_image.ts — TypeScript/Node.js implementation
.env.sample — Example environment variable file
demo.gif — Demo animation

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
.env.sample		.env.sample
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
aoai_responses_gpt_41.py		aoai_responses_gpt_41.py
aoai_responses_structured.py		aoai_responses_structured.py
aoai_responses_structured.ts		aoai_responses_structured.ts
get_item_image.py		get_item_image.py
get_item_image.ts		get_item_image.ts
package-lock.json		package-lock.json
package.json		package.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

OpenAI Image Generation & Extraction

Features

How it Works

Flow Diagram

Example Output

Requirements

Setup

1. Clone the repository

2. Environment Variables

3. Install Dependencies

Python/.py

TypeScript/Node.js

Usage

Python

TypeScript

File Overview

License

About

Uh oh!

Releases

Packages

Languages

License

john-carroll-sw/generating-images-with-gpt-image-1

Folders and files

Latest commit

History

Repository files navigation

OpenAI Image Generation & Extraction

Features

How it Works

Flow Diagram

Example Output

Requirements

Setup

1. Clone the repository

2. Environment Variables

3. Install Dependencies

Python/.py

TypeScript/Node.js

Usage

Python

TypeScript

File Overview

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages