Text-to-Image Generator Prototype

Introduction

This project is a simple text-to-image generator prototype built using Streamlit for the user interface and the FLUX.1-schnell model from Gradio for the image generation backend.
Users can input text prompts, and the app generates AI-created images based on the prompt, which can be downloaded.
Tap below on Hugging Face for a simple application interface that generates images for given input prompts.

Features

- Allows users to input text prompts. - Generates an image based on the entered prompt. - Provides a download button for the generated image. - Simple progress bar indicating image generation status.

Installation

Prerequisites

Python 3.7+
Streamlit
Gradio Client
shutil and os are part of the Python standard library.

Install the requirements

pip install -r requirements.txt

Run the app

streamlit run app.py

Usage

Change save_dir in app.py file so that images will be saved directly to your directory
Run the app.py file using the instructions above.
Input a text prompt in the sidebar.
Click on the "Generate Image" button to start generating.
Wait for the progress bar to complete, and your generated image will appear on the screen.
Click the "Download Image" button to save the image locally.

Model Selection

I selected FLUX.1-schnell, an open-source model accessible via Gradio, due to its robust text-to-image capabilities and ease of integration into web-based applications. It is designed to efficiently convert text prompts into high-quality images, making it ideal for prototyping projects like this.

I opted for a pre-trained model rather than building one from scratch because:

Simplicity: FLUX.1-schnell offers an accessible API with ready-to-use image generation without needing extensive training.
Versatility: It handles a wide variety of input prompts and can generate detailed, aesthetically pleasing images.
Time efficiency: Given the assignment's deadline, using a pre-trained model minimizes setup time while ensuring high-quality results.

Integration

The integration process involved using Gradio's API to make predictions based on user input. The model's functionality was embedded within a Streamlit web interface for simplicity and ease of deployment.

Challenges

One of the challenges was ensuring that the app could handle multiple requests without latency. I addressed this by using lower inference steps (set to 4) to speed up image generation without compromising too much on quality.

Use Cases

Content Creation for Social Media Influencers

Use Case: Social media influencers, bloggers, or content creators can use the tool to generate visually engaging images based on a theme or topic. For example, they can create customized images for promotional posts, event announcements, or inspirational quotes.

Marketing and Advertising Campaigns

Use Case: Marketing teams can quickly generate high-quality visuals that align with their campaign themes. This can speed up the process of creating banners, posters, and advertisements without relying on graphic designers.

Art and Design Inspiration

Use Case: Artists and designers looking for inspiration or quick concept art can use the generator to visualize ideas. It can help kickstart the creative process by providing rough imagery based on descriptive input.

Educational Use

Use Case: Educators can use the tool to create engaging visual content for lessons, making abstract ideas more relatable and visually appealing for students.

Storybook Illustrations

Use Case: Authors or publishers of children's books could use the text-to-image generator to create illustrations based on the story content, cutting down on the time required to find or commission artwork.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Text-to-Image Generator Prototype

Introduction

Features

Installation

Usage

Model Selection

Integration

Challenges

Use Cases

Content Creation for Social Media Influencers

Marketing and Advertising Campaigns

Art and Design Inspiration

Educational Use

Storybook Illustrations

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

abeed04/Text_to_Image_generator

Folders and files

Latest commit

History

Repository files navigation

Text-to-Image Generator Prototype

Introduction

Features

Installation

Usage

Model Selection

Integration

Challenges

Use Cases

Content Creation for Social Media Influencers

Marketing and Advertising Campaigns

Art and Design Inspiration

Educational Use

Storybook Illustrations

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages