Image Colorization using GANs (Advanced computer vision project)

This project provides an implementation of Image colorization using GANs (Generative Adversarial Networks), specifically using a U-Net architecture for the generator and a PatchCNN for the discriminator. The project is constructed in Python with PyTorch as the deep learning framework, and PyTorch-Candle for efficient training process.

Project structure

Following are the main components:

notebook.ipynb: This is the Jupyter notebook file containing the model training implementation and execution steps.
README.md: The file you are currently reading which contains detailed explanation about the project and the implementation.
requirements.txt: It contains the list of required libraries and dependencies for the project.
callbacks.py: It contains custom callback classes for saving images and checkpoints during the training process.
models.py: It houses the neural network models used in this project.
app.py: Streamlit web application for image colorization.
utils/: Directory containing utility functions and helper classes:

Web Interface

The project includes a user-friendly web interface built with Streamlit that allows users to:

Upload grayscale images in common formats (PNG, JPG, JPEG)
View the original image in real-time
Colorize images with a single click
Download the colorized results

To run the web interface:

streamlit run app.py

Components

Training

During the training, the discriminator and the generator are trained alternately in the defined number of epochs. The progress of the training can be tracked and visualized.

Callbacks

There are two callbacks employed:

ImageSaver: This callback is used to save the generated images after each epoch during training. As a result, we can review the progress of the generator in creating images.
CheckpointSaver: This callback is invoked to save the model after every specific number of steps. This can be particularly helpful during long training processes or when training is expected to be interrupted.

Models

This project comprises of two main models, namely a generator and a discriminator model.

Unet: The generator model used is a U-Net model. This model is particularly effective for image-to-image translation problems.
PatchDiscriminator: The discriminator is implementing a PatchGAN style, which operates over patches of the image, rather than the whole image. It helps to make the model faster and more efficient.
ImageColorizationModel: The main model class holding the generator and discriminator models.

Dataset

The data used for training this project is available on kaggle.

Usage

In order to run the project, you need to have Python installed on your system. You can install the required libraries using the following command:

pip install -r requirements.txt

The training code is contained in the notebook.ipynb Jupyter notebook file. You can run the notebook in a Jupyter notebook environment or on cloud platforms like Google Colab. The implementation on Kaggle is available here: Image Colorization on Kaggle

Using the Web Interface

Install the required dependencies:

pip install streamlit pillow

Run the Streamlit app:

streamlit run app.py

Open your web browser and navigate to the provided local URL
Upload a grayscale image and click the "Colorize Image" button
Download the colorized result using the download button

Results

(After 45 epochs)

Inspired from article: Colorizing black & white images with U-Net and conditional GAN — A Tutorial

by Moein Shariatnia

Future Objectives

The project's current configuration parameters, such as the number of epochs and batch size, are tailored for learning and experimentation. These parameters may require further tuning based on the specific dataset and computational resources available. To achieve superior results, future work could include:

Advanced Techniques: Incorporating more sophisticated techniques to enhance the model's accuracy and efficiency.
Larger Datasets: Utilizing more extensive datasets to improve the generalization and robustness of the model.
Extended Training Times: Allowing for longer training periods to enable the model to learn more complex patterns and features.
UI Enhancements: Adding features like batch processing, additional image preprocessing options, and different colorization styles.
API Development: Creating a REST API to allow integration with other applications.

This project serves as a flexible foundation, open to modifications and enhancements. Users are encouraged to adapt and expand upon the provided code to explore the vast possibilities of conditional GANs and image colorization, aiming for innovative solutions and improved outcomes.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Image Colorization using GANs (Advanced computer vision project)

Project structure

Web Interface

Components

Training

Callbacks

Models

Dataset

Usage

Using the Web Interface

Results

(After 45 epochs)

Inspired from article: Colorizing black & white images with U-Net and conditional GAN — A Tutorial

Future Objectives

About

Uh oh!

Releases 1

Packages

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
style		style
utils		utils
.gitignore		.gitignore
README.md		README.md
app.py		app.py
generate.ipynb		generate.ipynb
notebook.ipynb		notebook.ipynb
requirements.txt		requirements.txt

paraglondhe098/image-colorization

Folders and files

Latest commit

History

Repository files navigation

Image Colorization using GANs (Advanced computer vision project)

Project structure

Web Interface

Components

Training

Callbacks

Models

Dataset

Usage

Using the Web Interface

Results

(After 45 epochs)

Inspired from article: Colorizing black & white images with U-Net and conditional GAN — A Tutorial

Future Objectives

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Languages

Packages