Repository for Image SR Project for the NITK Chapter of ACM, 2023-2024

Abstract

We leverage a GAN-based architecture to tackle the issue of constructing high-resolution images from low-resolution images while preserving image structure and simultaneously improving image fidelity and sharpness.

Mentors:

Santosh C
Prasanna Kumar

Mentees:

Aaryan Patil
Arnav Santosh

Key Points:

SRGAN trained on an image dataset consisting of more than 6000 high resolution and low resolution pairs of training images alongside 273 corresponding pairs of validation images (publicly available here.)
Performs data augmentation to the training images
Includes dataset loading, model architecture and the training
Includes an easy to use upscale function to easily upscale any image given its path

Report:

Introduction

Classical image enhancement techniques such as nearest-neighbor, bilinear and bicubic interpolation produce poor results for upscaling given images, often leading to blurred edges and low image contrast. So in recent years, through the general development of generative models, artificial intelligence has applied itself naturally to the task of image super-resolution.

Generative Adversarial Networks rose to popularity around five years after their introduction in a research paper published in 2014, eventually seeing a rise in popularity in 2019. In recent years, the trend has begun to shift into using diffusion-based models which offer more stability and higher quality outputs in classical image processing tasks, primarily image upscaling.

Nevertheless, this project looks to explore and apply known deep learning techniques that have been known to achieve superior visual fidelity and sharpness in upscaled images and videos using state-of-the-art algorithms such as Generative Adversarial Networks and other deep learning models (such as RNNs) for the task of Super Resolution.

Method

To perform SISO-SR (Single-Image-Single-Output Super Resolution), we use a GAN with a generator and discriminator architecture as shown below.

Fig 1. Generator

Fig 2. Discriminator

The main block of the generator lies in the list of residual blocks connected sequentially. Upscaling is done via Pixel Shuffling layers. Bicubically upscaling the original input image as a baseline for the output allows for color-correct output images. Optional post-processing such as sharpening can be done.

Results:

Fig 3. Website developed for the model using Flask.

Fig 4. Example output image. Note the high SSIM value.

The SRGAN model was tested using a completely new dataset consisting of about 76 images, of various objects, animals, places and people. The model was then fed the test dataset and the evaluation metrics (PSNR and SSIM) were calculated and printed between the upscaled image and the original high resolution image.

Average PSNR value: 23.048
Average SSIM value: 0.9438

Note that: An SSIM (Structural Similarity Index Measure) closer to 1 is considered perfect. A higher PSNR indicates a higher quality reconstruction but studies have shown that it is a poor indicator compared to other metrics

Conclusion:

SRGAN was successfully implemented with results surpassing that of the original research paper on which the project was based owing to custom modifications made. With a larger dataset and more training time, the model can be deployed for use in commercial image editing software.

Technologies/Libraries/Frameworks used:

Numpy
PyTorch
Matplotlib
Pillow
Flask
HTML
CSS

Name		Name	Last commit message	Last commit date
Latest commit History 52 Commits
GUI		GUI
results		results
.gitattributes		.gitattributes
LICENSE		LICENSE
README.md		README.md
RESOURCES.md		RESOURCES.md
SRGAN-Modified.ipynb		SRGAN-Modified.ipynb
SRGAN_M_5.pth		SRGAN_M_5.pth

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Repository for Image SR Project for the NITK Chapter of ACM, 2023-2024

Abstract

Mentors:

Mentees:

Key Points:

Report:

Introduction

Method

Results:

Conclusion:

Technologies/Libraries/Frameworks used:

Associated links:

References:

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

License

doobiusP/Single_Image_Super_Resolution

Folders and files

Latest commit

History

Repository files navigation

Repository for Image SR Project for the NITK Chapter of ACM, 2023-2024

Abstract

Mentors:

Mentees:

Key Points:

Report:

Introduction

Method

Results:

Conclusion:

Technologies/Libraries/Frameworks used:

Associated links:

References:

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages