Recognizing an image

Humans take no effort to distinguish a dog, cat, or flying saucer. But this process is quite difficult for a computer to emulate: it only looks easy because God designs our brains incredibly well to recognize images. One common example of image recognition with machine learning is optical character recognition. In this project , I am building an Image Recognition model with Machine Learning using PyTorch.

For the image recognition task, the TorchVision package is used which contains some of the best performing neural network architectures for computer vision, such as AlexNet. It also provides easy access to datasets like ImageNet and other utilities to learn about computer vision applications in PyTorch.

The predefined models can be found in torchvision.models:

from torchvision import models
dir(models)

AlexNet

To run the AlexNet architecture on an input image, we can create an instance of the AlexNet class. Here’s how to do it:

alexnet = models.AlexNet()

At this stage, alexnet is an object that runs the AlexNet architecture. It is not essential for us to understand the details of this architecture at this time. At the moment, AlexNet is just an opaque object that can be called as a function.

By providing alexnet with precisely sized input data, we will perform a direct transfer across the network. In other words, the input will go through the first set of neurons, the outputs of which will be transmitted to the next set of neurons, until the final output.

ResNet

By using the resnet101 method, we can now instantiate a 101-layer convolutional neural network.

resnet = models.resnet101(pretrained=True)
resnet

Image Recognition

Now we can use an image for the image recognition task using our model. I took a picture of a dog and a cat too . We can start by loading an image from the local filesystem using Pillow, an image manipulation module for Python:

from google.colab import files
uploaded = files.upload()
from PIL import Image
img = Image.open("dog.png")
img

Run The Image Recognition Model

By running the Image Recognition Model ,we get the following result:-

"golden retriever", 96.29334259033203

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
Image recognition with ML.ipynb		Image recognition with ML.ipynb
README.md		README.md
cat.png		cat.png
dog.png		dog.png
imagenet_classes.txt		imagenet_classes.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Recognizing an image

AlexNet

ResNet

Image Recognition

Run The Image Recognition Model

About

Uh oh!

Releases

Packages

Languages

Brijesh03032001/Recognizing-an-image-

Folders and files

Latest commit

History

Repository files navigation

Recognizing an image

AlexNet

ResNet

Image Recognition

Run The Image Recognition Model

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages