max-danenhower/geolocation_bot

Python notebooks to embed street view images with StreetCLIP and use those embeddings to predict the geolocation of street view images.

The training dataset is from Kaggle (https://www.kaggle.com/datasets/ayuseless/streetview-image-dataset). The row indexes in coordinates.csv correspond to the image filenames in the Kaggle Streetview_Images_Dataset folder (i.e. 0.png corresponds to the 0th row in coordinates.csv).
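The index convention above can be sketched as follows. This is a minimal illustration, not code from the notebooks: the column names `latitude` and `longitude` are assumptions about the layout of coordinates.csv, and `load_coordinates` and `image_path_for_row` are hypothetical helper names. The sample rows are made-up values standing in for the real file.

```python
import csv
import io
from pathlib import Path


def load_coordinates(csv_file):
    """Return a list of (lat, lon) tuples, one per data row, in file order.

    Assumes header columns named 'latitude' and 'longitude' (an assumption;
    adjust to the actual header of coordinates.csv).
    """
    reader = csv.DictReader(csv_file)
    return [(float(row["latitude"]), float(row["longitude"])) for row in reader]


def image_path_for_row(index, image_dir="Streetview_Images_Dataset"):
    """Map a row index to its image file: row 0 -> 0.png, row 1 -> 1.png, ..."""
    return Path(image_dir) / f"{index}.png"


# Made-up sample data standing in for coordinates.csv.
sample = io.StringIO("latitude,longitude\n48.8584,2.2945\n40.6892,-74.0445\n")
coords = load_coordinates(sample)

# Pair every image path with the coordinates stored at the same index.
pairs = [(image_path_for_row(i), c) for i, c in enumerate(coords)]
```

With the real file, `load_coordinates(open("coordinates.csv", newline=""))` would take the place of the in-memory sample.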

training_image_distribution.png shows the distribution of image locations from the training (Kaggle) dataset.

image_embeddings.npy contains the StreetCLIP image embeddings of each image in the training dataset. Code to regenerate these embeddings is in generate_streetCLIP_embeddings.ipynb. The indexes of image_embeddings.npy correspond to the indexes in coordinates.csv.
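Because the embeddings and coordinates share indexes, one simple way to use them together is a nearest-neighbour lookup: find the training embedding most similar to a query embedding and return the coordinates at the same index. The sketch below is only an illustration of that idea, not necessarily the approach taken in the notebooks. `predict_nearest` is a hypothetical helper, and the random arrays are synthetic stand-ins for the real embeddings and coordinates.

```python
import numpy as np


def predict_nearest(query, train_embeddings, train_coords):
    """Return the (lat, lon) of the training row most similar to `query`.

    Similarity is cosine similarity; row i of `train_embeddings` must
    describe the same image as row i of `train_coords`.
    """
    # Normalise so that the dot product equals cosine similarity.
    train = train_embeddings / np.linalg.norm(train_embeddings, axis=1, keepdims=True)
    q = query / np.linalg.norm(query)
    sims = train @ q
    best = int(np.argmax(sims))
    lat, lon = train_coords[best]
    return (float(lat), float(lon)), float(sims[best])


# Synthetic stand-ins: 5 "images" with 8-dimensional embeddings.
rng = np.random.default_rng(0)
train_embeddings = rng.normal(size=(5, 8))
train_coords = np.array([
    [48.86, 2.29],
    [40.69, -74.04],
    [35.68, 139.69],
    [-33.87, 151.21],
    [51.50, -0.12],
])

# A query that is a slightly perturbed copy of training row 2.
query = train_embeddings[2] + 0.01 * rng.normal(size=8)
coords, similarity = predict_nearest(query, train_embeddings, train_coords)
```

With the real data, `np.load("image_embeddings.npy")` would replace the synthetic `train_embeddings`, and the coordinates would be read from coordinates.csv in the same row order.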

The dataset used for testing the models is from HuggingFace: https://huggingface.co/datasets/stochastic/random_streetview_images_pano_v0.0.2
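A common way to score coordinate predictions on a test set like this is the great-circle distance between the predicted and true locations. The following sketch uses the haversine formula with a mean Earth radius of 6371 km. Whether the notebooks use this metric is an assumption, and `haversine_km` is a hypothetical helper name.

```python
import math


def haversine_km(lat1, lon1, lat2, lon2, radius_km=6371.0):
    """Great-circle distance in kilometres between two (lat, lon) points in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlambda = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlambda / 2) ** 2
    return 2 * radius_km * math.asin(math.sqrt(a))


# Example: error if the model predicts London for an image taken in Paris.
error_km = haversine_km(48.8566, 2.3522, 51.5074, -0.1278)
```

Averaging or taking the median of this error over all test images gives a single summary number per model.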
