Reinforcement learning fine-tuning of a VLM for image-to-coordinates geolocation
This project relies on the publicly available OpenStreetView-5M (osv5m) dataset for training and testing [1].
[1] Guillaume Astruc et al., “OpenStreetView-5M: The Many Roads to Global Visual Geolocation,” CVPR 2024.