Redirecting to original paper in 30 seconds...

Click below to go immediately or wait for automatic redirect

arxiv_ml 85% Match Research Paper Computer vision researchers,Geospatial analysts,Developers of mapping and navigation systems 1 week ago

Scaling Image Geo-Localization to Continent Level

computer-vision › 3d-vision
📄 Abstract

Abstract: Determining the precise geographic location of an image at a global scale remains an unsolved challenge. Standard image retrieval techniques are inefficient due to the sheer volume of images (>100M) and fail when coverage is insufficient. Scalable solutions, however, involve a trade-off: global classification typically yields coarse results (10+ kilometers), while cross-view retrieval between ground and aerial imagery suffers from a domain gap and has been primarily studied on smaller regions. This paper introduces a hybrid approach that achieves fine-grained geo-localization across a large geographic expanse the size of a continent. We leverage a proxy classification task during training to learn rich feature representations that implicitly encode precise location information. We combine these learned prototypes with embeddings of aerial imagery to increase robustness to the sparsity of ground-level data. This enables direct, fine-grained retrieval over areas spanning multiple countries. Our extensive evaluation demonstrates that our approach can localize within 200m more than 68\% of queries of a dataset covering a large part of Europe. The code is publicly available at https://scaling-geoloc.github.io.
Authors (7)
Philipp Lindenberger
Paul-Edouard Sarlin
Jan Hosang
Matteo Balice
Marc Pollefeys
Simon Lynen
+1 more
Submitted
October 30, 2025
arXiv Category
cs.CV
arXiv PDF

Key Contributions

Introduces a hybrid approach for fine-grained image geo-localization at a continental scale, overcoming the limitations of traditional methods. It leverages a proxy classification task to learn rich location-encoding features and combines them with aerial imagery embeddings to handle sparse ground-level data and bridge the domain gap.

Business Value

Enables precise location identification for vast amounts of imagery, crucial for applications like autonomous navigation, disaster response, and urban planning, by making global-scale geo-localization practical and accurate.