arxiv_ml 85% Match Research Paper Computer vision researchers,Geospatial analysts,Developers of mapping and navigation systems 1 week ago

Scaling Image Geo-Localization to Continent Level

computer-vision › 3d-vision

📄 Abstract

Abstract: Determining the precise geographic location of an image at a global scale remains an unsolved challenge. Standard image retrieval techniques are inefficient due to the sheer volume of images (>100M) and fail when coverage is insufficient. Scalable solutions, however, involve a trade-off: global classification typically yields coarse results (10+ kilometers), while cross-view retrieval between ground and aerial imagery suffers from a domain gap and has been primarily studied on smaller regions. This paper introduces a hybrid approach that achieves fine-grained geo-localization across a large geographic expanse the size of a continent. We leverage a proxy classification task during training to learn rich feature representations that implicitly encode precise location information. We combine these learned prototypes with embeddings of aerial imagery to increase robustness to the sparsity of ground-level data. This enables direct, fine-grained retrieval over areas spanning multiple countries. Our extensive evaluation demonstrates that our approach can localize within 200m more than 68\% of queries of a dataset covering a large part of Europe. The code is publicly available at https://scaling-geoloc.github.io.

Authors (7)

Philipp Lindenberger

Paul-Edouard Sarlin

Jan Hosang

Matteo Balice

Marc Pollefeys

Simon Lynen

+1 more

Submitted

October 30, 2025

arXiv Category

cs.CV

arXiv PDF

Key Contributions

Introduces a hybrid approach for fine-grained image geo-localization at a continental scale, overcoming the limitations of traditional methods. It leverages a proxy classification task to learn rich location-encoding features and combines them with aerial imagery embeddings to handle sparse ground-level data and bridge the domain gap.

Business Value

Enables precise location identification for vast amounts of imagery, crucial for applications like autonomous navigation, disaster response, and urban planning, by making global-scale geo-localization practical and accurate.

Paper Metadata

Innovation Type

Algorithmic / Hybrid Approach

Deployment Feasibility

Moderate to High, depending on the availability of large-scale aerial and ground-level imagery datasets and computational resources for feature extraction.

Limitations Addressed

Addresses the inefficiency of standard image retrieval for massive datasets, the coarse results of global classification, and the domain gap between ground and aerial imagery in cross-view retrieval.

Technical Tags

image geo-localizationglobal scaleimage retrievalclassificationfeature representationaerial imageryground-level datadomain gap

Research Topics

Computer VisionGeographic Information SystemsImage RetrievalMachine Learning

Methods & Architectures

Proxy classification taskFeature embeddingPrototype learningHybrid approach (classification + retrieval)

Applications & Tasks

Geospatial Intelligence Mapping Autonomous Driving Surveillance Image Geo-localizationLarge-scale Image Retrieval Fine-grained geo-localization at continent levelRetrieving ground-level images using aerial imagery

Related Fields

Computer VisionGeographic Information Systems (GIS)Remote SensingMachine Learning

Keywords

geo-localizationimage retrievalcomputer visionlarge-scalecontinental scaleaerial imageryground-level imagerydomain adaptationfeature learningclassificationmapping

Academic Context

#Computer Vision#Geographic Information Systems#Image Retrieval#Machine Learning

Commercial Potential

Potential Products

Global mapping servicesAutonomous vehicle localization systemsGeospatial intelligence platforms

Target Industries

Mapping and NavigationAutonomous VehiclesDefense and IntelligenceReal Estate

Use Case Examples

Pinpointing the exact location of a street-view image anywhere on Earth.Matching satellite imagery to ground-level photos for surveillance.Improving the accuracy of GPS-denied navigation systems.

Competitive Edge

Achieves fine-grained geo-localization at a scale previously unaddressed, bridging the gap between coarse classification and limited-region retrieval.

Market Opportunity

Large market for geospatial data and location-based services.

Revenue Models

API accessdata licensingspecialized services.

Resource Requirements

Compute Needs

High (for training and large-scale inference)

Data Requirements

Requires large-scale datasets of geo-tagged ground-level and aerial imagery.

Deployment Constraints

Requires access to vast amounts of geo-referenced image data.

Scalability

Designed for continental-scale problems, implying good scalability.

Production Readiness

Maturity Level

Research

Time to Market

2-4 years

Patent Potential

Moderate

View Full Paper Back to Papers