Redirecting to original paper in 30 seconds...

Click below to go immediately or wait for automatic redirect

arxiv_cv 90% Match Research Paper Computer Vision Researchers,Robotics Engineers,AI Researchers,GIS Specialists,Autonomous Systems Developers 1 day ago

GraphGeo: Multi-Agent Debate Framework for Visual Geo-localization with Heterogeneous Graph Neural Networks

graph-neural-networks › multi-agent
📄 Abstract

Abstract: Visual geo-localization requires extensive geographic knowledge and sophisticated reasoning to determine image locations without GPS metadata. Traditional retrieval methods are constrained by database coverage and quality. Recent Large Vision-Language Models (LVLMs) enable direct location reasoning from image content, yet individual models struggle with diverse geographic regions and complex scenes. Existing multi-agent systems improve performance through model collaboration but treat all agent interactions uniformly. They lack mechanisms to handle conflicting predictions effectively. We propose \textbf{GraphGeo}, a multi-agent debate framework using heterogeneous graph neural networks for visual geo-localization. Our approach models diverse debate relationships through typed edges, distinguishing supportive collaboration, competitive argumentation, and knowledge transfer. We introduce a dual-level debate mechanism combining node-level refinement and edge-level argumentation modeling. A cross-level topology refinement strategy enables co-evolution between graph structure and agent representations. Experiments on multiple benchmarks demonstrate GraphGeo significantly outperforms state-of-the-art methods. Our framework transforms cognitive conflicts between agents into enhanced geo-localization accuracy through structured debate.
Authors (9)
Heng Zheng
Yuling Shi
Xiaodong Gu
Haochen You
Zijian Zhang
Lubin Gan
+3 more
Submitted
November 2, 2025
arXiv Category
cs.CV
arXiv PDF

Key Contributions

GraphGeo introduces a multi-agent debate framework for visual geo-localization using heterogeneous graph neural networks. It models diverse agent interactions (collaboration, competition, knowledge transfer) via typed edges and employs a dual-level debate mechanism to effectively handle conflicting predictions and improve location accuracy.

Business Value

Enhances the accuracy and reliability of location determination for autonomous systems and mapping services, crucial for navigation and situational awareness.