Redirecting to original paper in 30 seconds...
Click below to go immediately or wait for automatic redirect
📄 Abstract
Abstract: Panoramic imaging enables capturing 360{\deg} images with an ultra-wide
Field-of-View (FoV) for dense omnidirectional perception, which is critical to
applications, such as autonomous driving and augmented reality, etc. However,
current panoramic semantic segmentation methods fail to identify outliers, and
pinhole Out-of-distribution Segmentation (OoS) models perform unsatisfactorily
in the panoramic domain due to background clutter and pixel distortions. To
address these issues, we introduce a new task, Panoramic Out-of-distribution
Segmentation (PanOoS), with the aim of achieving comprehensive and safe scene
understanding. Furthermore, we propose the first solution, POS, which adapts to
the characteristics of panoramic images through text-guided prompt distribution
learning. Specifically, POS integrates a disentanglement strategy designed to
materialize the cross-domain generalization capability of CLIP. The proposed
Prompt-based Restoration Attention (PRA) optimizes semantic decoding by prompt
guidance and self-adaptive correction, while Bilevel Prompt Distribution
Learning (BPDL) refines the manifold of per-pixel mask embeddings via semantic
prototype supervision. Besides, to compensate for the scarcity of PanOoS
datasets, we establish two benchmarks: DenseOoS, which features diverse
outliers in complex environments, and QuadOoS, captured by a quadruped robot
with a panoramic annular lens system. Extensive experiments demonstrate
superior performance of POS, with AuPRC improving by 34.25% and FPR95
decreasing by 21.42% on DenseOoS, outperforming state-of-the-art pinhole-OoS
methods. Moreover, POS achieves leading closed-set segmentation capabilities
and advances the development of panoramic understanding. Code and datasets will
be available at https://github.com/MengfeiD/PanOoS.
Authors (8)
Mengfei Duan
Yuheng Zhang
Yihong Cao
Fei Teng
Kai Luo
Jiaming Zhang
+2 more
Key Contributions
Introduces the task of Panoramic Out-of-distribution Segmentation (PanOoS) and proposes POS, the first solution that adapts to panoramic image characteristics using text-guided prompt distribution learning. POS integrates a disentanglement strategy to leverage CLIP's cross-domain generalization and uses Prompt-based Restoration Attention (PRA) for optimization.
Business Value
Enhances the safety and reliability of autonomous driving systems by enabling better perception of the entire environment, including unusual or unexpected objects/scenarios, which is critical for regulatory approval and public trust.