Redirecting to original paper in 30 seconds...

Click below to go immediately or wait for automatic redirect

arxiv_ai 95% Match Research Paper Computer Vision Researchers,AR/VR Developers,Robotics Engineers 2 weeks ago

PPMStereo: Pick-and-Play Memory Construction for Consistent Dynamic Stereo Matching

computer-vision › video-understanding
📄 Abstract

Abstract: Temporally consistent depth estimation from stereo video is critical for real-world applications such as augmented reality, where inconsistent depth estimation disrupts the immersion of users. Despite its importance, this task remains challenging due to the difficulty in modeling long-term temporal consistency in a computationally efficient manner. Previous methods attempt to address this by aggregating spatio-temporal information but face a fundamental trade-off: limited temporal modeling provides only modest gains, whereas capturing long-range dependencies significantly increases computational cost. To address this limitation, we introduce a memory buffer for modeling long-range spatio-temporal consistency while achieving efficient dynamic stereo matching. Inspired by the two-stage decision-making process in humans, we propose a \textbf{P}ick-and-\textbf{P}lay \textbf{M}emory (PPM) construction module for dynamic \textbf{Stereo} matching, dubbed as \textbf{PPMStereo}. PPM consists of a `pick' process that identifies the most relevant frames and a `play' process that weights the selected frames adaptively for spatio-temporal aggregation. This two-stage collaborative process maintains a compact yet highly informative memory buffer while achieving temporally consistent information aggregation. Extensive experiments validate the effectiveness of PPMStereo, demonstrating state-of-the-art performance in both accuracy and temporal consistency. % Notably, PPMStereo achieves 0.62/1.11 TEPE on the Sintel clean/final (17.3\% \& 9.02\% improvements over BiDAStereo) with fewer computational costs. Codes are available at \textcolor{blue}{https://github.com/cocowy1/PPMStereo}.
Authors (7)
Yun Wang
Junjie Hu
Qiaole Dong
Yongjian Zhang
Yanwei Fu
Tin Lun Lam
+1 more
Submitted
October 23, 2025
arXiv Category
cs.CV
NeurIPS 2025
arXiv PDF

Key Contributions

Introduces PPMStereo, a novel approach for dynamic stereo matching that uses a Pick-and-Play Memory (PPM) module to achieve long-range spatio-temporal consistency efficiently. It addresses the trade-off between temporal modeling and computational cost.

Business Value

Enables more immersive and realistic AR/VR experiences by providing stable and accurate depth estimation. Crucial for applications requiring precise 3D understanding of dynamic environments.