Redirecting to original paper in 30 seconds...

Click below to go immediately or wait for automatic redirect

arxiv_cv 91% Match Research Paper Autonomous Driving Engineers,Robotics Researchers,Computer Vision Engineers,Real-time Systems Developers 2 weeks ago

4DSegStreamer: Streaming 4D Panoptic Segmentation via Dual Threads

computer-vision › video-understanding
📄 Abstract

Abstract: 4D panoptic segmentation in a streaming setting is critical for highly dynamic environments, such as evacuating dense crowds and autonomous driving in complex scenarios, where real-time, fine-grained perception within a constrained time budget is essential. In this paper, we introduce 4DSegStreamer, a novel framework that employs a Dual-Thread System to efficiently process streaming frames. The framework is general and can be seamlessly integrated into existing 3D and 4D segmentation methods to enable real-time capability. It also demonstrates superior robustness compared to existing streaming perception approaches, particularly under high FPS conditions. The system consists of a predictive thread and an inference thread. The predictive thread leverages historical motion and geometric information to extract features and forecast future dynamics. The inference thread ensures timely prediction for incoming frames by aligning with the latest memory and compensating for ego-motion and dynamic object movements. We evaluate 4DSegStreamer on the indoor HOI4D dataset and the outdoor SemanticKITTI and nuScenes datasets. Comprehensive experiments demonstrate the effectiveness of our approach, particularly in accurately predicting dynamic objects in complex scenes.
Authors (3)
Ling Liu
Jun Tian
Li Yi
Submitted
October 20, 2025
arXiv Category
cs.CV
arXiv PDF

Key Contributions

Introduces 4DSegStreamer, a novel framework employing a Dual-Thread System for efficient real-time 4D panoptic segmentation in streaming settings. It uses predictive and inference threads to forecast future dynamics and ensure timely perception, outperforming existing streaming approaches, especially under high FPS.

Business Value

Enables safer and more efficient operation of autonomous systems in complex, dynamic environments by providing real-time, comprehensive scene understanding.