Redirecting to original paper in 30 seconds...
Click below to go immediately or wait for automatic redirect
📄 Abstract
Abstract: Existing multi-view 3D object reconstruction methods heavily rely on
sufficient overlap between input views, where occlusions and sparse coverage in
practice frequently yield severe reconstruction incompleteness. Recent
advancements in diffusion-based 3D generative techniques offer the potential to
address these limitations by leveraging learned generative priors to
hallucinate invisible parts of objects, thereby generating plausible 3D
structures. However, the stochastic nature of the inference process limits the
accuracy and reliability of generation results, preventing existing
reconstruction frameworks from integrating such 3D generative priors. In this
work, we comprehensively analyze the reasons why diffusion-based 3D generative
methods fail to achieve high consistency, including (a) the insufficiency in
constructing and leveraging cross-view connections when extracting multi-view
image features as conditions, and (b) the poor controllability of iterative
denoising during local detail generation, which easily leads to plausible but
inconsistent fine geometric and texture details with inputs. Accordingly, we
propose ReconViaGen to innovatively integrate reconstruction priors into the
generative framework and devise several strategies that effectively address
these issues. Extensive experiments demonstrate that our ReconViaGen can
reconstruct complete and accurate 3D models consistent with input views in both
global structure and local details.Project page:
https://jiahao620.github.io/reconviagen.
Authors (9)
Jiahao Chang
Chongjie Ye
Yushuang Wu
Yuantao Chen
Yidan Zhang
Zhongjin Luo
+3 more
Submitted
October 27, 2025
Key Contributions
ReconViaGen addresses limitations in multi-view 3D reconstruction by integrating diffusion-based generative models. It improves cross-view connection utilization and controllability of denoising to leverage generative priors for hallucinating occluded parts, leading to more complete and accurate reconstructions.
Business Value
Enables creation of more complete and accurate 3D models from limited or occluded views, valuable for AR/VR content creation, robotics, and digital twins.