Redirecting to original paper in 30 seconds...

Click below to go immediately or wait for automatic redirect

arxiv_cv 95% Match Research Paper Game Developers,VR/AR Engineers,Simulation Designers,3D Artists 1 week ago

WorldGrow: Generating Infinite 3D World

computer-vision › 3d-vision
📄 Abstract

Abstract: We tackle the challenge of generating the infinitely extendable 3D world -- large, continuous environments with coherent geometry and realistic appearance. Existing methods face key challenges: 2D-lifting approaches suffer from geometric and appearance inconsistencies across views, 3D implicit representations are hard to scale up, and current 3D foundation models are mostly object-centric, limiting their applicability to scene-level generation. Our key insight is leveraging strong generation priors from pre-trained 3D models for structured scene block generation. To this end, we propose WorldGrow, a hierarchical framework for unbounded 3D scene synthesis. Our method features three core components: (1) a data curation pipeline that extracts high-quality scene blocks for training, making the 3D structured latent representations suitable for scene generation; (2) a 3D block inpainting mechanism that enables context-aware scene extension; and (3) a coarse-to-fine generation strategy that ensures both global layout plausibility and local geometric/textural fidelity. Evaluated on the large-scale 3D-FRONT dataset, WorldGrow achieves SOTA performance in geometry reconstruction, while uniquely supporting infinite scene generation with photorealistic and structurally consistent outputs. These results highlight its capability for constructing large-scale virtual environments and potential for building future world models.
Authors (9)
Sikuang Li
Chen Yang
Jiemin Fang
Taoran Yi
Jia Lu
Jiazhong Cen
+3 more
Submitted
October 24, 2025
arXiv Category
cs.CV
arXiv PDF

Key Contributions

WorldGrow introduces a hierarchical framework for generating infinitely extendable 3D worlds by leveraging pre-trained 3D foundation models. It addresses limitations of existing methods by focusing on structured scene block generation, employing a data curation pipeline, a 3D block inpainting mechanism for context-aware extension, and a coarse-to-fine generation strategy to ensure coherent geometry and realistic appearance.

Business Value

Enables the creation of vast, detailed, and consistent virtual environments for gaming, VR/AR experiences, and simulations, reducing manual effort and increasing immersion.