Redirecting to original paper in 30 seconds...
Click below to go immediately or wait for automatic redirect
📄 Abstract
Abstract: We tackle the challenge of generating the infinitely extendable 3D world --
large, continuous environments with coherent geometry and realistic appearance.
Existing methods face key challenges: 2D-lifting approaches suffer from
geometric and appearance inconsistencies across views, 3D implicit
representations are hard to scale up, and current 3D foundation models are
mostly object-centric, limiting their applicability to scene-level generation.
Our key insight is leveraging strong generation priors from pre-trained 3D
models for structured scene block generation. To this end, we propose
WorldGrow, a hierarchical framework for unbounded 3D scene synthesis. Our
method features three core components: (1) a data curation pipeline that
extracts high-quality scene blocks for training, making the 3D structured
latent representations suitable for scene generation; (2) a 3D block inpainting
mechanism that enables context-aware scene extension; and (3) a coarse-to-fine
generation strategy that ensures both global layout plausibility and local
geometric/textural fidelity. Evaluated on the large-scale 3D-FRONT dataset,
WorldGrow achieves SOTA performance in geometry reconstruction, while uniquely
supporting infinite scene generation with photorealistic and structurally
consistent outputs. These results highlight its capability for constructing
large-scale virtual environments and potential for building future world
models.
Authors (9)
Sikuang Li
Chen Yang
Jiemin Fang
Taoran Yi
Jia Lu
Jiazhong Cen
+3 more
Submitted
October 24, 2025
Key Contributions
WorldGrow introduces a hierarchical framework for generating infinitely extendable 3D worlds by leveraging pre-trained 3D foundation models. It addresses limitations of existing methods by focusing on structured scene block generation, employing a data curation pipeline, a 3D block inpainting mechanism for context-aware extension, and a coarse-to-fine generation strategy to ensure coherent geometry and realistic appearance.
Business Value
Enables the creation of vast, detailed, and consistent virtual environments for gaming, VR/AR experiences, and simulations, reducing manual effort and increasing immersion.