Redirecting to original paper in 30 seconds...

Click below to go immediately or wait for automatic redirect

arxiv_cv 85% Match Research Paper 3D artists,Game developers,Researchers in generative AI,Computer graphics engineers 17 hours ago

3DBonsai: Structure-Aware Bonsai Modeling Using Conditioned 3D Gaussian Splatting

generative-ai › diffusion
📄 Abstract

Abstract: Recent advancements in text-to-3D generation have shown remarkable results by leveraging 3D priors in combination with 2D diffusion. However, previous methods utilize 3D priors that lack detailed and complex structural information, limiting them to generating simple objects and presenting challenges for creating intricate structures such as bonsai. In this paper, we propose 3DBonsai, a novel text-to-3D framework for generating 3D bonsai with complex structures. Technically, we first design a trainable 3D space colonization algorithm to produce bonsai structures, which are then enhanced through random sampling and point cloud augmentation to serve as the 3D Gaussian priors. We introduce two bonsai generation pipelines with distinct structural levels: fine structure conditioned generation, which initializes 3D Gaussians using a 3D structure prior to produce detailed and complex bonsai, and coarse structure conditioned generation, which employs a multi-view structure consistency module to align 2D and 3D structures. Moreover, we have compiled a unified 2D and 3D Chinese-style bonsai dataset. Our experimental results demonstrate that 3DBonsai significantly outperforms existing methods, providing a new benchmark for structure-aware 3D bonsai generation.

Key Contributions

3DBonsai introduces a novel text-to-3D framework specifically designed for generating complex 3D bonsai structures, which are challenging for existing methods. It achieves this by employing a trainable 3D space colonization algorithm to create structural priors and enhancing them with Gaussian splatting, enabling detailed and intricate generation.

Business Value

Enables the creation of highly detailed and complex 3D models for applications like virtual environments, game development, and digital art, potentially reducing manual modeling effort.