Redirecting to original paper in 30 seconds...

Click below to go immediately or wait for automatic redirect

arxiv_cv 95% Match Research Paper 3D Artists,Game Developers,Researchers in Generative AI,Computer Graphics Engineers 1 week ago

From One to More: Contextual Part Latents for 3D Generation

generative-ai › diffusion
📄 Abstract

Abstract: Recent advances in 3D generation have transitioned from multi-view 2D rendering approaches to 3D-native latent diffusion frameworks that exploit geometric priors in ground truth data. Despite progress, three key limitations persist: (1) Single-latent representations fail to capture complex multi-part geometries, causing detail degradation; (2) Holistic latent coding neglects part independence and interrelationships critical for compositional design; (3) Global conditioning mechanisms lack fine-grained controllability. Inspired by human 3D design workflows, we propose CoPart - a part-aware diffusion framework that decomposes 3D objects into contextual part latents for coherent multi-part generation. This paradigm offers three advantages: i) Reduces encoding complexity through part decomposition; ii) Enables explicit part relationship modeling; iii) Supports part-level conditioning. We further develop a mutual guidance strategy to fine-tune pre-trained diffusion models for joint part latent denoising, ensuring both geometric coherence and foundation model priors. To enable large-scale training, we construct Partverse - a novel 3D part dataset derived from Objaverse through automated mesh segmentation and human-verified annotations. Extensive experiments demonstrate CoPart's superior capabilities in part-level editing, articulated object generation, and scene composition with unprecedented controllability.
Authors (13)
Shaocong Dong
Lihe Ding
Xiao Chen
Yaokun Li
Yuxin Wang
Yucheng Wang
+7 more
Submitted
July 11, 2025
arXiv Category
cs.CV
arXiv PDF

Key Contributions

CoPart proposes a part-aware diffusion framework that decomposes 3D objects into contextual part latents, addressing limitations of single-latent representations and holistic coding. This enables better capture of multi-part geometries, explicit modeling of part relationships, and fine-grained, part-level controllability in 3D generation.

Business Value

Enables more efficient and controllable creation of complex 3D assets for gaming, VR/AR, product design, and digital twins, potentially reducing manual modeling effort.