Research Paper (arxiv_cv). Audience: 3D Artists, Game Developers, Researchers in Generative AI, Computer Graphics Engineers

From One to More: Contextual Part Latents for 3D Generation

Abstract

Recent advances in 3D generation have transitioned from multi-view 2D rendering approaches to 3D-native latent diffusion frameworks that exploit geometric priors in ground truth data. Despite progress, three key limitations persist: (1) Single-latent representations fail to capture complex multi-part geometries, causing detail degradation; (2) Holistic latent coding neglects part independence and interrelationships critical for compositional design; (3) Global conditioning mechanisms lack fine-grained controllability. Inspired by human 3D design workflows, we propose CoPart - a part-aware diffusion framework that decomposes 3D objects into contextual part latents for coherent multi-part generation. This paradigm offers three advantages: i) Reduces encoding complexity through part decomposition; ii) Enables explicit part relationship modeling; iii) Supports part-level conditioning. We further develop a mutual guidance strategy to fine-tune pre-trained diffusion models for joint part latent denoising, ensuring both geometric coherence and foundation model priors. To enable large-scale training, we construct Partverse - a novel 3D part dataset derived from Objaverse through automated mesh segmentation and human-verified annotations. Extensive experiments demonstrate CoPart's superior capabilities in part-level editing, articulated object generation, and scene composition with unprecedented controllability.

Authors (13)

Shaocong Dong

Lihe Ding

Xiao Chen

Yaokun Li

Yuxin Wang

Yucheng Wang

+7 more

Submitted

July 11, 2025

arXiv Category

cs.CV

Key Contributions

CoPart proposes a part-aware diffusion framework that decomposes 3D objects into contextual part latents, addressing limitations of single-latent representations and holistic coding. This enables better capture of multi-part geometries, explicit modeling of part relationships, and fine-grained, part-level controllability in 3D generation.
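The idea of per-part latents that are updated jointly, each with its own condition, can be illustrated with a toy sketch. This is not CoPart's architecture: the `Part` class, the `joint_step` function, and the hand-written update rule (a stand-in for a learned denoiser) are all hypothetical names and assumptions introduced for illustration. The sketch only shows the control flow the summary describes: every part keeps its own latent and condition, each update sees the other parts as context, and a single part can be edited while the rest stay fixed.

```python
from dataclasses import dataclass
from typing import FrozenSet, List

Vector = List[float]


@dataclass
class Part:
    """Hypothetical container for one part of a 3D object."""
    name: str
    latent: Vector     # per-part latent code (toy stand-in)
    condition: Vector  # per-part conditioning signal (toy stand-in)


def context_mean(parts: List[Part], skip: int) -> Vector:
    """Average the latents of every part except parts[skip]."""
    others = [p.latent for i, p in enumerate(parts) if i != skip]
    dim = len(parts[skip].latent)
    if not others:
        return [0.0] * dim
    return [sum(v[d] for v in others) / len(others) for d in range(dim)]


def joint_step(
    parts: List[Part],
    step_size: float = 0.5,
    context_weight: float = 0.1,
    frozen: FrozenSet[str] = frozenset(),
) -> List[Part]:
    """One toy update of all part latents at once.

    Each latent moves toward its own condition plus a small pull toward
    the mean of the other parts' latents. Parts named in `frozen` are
    left unchanged but still act as context for the others.
    This rule is an assumption, not a trained diffusion denoiser.
    """
    updated = []
    for i, part in enumerate(parts):
        if part.name in frozen:
            updated.append(part)
            continue
        ctx = context_mean(parts, i)
        target = [c + context_weight * x for c, x in zip(part.condition, ctx)]
        latent = [z + step_size * (t - z) for z, t in zip(part.latent, target)]
        updated.append(Part(part.name, latent, part.condition))
    return updated


if __name__ == "__main__":
    chair = [
        Part("seat", [0.0, 0.0], [1.0, 0.0]),
        Part("back", [0.0, 0.0], [0.0, 1.0]),
        Part("legs", [0.0, 0.0], [1.0, 1.0]),
    ]
    # Part-level edit: only "legs" is updated; "seat" and "back" stay fixed.
    for _ in range(5):
        chair = joint_step(chair, frozen=frozenset({"seat", "back"}))
    for part in chair:
        print(part.name, part.latent)
```

Keeping the frozen parts in the context computation is the design point: an edited part is still updated with knowledge of its neighbours, which is the kind of coherence a part-aware model aims for.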

Business Value

Enables more efficient and controllable creation of complex 3D assets for gaming, VR/AR, product design, and digital twins, potentially reducing manual modeling effort.

Paper Metadata

Innovation Type

Algorithmic

Deployment Feasibility

Moderate. Requires significant computational resources for training and inference. Integration into existing 3D pipelines is feasible.

Limitations Addressed

Detail degradation in complex multi-part geometries (due to single-latent representations), neglect of part independence and interrelationships (due to holistic latent coding), lack of fine-grained controllability (due to global conditioning mechanisms)

Technical Tags

3D generation, diffusion models, part-aware generation, contextual part latents, compositional design, geometric priors, latent diffusion, multi-part objects

Research Topics

3D Generative Models, Compositional AI, Diffusion Models, Geometric Deep Learning

Methods & Architectures

Part-aware diffusion framework, Contextual part latents, Mutual guidance strategy, Decomposition of 3D objects, Holistic latent coding (contrast), Diffusion Models, Latent Diffusion Models

Applications & Tasks

3D Content Creation, Computer Graphics, Virtual Reality, Augmented Reality, 3D Object Generation, Handling Complex Geometries, Controllable Generation, Generating complex multi-part 3D objects, Improving detail and coherence in 3D generation, Enabling part-level control in 3D synthesis

Related Fields

Computer Graphics, Generative Models, Deep Learning, Computer Vision, 3D Modeling

Keywords

3D generation, diffusion models, part-based generation, compositionality, latent space, controllable generation, geometric priors, multi-part objects, 3D assets, computer graphics

Academic Context

#3D Generative Models, #Compositional AI, #Diffusion Models, #Geometric Deep Learning

Commercial Potential

Potential Products

3D Asset Generation Tools, Procedural Content Generation Systems, AI-powered 3D Design Software

Target Industries

Gaming, Entertainment, Architecture, Product Design, Metaverse

Use Case Examples

Generating diverse furniture models for interior design, creating character models with customizable parts for games, synthesizing complex mechanical components

Competitive Edge

Advances state-of-the-art in 3D generation by introducing explicit part-aware decomposition and control, overcoming limitations of holistic approaches.

Market Opportunity

Significant growth in the 3D content creation market.

Revenue Models

Software licensing, API access, cloud-based generation services.

Resource Requirements

Compute Needs

High (training diffusion models for 3D)

Data Requirements

Large datasets of 3D objects, potentially with part annotations.

Deployment Constraints

Computational cost, need for specialized 3D data.

Scalability

Scalability depends on the complexity of the 3D objects and the efficiency of the diffusion model.

Regulatory Considerations

N/A

Production Readiness

Maturity Level

Research

Time to Market

2-4 years (for robust tools)

Patent Potential

Moderate (novel representation and framework)
