Redirecting to original paper in 30 seconds...
Click below to go immediately or wait for automatic redirect
Introduces ORIGEN, the first zero-shot method for 3D orientation grounding in text-to-image generation. It uses reward-guided sampling with Langevin dynamics and adaptive time rescaling to control object orientation without explicit training for each object/category.
Enables more precise and controllable generation of 3D assets from text descriptions, significantly benefiting industries like game development, VR/AR content creation, and product design by automating and refining the asset creation process.