Redirecting to original paper in 30 seconds...

Click below to go immediately or wait for automatic redirect

arxiv_cv 95% Match Research Paper Researchers in generative AI,Animators,Game developers,Robotics engineers 1 week ago

InfiniDreamer: Arbitrarily Long Human Motion Generation via Segment Score Distillation

generative-ai › autoregressive
📄 Abstract

Abstract: We present InfiniDreamer, a novel framework for arbitrarily long human motion generation. InfiniDreamer addresses the limitations of current motion generation methods, which are typically restricted to short sequences due to the lack of long motion training data. To achieve this, we first generate sub-motions corresponding to each textual description and then assemble them into a coarse, extended sequence using randomly initialized transition segments. We then introduce an optimization-based method called Segment Score Distillation (SSD) to refine the entire long motion sequence. SSD is designed to utilize an existing motion prior, which is trained only on short clips, in a training-free manner. Specifically, SSD iteratively refines overlapping short segments sampled from the coarsely extended long motion sequence, progressively aligning them with the pre-trained motion diffusion prior. This process ensures local coherence within each segment, while the refined transitions between segments maintain global consistency across the entire sequence. Extensive qualitative and quantitative experiments validate the superiority of our framework, showcasing its ability to generate coherent, contextually aware motion sequences of arbitrary length.
Authors (3)
Wenjie Zhuo
Fan Ma
Hehe Fan
Submitted
November 27, 2024
arXiv Category
cs.CV
arXiv PDF

Key Contributions

Introduces InfiniDreamer for generating arbitrarily long human motion sequences, overcoming the limitations of short sequence generation in current methods. It achieves this by assembling sub-motions and using Segment Score Distillation (SSD) to refine the entire sequence in a training-free manner, leveraging existing short-clip motion priors.

Business Value

Enables the creation of more realistic and longer character animations for entertainment, virtual environments, and potentially for training robotic agents, reducing manual effort.