Redirecting to original paper in 30 seconds...

Click below to go immediately or wait for automatic redirect

arxiv_cv 95% Match Research Paper Computer Vision Researchers,Graphics Engineers,Robotics Engineers,Developers of VR/AR applications 1 week ago

STG-Avatar: Animatable Human Avatars via Spacetime Gaussian

computer-vision › 3d-vision
📄 Abstract

Abstract: Realistic animatable human avatars from monocular videos are crucial for advancing human-robot interaction and enhancing immersive virtual experiences. While recent research on 3DGS-based human avatars has made progress, it still struggles with accurately representing detailed features of non-rigid objects (e.g., clothing deformations) and dynamic regions (e.g., rapidly moving limbs). To address these challenges, we present STG-Avatar, a 3DGS-based framework for high-fidelity animatable human avatar reconstruction. Specifically, our framework introduces a rigid-nonrigid coupled deformation framework that synergistically integrates Spacetime Gaussians (STG) with linear blend skinning (LBS). In this hybrid design, LBS enables real-time skeletal control by driving global pose transformations, while STG complements it through spacetime adaptive optimization of 3D Gaussians. Furthermore, we employ optical flow to identify high-dynamic regions and guide the adaptive densification of 3D Gaussians in these regions. Experimental results demonstrate that our method consistently outperforms state-of-the-art baselines in both reconstruction quality and operational efficiency, achieving superior quantitative metrics while retaining real-time rendering capabilities. Our code is available at https://github.com/jiangguangan/STG-Avatar
Authors (7)
Guangan Jiang
Tianzi Zhang
Dong Li
Zhenjun Zhao
Haoang Li
Mingrui Li
+1 more
Submitted
October 25, 2025
arXiv Category
cs.CV
arXiv PDF

Key Contributions

STG-Avatar is a 3DGS-based framework for high-fidelity animatable human avatars from monocular videos. It combines LBS for skeletal control with STG for detailed spacetime optimization, and uses optical flow to adapt densification in dynamic regions, effectively handling clothing and limb movements.

Business Value

Enables the creation of highly realistic and controllable digital humans for immersive experiences, virtual collaboration, and advanced human-robot interaction.