Redirecting to original paper in 30 seconds...
Click below to go immediately or wait for automatic redirect
📄 Abstract
Abstract: Realistic animatable human avatars from monocular videos are crucial for
advancing human-robot interaction and enhancing immersive virtual experiences.
While recent research on 3DGS-based human avatars has made progress, it still
struggles with accurately representing detailed features of non-rigid objects
(e.g., clothing deformations) and dynamic regions (e.g., rapidly moving limbs).
To address these challenges, we present STG-Avatar, a 3DGS-based framework for
high-fidelity animatable human avatar reconstruction. Specifically, our
framework introduces a rigid-nonrigid coupled deformation framework that
synergistically integrates Spacetime Gaussians (STG) with linear blend skinning
(LBS). In this hybrid design, LBS enables real-time skeletal control by driving
global pose transformations, while STG complements it through spacetime
adaptive optimization of 3D Gaussians. Furthermore, we employ optical flow to
identify high-dynamic regions and guide the adaptive densification of 3D
Gaussians in these regions. Experimental results demonstrate that our method
consistently outperforms state-of-the-art baselines in both reconstruction
quality and operational efficiency, achieving superior quantitative metrics
while retaining real-time rendering capabilities. Our code is available at
https://github.com/jiangguangan/STG-Avatar
Authors (7)
Guangan Jiang
Tianzi Zhang
Dong Li
Zhenjun Zhao
Haoang Li
Mingrui Li
+1 more
Submitted
October 25, 2025
Key Contributions
STG-Avatar is a 3DGS-based framework for high-fidelity animatable human avatars from monocular videos. It combines LBS for skeletal control with STG for detailed spacetime optimization, and uses optical flow to adapt densification in dynamic regions, effectively handling clothing and limb movements.
Business Value
Enables the creation of highly realistic and controllable digital humans for immersive experiences, virtual collaboration, and advanced human-robot interaction.