arxiv_cv 95% Match Research Paper Computer Vision Researchers,Graphics Engineers,Robotics Engineers,Developers of VR/AR applications 1 week ago

STG-Avatar: Animatable Human Avatars via Spacetime Gaussian

computer-vision › 3d-vision

📄 Abstract

Abstract: Realistic animatable human avatars from monocular videos are crucial for advancing human-robot interaction and enhancing immersive virtual experiences. While recent research on 3DGS-based human avatars has made progress, it still struggles with accurately representing detailed features of non-rigid objects (e.g., clothing deformations) and dynamic regions (e.g., rapidly moving limbs). To address these challenges, we present STG-Avatar, a 3DGS-based framework for high-fidelity animatable human avatar reconstruction. Specifically, our framework introduces a rigid-nonrigid coupled deformation framework that synergistically integrates Spacetime Gaussians (STG) with linear blend skinning (LBS). In this hybrid design, LBS enables real-time skeletal control by driving global pose transformations, while STG complements it through spacetime adaptive optimization of 3D Gaussians. Furthermore, we employ optical flow to identify high-dynamic regions and guide the adaptive densification of 3D Gaussians in these regions. Experimental results demonstrate that our method consistently outperforms state-of-the-art baselines in both reconstruction quality and operational efficiency, achieving superior quantitative metrics while retaining real-time rendering capabilities. Our code is available at https://github.com/jiangguangan/STG-Avatar

Authors (7)

Guangan Jiang

Tianzi Zhang

Dong Li

Zhenjun Zhao

Haoang Li

Mingrui Li

+1 more

Submitted

October 25, 2025

arXiv Category

cs.CV

arXiv PDF

Key Contributions

STG-Avatar is a 3DGS-based framework for high-fidelity animatable human avatars from monocular videos. It combines LBS for skeletal control with STG for detailed spacetime optimization, and uses optical flow to adapt densification in dynamic regions, effectively handling clothing and limb movements.

Business Value

Enables the creation of highly realistic and controllable digital humans for immersive experiences, virtual collaboration, and advanced human-robot interaction.

Paper Metadata

Innovation Type

Methodological

Deployment Feasibility

Moderate, requires significant computational resources for training and rendering, but LBS enables real-time control.

Limitations Addressed

Struggles of existing 3DGS-based avatars in accurately representing detailed non-rigid deformations (like clothing) and dynamic regions (like moving limbs).

Technical Tags

animatable avatarsmonocular videoSpacetime Gaussians (STG)3D Gaussian Splattinglinear blend skinning (LBS)non-rigid deformationclothing deformationdynamic regionshuman reconstructionhuman-robot interaction

Research Topics

3D Human ReconstructionAvatar GenerationComputer GraphicsComputer VisionHuman-Robot InteractionGenerative Models

Methods & Architectures

STG-Avatar frameworkRigid-nonrigid coupled deformationSpacetime Gaussians (STG)Linear Blend Skinning (LBS)Optical flow for densification 3D Gaussian Splatting (3DGS)Spacetime Gaussians (STG)

Applications & Tasks

Virtual Reality Augmented Reality Gaming Human-Robot Interaction Telepresence Metaverse Accurate representation of non-rigid object deformations (e.g., clothing)Handling dynamic regions in human avatarsGenerating realistic animatable avatars from monocular videoReal-time avatar control High-fidelity animatable human avatar reconstructionReal-time avatar animationGenerating avatars from monocular input

Related Fields

Computer VisionComputer GraphicsRoboticsVirtual RealityMachine Learning

Keywords

avatar3D reconstructionmonocular videoSpacetime Gaussians3D Gaussian SplattinganimatabledeformationLBSreal-timehuman avatar

Academic Context

#3D Human Reconstruction#Avatar Generation#Computer Graphics#Computer Vision#Human-Robot Interaction#Generative Models

Commercial Potential

Potential Products

Real-time avatar creation toolsSDKs for animatable digital humansVirtual character engines for games and metaverse platforms

Target Industries

GamingMetaverseFilm and AnimationTelecommunicationsRobotics

Use Case Examples

Creating realistic avatars for virtual meetings and social interactionsEnabling robots to have expressive and human-like avatarsGenerating characters for video games and virtual worlds

Competitive Edge

Achieves higher fidelity and better handling of dynamic elements in animatable avatars compared to previous 3DGS-based methods by integrating LBS and spacetime optimization.

Market Opportunity

Rapid growth in the virtual reality, metaverse, and digital human markets.

Revenue Models

Licensing the STG-Avatar technologyproviding avatar creation services.

Resource Requirements

Compute Needs

High for training, moderate to high for rendering depending on complexity and real-time needs.

Data Requirements

Requires monocular videos of humans with motion capture or ground truth pose information for training.

Deployment Constraints

Computational cost, need for accurate motion capture or pose estimation for animation.

Scalability

Scales with the complexity of the human model and the desired level of detail.

Production Readiness

Maturity Level

Research

Time to Market

18-36 months

Patent Potential

Moderate, for the coupled deformation framework and STG integration.

View Full Paper Back to Papers