Abstract
We introduce SoftMimic, a framework for learning compliant whole-body control
policies for humanoid robots from example motions. Imitating human motions with
reinforcement learning allows humanoids to quickly learn new skills, but
existing methods incentivize stiff control that aggressively corrects
deviations from a reference motion, leading to brittle and unsafe behavior when
the robot encounters unexpected contacts. In contrast, SoftMimic enables robots
to respond compliantly to external forces while maintaining balance and
posture. Our approach leverages an inverse kinematics solver to generate an
augmented dataset of feasible compliant motions, which we use to train a
reinforcement learning policy. By rewarding the policy for matching compliant
responses rather than rigidly tracking the reference motion, SoftMimic learns
to absorb disturbances and generalize to varied tasks from a single motion
clip. We validate our method through simulations and real-world experiments,
demonstrating safe and effective interaction with the environment.
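The reward-shaping idea in the abstract can be made concrete with a short sketch. The snippet below contrasts a conventional rigid-tracking reward with a compliant variant; the function names, the Gaussian reward shape, and the virtual-spring mapping from a measured external force to a shifted joint-space target are all illustrative assumptions, not the paper's exact formulation.

```python
import numpy as np

def rigid_tracking_reward(q, q_ref, sigma=0.1):
    """Conventional motion-imitation reward: any deviation from the
    reference pose q_ref is penalized, incentivizing stiff, aggressive
    corrections when the robot is pushed."""
    return np.exp(-np.sum((q - q_ref) ** 2) / sigma ** 2)

def compliant_tracking_reward(q, q_ref, f_ext, J, stiffness=200.0, sigma=0.1):
    """Hypothetical compliant variant (assumed virtual-spring model):
    shift the tracking target by the deflection a Cartesian spring of
    the given stiffness would permit under external force f_ext, mapped
    to joint space via the end-effector Jacobian J. Yielding to the
    force is then rewarded rather than punished."""
    dx = f_ext / stiffness       # Cartesian deflection of the virtual spring
    dq = np.linalg.pinv(J) @ dx  # joint-space equivalent of that deflection
    q_target = q_ref + dq        # compliant reference pose
    return np.exp(-np.sum((q - q_target) ** 2) / sigma ** 2)
```

Under this scheme, a policy that gives way to an unexpected contact still scores highly, whereas the rigid reward would drive it to fight the disturbance.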
Authors (4)
Gabriel B. Margolis
Michelle Wang
Nolan Fey
Pulkit Agrawal
Submitted
October 20, 2025
Key Contributions
Introduces SoftMimic, a framework for learning compliant whole-body control policies for humanoid robots from example motions. An inverse kinematics solver generates an augmented dataset of feasible compliant motions, enabling RL policies to learn compliant responses to external forces while maintaining balance and posture, yielding safer and more robust behavior than rigid imitation methods.
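The IK-based augmentation step can likewise be sketched. The code below uses a toy two-link planar arm as a stand-in for the humanoid's full kinematics; the spring model, stiffness value, sampled force distribution, and damped-least-squares IK are assumptions made for illustration, not SoftMimic's actual solver.

```python
import numpy as np

def fk(q, l1=0.4, l2=0.4):
    """Forward kinematics of a toy 2-link planar arm."""
    x = l1 * np.cos(q[0]) + l2 * np.cos(q[0] + q[1])
    y = l1 * np.sin(q[0]) + l2 * np.sin(q[0] + q[1])
    return np.array([x, y])

def jacobian(q, l1=0.4, l2=0.4):
    s1, s12 = np.sin(q[0]), np.sin(q[0] + q[1])
    c1, c12 = np.cos(q[0]), np.cos(q[0] + q[1])
    return np.array([[-l1 * s1 - l2 * s12, -l2 * s12],
                     [ l1 * c1 + l2 * c12,  l2 * c12]])

def ik_step(q, x_goal, damping=1e-2):
    """One damped least-squares IK step toward the Cartesian goal."""
    J = jacobian(q)
    err = x_goal - fk(q)
    dq = J.T @ np.linalg.solve(J @ J.T + damping * np.eye(2), err)
    return q + dq

def augment(q_ref, stiffness=200.0, n_samples=8, seed=0):
    """Hypothetical augmentation loop: for each sampled external force,
    deflect the reference end-effector goal by a virtual spring, then
    solve IK for a feasible compliant pose to add to the training set."""
    rng = np.random.default_rng(seed)
    dataset, x_ref = [], fk(q_ref)
    for _ in range(n_samples):
        f_ext = rng.normal(scale=20.0, size=2)  # sampled disturbance force [N]
        x_goal = x_ref + f_ext / stiffness      # spring-like goal deflection
        q = q_ref.copy()
        for _ in range(50):                     # iterate IK to convergence
            q = ik_step(q, x_goal)
        dataset.append((f_ext, q))              # (force, compliant pose) pair
    return dataset
```

Each (force, compliant pose) pair provides a feasible target for the policy to imitate, which is what lets a single motion clip generalize across disturbances.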
Business Value
Enables the development of safer, more adaptable humanoid robots capable of interacting with complex, unpredictable environments, crucial for applications in logistics, manufacturing, and assistance.