Redirecting to original paper in 30 seconds...
Click below to go immediately or wait for automatic redirect
This paper introduces novel sampling strategies for configuration variations to generate robust universal locomotion policies for quadrupedal robots. By investigating the effects of sampling physical robot parameters and joint proportional-derivative gains, it enables training a single reinforcement learning policy that generalizes to multiple parameter configurations, improving robustness and enabling zero-shot deployment.
Enables the development of more adaptable and reliable legged robots for diverse environments and tasks, reducing the need for extensive re-training for each new configuration.