Redirecting to original paper in 30 seconds...
Click below to go immediately or wait for automatic redirect
S$^2$-Diffusion enables robot manipulation skills to generalize from specific instances to entire categories by using an open-vocabulary spatial-semantic diffusion policy. It leverages a promptable semantic module and depth estimation from a single RGB camera, allowing skills to transfer between different objects and environments within the same category.
Significantly enhances the adaptability and reusability of robot skills, reducing the need for extensive retraining for new objects or slightly different environments, thus accelerating robot deployment in diverse settings.