Redirecting to original paper in 30 seconds...
Click below to go immediately or wait for automatic redirect
Proposes Evidential Proximal Policy Optimization (EPPO) to overcome non-stationary dynamics in continuous control tasks. EPPO utilizes an evidential critic to accurately estimate uncertainty around state values, preserving critic plasticity and enabling directed exploration for rapid adaptation to changing dynamics, thus improving stability and performance.
Enables the development of more robust and adaptable robotic and autonomous systems that can operate reliably in unpredictable and changing real-world environments.