Redirecting to original paper in 30 seconds...
Click below to go immediately or wait for automatic redirect
📄 Abstract
Abstract: Spiking Neural Networks (SNNs) offer low-latency and energy-efficient
decision making on neuromorphic hardware, making them attractive for
Reinforcement Learning (RL) in resource-constrained edge devices. However, most
RL algorithms for continuous control are designed for Artificial Neural
Networks (ANNs), particularly the target network soft update mechanism, which
conflicts with the discrete and non-differentiable dynamics of spiking neurons.
We show that this mismatch destabilizes SNN training and degrades performance.
To bridge the gap between discrete SNNs and continuous-control algorithms, we
propose a novel proxy target framework. The proxy network introduces continuous
and differentiable dynamics that enable smooth target updates, stabilizing the
learning process. Since the proxy operates only during training, the deployed
SNN remains fully energy-efficient with no additional inference overhead.
Extensive experiments on continuous control benchmarks demonstrate that our
framework consistently improves stability and achieves up to $32\%$ higher
performance across various spiking neuron models. Notably, to the best of our
knowledge, this is the first approach that enables SNNs with simple Leaky
Integrate and Fire (LIF) neurons to surpass their ANN counterparts in
continuous control. This work highlights the importance of SNN-tailored RL
algorithms and paves the way for neuromorphic agents that combine high
performance with low power consumption. Code is available at
https://github.com/xuzijie32/Proxy-Target.
Authors (5)
Zijie Xu
Tong Bu
Zecheng Hao
Jianhao Ding
Zhaofei Yu
Key Contributions
This paper introduces a novel proxy target framework to bridge the gap between discrete Spiking Neural Networks (SNNs) and continuous control algorithms. The framework enables smooth target updates by introducing continuous and differentiable dynamics during training, stabilizing the learning process without adding inference overhead to the deployed SNN. This is crucial for enabling energy-efficient SNNs on neuromorphic hardware for tasks requiring continuous control.
Business Value
Enables the development of highly energy-efficient and low-latency AI systems for edge devices, such as robots or IoT devices, that require continuous control. This can lead to longer battery life and faster response times in real-world applications.