Redirecting to original paper in 30 seconds...

Click below to go immediately or wait for automatic redirect

arxiv_ml 85% Match Research Paper Researchers in AI/ML,Robotics engineers,Hardware designers for neuromorphic computing 2 weeks ago

Proxy Target: Bridging the Gap Between Discrete Spiking Neural Networks and Continuous Control

reinforcement-learning › robotics-rl
📄 Abstract

Abstract: Spiking Neural Networks (SNNs) offer low-latency and energy-efficient decision making on neuromorphic hardware, making them attractive for Reinforcement Learning (RL) in resource-constrained edge devices. However, most RL algorithms for continuous control are designed for Artificial Neural Networks (ANNs), particularly the target network soft update mechanism, which conflicts with the discrete and non-differentiable dynamics of spiking neurons. We show that this mismatch destabilizes SNN training and degrades performance. To bridge the gap between discrete SNNs and continuous-control algorithms, we propose a novel proxy target framework. The proxy network introduces continuous and differentiable dynamics that enable smooth target updates, stabilizing the learning process. Since the proxy operates only during training, the deployed SNN remains fully energy-efficient with no additional inference overhead. Extensive experiments on continuous control benchmarks demonstrate that our framework consistently improves stability and achieves up to $32\%$ higher performance across various spiking neuron models. Notably, to the best of our knowledge, this is the first approach that enables SNNs with simple Leaky Integrate and Fire (LIF) neurons to surpass their ANN counterparts in continuous control. This work highlights the importance of SNN-tailored RL algorithms and paves the way for neuromorphic agents that combine high performance with low power consumption. Code is available at https://github.com/xuzijie32/Proxy-Target.
Authors (5)
Zijie Xu
Tong Bu
Zecheng Hao
Jianhao Ding
Zhaofei Yu
Submitted
May 30, 2025
arXiv Category
cs.NE
arXiv PDF

Key Contributions

This paper introduces a novel proxy target framework to bridge the gap between discrete Spiking Neural Networks (SNNs) and continuous control algorithms. The framework enables smooth target updates by introducing continuous and differentiable dynamics during training, stabilizing the learning process without adding inference overhead to the deployed SNN. This is crucial for enabling energy-efficient SNNs on neuromorphic hardware for tasks requiring continuous control.

Business Value

Enables the development of highly energy-efficient and low-latency AI systems for edge devices, such as robots or IoT devices, that require continuous control. This can lead to longer battery life and faster response times in real-world applications.