arxiv_ml 85% Match Research Paper Researchers in AI/ML,Robotics engineers,Hardware designers for neuromorphic computing 2 weeks ago

Proxy Target: Bridging the Gap Between Discrete Spiking Neural Networks and Continuous Control

reinforcement-learning › robotics-rl

📄 Abstract

Abstract: Spiking Neural Networks (SNNs) offer low-latency and energy-efficient decision making on neuromorphic hardware, making them attractive for Reinforcement Learning (RL) in resource-constrained edge devices. However, most RL algorithms for continuous control are designed for Artificial Neural Networks (ANNs), particularly the target network soft update mechanism, which conflicts with the discrete and non-differentiable dynamics of spiking neurons. We show that this mismatch destabilizes SNN training and degrades performance. To bridge the gap between discrete SNNs and continuous-control algorithms, we propose a novel proxy target framework. The proxy network introduces continuous and differentiable dynamics that enable smooth target updates, stabilizing the learning process. Since the proxy operates only during training, the deployed SNN remains fully energy-efficient with no additional inference overhead. Extensive experiments on continuous control benchmarks demonstrate that our framework consistently improves stability and achieves up to $32\%$ higher performance across various spiking neuron models. Notably, to the best of our knowledge, this is the first approach that enables SNNs with simple Leaky Integrate and Fire (LIF) neurons to surpass their ANN counterparts in continuous control. This work highlights the importance of SNN-tailored RL algorithms and paves the way for neuromorphic agents that combine high performance with low power consumption. Code is available at https://github.com/xuzijie32/Proxy-Target.

Authors (5)

Zijie Xu

Tong Bu

Zecheng Hao

Jianhao Ding

Zhaofei Yu

Submitted

May 30, 2025

arXiv Category

cs.NE

arXiv PDF

Key Contributions

This paper introduces a novel proxy target framework to bridge the gap between discrete Spiking Neural Networks (SNNs) and continuous control algorithms. The framework enables smooth target updates by introducing continuous and differentiable dynamics during training, stabilizing the learning process without adding inference overhead to the deployed SNN. This is crucial for enabling energy-efficient SNNs on neuromorphic hardware for tasks requiring continuous control.

Business Value

Enables the development of highly energy-efficient and low-latency AI systems for edge devices, such as robots or IoT devices, that require continuous control. This can lead to longer battery life and faster response times in real-world applications.

Paper Metadata

Innovation Type

Algorithmic

Deployment Feasibility

High, as the proxy operates only during training, and the deployed SNN remains fully energy-efficient with no additional inference overhead.

Limitations Addressed

Mismatch between discrete SNN dynamics and continuous-control algorithms (e.g., target network soft update),Training instability and performance degradation in SNNs for continuous control

Technical Tags

Spiking Neural NetworksReinforcement LearningContinuous ControlNeuromorphic HardwareProxy TargetDifferentiable DynamicsEdge DevicesLow-LatencyEnergy-EfficientDiscrete Dynamics

Research Topics

Deep Reinforcement LearningNeuromorphic ComputingRobotics ControlEdge AINeural Network Architectures

Methods & Architectures

Proxy Target FrameworkTarget Network Soft UpdateReinforcement Learning Algorithms Spiking Neural Networks (SNNs)Artificial Neural Networks (ANNs)

Applications & Tasks

Resource-constrained edge devices Robotics Autonomous Systems Bridging discrete SNN dynamics with continuous control algorithmsStabilizing SNN trainingDegraded performance in SNNs for continuous control Continuous controlDecision making on neuromorphic hardware

Related Fields

Machine LearningNeuroscienceRoboticsComputer Engineering

Keywords

Spiking Neural NetworksReinforcement LearningContinuous ControlNeuromorphic ComputingEdge AIEnergy EfficiencyLow LatencyProxy TargetDifferentiable ProgrammingRoboticsAutonomous SystemsDeep Learning

Academic Context

#Deep Reinforcement Learning#Neuromorphic Computing#Robotics Control#Edge AI#Neural Network Architectures

Commercial Potential

Potential Products

Energy-efficient AI chips for edge devicesRobotics control systemsAutonomous navigation systems

Target Industries

RoboticsAutomotiveConsumer ElectronicsIoT

Use Case Examples

Autonomous drone controlRobotic arm manipulationSmart sensor data processing

Competitive Edge

Offers a way to leverage the efficiency of SNNs for complex continuous control tasks, which are typically dominated by ANNs, by overcoming fundamental training challenges.

Market Opportunity

Growing market for edge AI and neuromorphic computing.

Resource Requirements

Compute Needs

Likely significant for training, but inference is highly efficient.

Data Requirements

Continuous control benchmarks.

Deployment Constraints

Requires neuromorphic hardware for optimal energy efficiency.

Scalability

SNNs are generally considered scalable for inference on specialized hardware.

Production Readiness

Maturity Level

Research

Time to Market

Medium to Long (requires hardware integration and further validation)

View Full Paper Back to Papers