BrainQN: Enhancing the Robustness of Deep Reinforcement Learning with Spiking Neural Networks

被引：0

作者：

Feng, Shuo ^{[1
]}

Cao, Jian ^{[1
]}

Ou, Zehong ^{[1
]}

Chen, Guang ^{[1
]}

Zhong, Yi ^{[2
]}

Wang, Zilin ^{[2
]}

Yan, Juntong ^{[1
]}

Chen, Jue ^{[1
]}

Wang, Bingsen ^{[1
]}

Zou, Chenglong ^{[3
]}

Feng, Zebang ^{[1
]}

Wang, Yuan ^{[2
,4
]}

机构：

[1] Peking Univ, Sch Software & Microelect, Beijing 102600, Peoples R China

[2] Peking Univ, MPW Ctr, Sch Integrated Circuits, Key Lab Microelect Devices & Circuits MoE, Beijing 100871, Peoples R China

[3] Peking Univ, Chongqing Res Inst Big Data, Chongqing 400030, Peoples R China

[4] Beijing Adv Innovat Ctr Integrated Circuits, Beijing 100871, Peoples R China

来源：

ADVANCED INTELLIGENT SYSTEMS | 2024年 / 6卷 / 09期

关键词：

deep reinforcement learning; efficiency; neuromorphic chips; robustness; spiking neural networks; CHIP; INTELLIGENCE; MEMORY; LEVEL; GO;

D O I：

10.1002/aisy.202400075

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

As the third-generation network succeeding artificial neural networks (ANNs), spiking neural networks (SNNs) offer high robustness and low energy consumption. Inspired by biological systems, the limitations of low robustness and high-power consumption in deep reinforcement learning (DRL) are addressed by introducing SNNs. The Brain Q-network (BrainQN) is proposed, which replaces the neurons in the classic Deep Q-learning (DQN) algorithm with SNN neurons. BrainQN is trained using surrogate gradient learning (SGL) and ANN-to-SNN conversion methods. Robustness tests with input noise reveal BrainQN's superior performance, achieving an 82.14% increase in rewards under low noise and 71.74% under high noise compared to DQN. These findings highlight BrainQN's robustness and superior performance in noisy environments, supporting its application in complex scenarios. SGL-trained BrainQN is more robust than ANN-to-SNN conversion under high noise. The differences in network output correlations between noisy and original inputs, along with training algorithm distinctions, explain this phenomenon. BrainQN successfully transitioned from a simulated Pong environment to a ball-catching robot with dynamic vision sensors (DVS). On the neuromorphic chip PAICORE, it shows significant advantages in latency and power consumption compared to Jetson Xavier NX. This article addresses the limitations of deep reinforcement learning (DRL) by introducing spiking neural networks (SNNs). This article proposes the Brain Q-network (BrainQN), which replaces the neurons in the classic Deep Q-learning (DQN) with SNN neurons. BrainQN demonstrates excellent performance in terms of robustness against noise attacks and power consumption.image (c) 2024 WILEY-VCH GmbH

引用

页数：21

共 50 条

[1] A reinforcement learning algorithm for spiking neural networks
Florian, RV
[J]. Seventh International Symposium on Symbolic and Numeric Algorithms for Scientific Computing, Proceedings, 2005, : 299 - 306
[2] Deep learning in spiking neural networks
Tavanaei, Amirhossein
Ghodrati, Masoud
Kheradpisheh, Saeed Reza
Masquelier, Timothee
Maida, Anthony
[J]. NEURAL NETWORKS, 2019, 111 : 47 - 63
[3] Learning in neural networks by reinforcement of irregular spiking
Xie, XH
Seung, HS
[J]. PHYSICAL REVIEW E, 2004, 69 (04): : 10
[4] Enhancing the Robustness of Spiking Neural Networks with Stochastic Gating Mechanisms
Ding, Jianhao
Yu, Zhaofei
Huang, Tiejun
Liu, Jian K.
[J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 1, 2024, : 492 - 502
[5] Deep Residual Learning in Spiking Neural Networks
Fang, Wei
Yu, Zhaofei
Chen, Yanqi
Huang, Tiejun
Masquelier, Timothee
Tian, Yonghong
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[6] The geometry of robustness in spiking neural networks
Calaim, Nuno
Dehmelt, Florian A.
Goncalves, Pedro J.
Machens, Christian K.
[J]. ELIFE, 2022, 11
[7] Stochasticity and robustness in spiking neural networks
Olin-Ammentorp, Wilkie
Beckmann, Karsten
Schuman, Catherine D.
Plank, James S.
Cady, Nathaniel C.
[J]. NEUROCOMPUTING, 2021, 419 : 23 - 36
[8] Reinforcement Learning in Spiking Neural Networks with Stochastic and Deterministic Synapses
Yuan, Mengwen
Wu, Xi
Yan, Rui
Tang, Huajin
[J]. NEURAL COMPUTATION, 2019, 31 (12) : 2368 - 2389
[9] Learning in spiking neural networks by reinforcement of stochastic synaptic transmission
Seung, HS
[J]. NEURON, 2003, 40 (06) : 1063 - 1073
[10] DSNNs: learning transfer from deep neural networks to spiking neural networks
Zhang, Lei
Du, Zidong
Li, Ling
Chen, Yunji
[J]. High Technology Letters, 2020, 26 (02): : 136 - 144

← 1 2 3 4 5 →