BrainQN: Enhancing the Robustness of Deep Reinforcement Learning with Spiking Neural Networks

被引:0
|
作者
Feng, Shuo [1 ]
Cao, Jian [1 ]
Ou, Zehong [1 ]
Chen, Guang [1 ]
Zhong, Yi [2 ]
Wang, Zilin [2 ]
Yan, Juntong [1 ]
Chen, Jue [1 ]
Wang, Bingsen [1 ]
Zou, Chenglong [3 ]
Feng, Zebang [1 ]
Wang, Yuan [2 ,4 ]
机构
[1] Peking Univ, Sch Software & Microelect, Beijing 102600, Peoples R China
[2] Peking Univ, MPW Ctr, Sch Integrated Circuits, Key Lab Microelect Devices & Circuits MoE, Beijing 100871, Peoples R China
[3] Peking Univ, Chongqing Res Inst Big Data, Chongqing 400030, Peoples R China
[4] Beijing Adv Innovat Ctr Integrated Circuits, Beijing 100871, Peoples R China
关键词
deep reinforcement learning; efficiency; neuromorphic chips; robustness; spiking neural networks; CHIP; INTELLIGENCE; MEMORY; LEVEL; GO;
D O I
10.1002/aisy.202400075
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As the third-generation network succeeding artificial neural networks (ANNs), spiking neural networks (SNNs) offer high robustness and low energy consumption. Inspired by biological systems, the limitations of low robustness and high-power consumption in deep reinforcement learning (DRL) are addressed by introducing SNNs. The Brain Q-network (BrainQN) is proposed, which replaces the neurons in the classic Deep Q-learning (DQN) algorithm with SNN neurons. BrainQN is trained using surrogate gradient learning (SGL) and ANN-to-SNN conversion methods. Robustness tests with input noise reveal BrainQN's superior performance, achieving an 82.14% increase in rewards under low noise and 71.74% under high noise compared to DQN. These findings highlight BrainQN's robustness and superior performance in noisy environments, supporting its application in complex scenarios. SGL-trained BrainQN is more robust than ANN-to-SNN conversion under high noise. The differences in network output correlations between noisy and original inputs, along with training algorithm distinctions, explain this phenomenon. BrainQN successfully transitioned from a simulated Pong environment to a ball-catching robot with dynamic vision sensors (DVS). On the neuromorphic chip PAICORE, it shows significant advantages in latency and power consumption compared to Jetson Xavier NX. This article addresses the limitations of deep reinforcement learning (DRL) by introducing spiking neural networks (SNNs). This article proposes the Brain Q-network (BrainQN), which replaces the neurons in the classic Deep Q-learning (DQN) with SNN neurons. BrainQN demonstrates excellent performance in terms of robustness against noise attacks and power consumption.image (c) 2024 WILEY-VCH GmbH
引用
收藏
页数:21
相关论文
共 50 条
  • [1] A reinforcement learning algorithm for spiking neural networks
    Florian, RV
    [J]. Seventh International Symposium on Symbolic and Numeric Algorithms for Scientific Computing, Proceedings, 2005, : 299 - 306
  • [2] Deep learning in spiking neural networks
    Tavanaei, Amirhossein
    Ghodrati, Masoud
    Kheradpisheh, Saeed Reza
    Masquelier, Timothee
    Maida, Anthony
    [J]. NEURAL NETWORKS, 2019, 111 : 47 - 63
  • [3] Learning in neural networks by reinforcement of irregular spiking
    Xie, XH
    Seung, HS
    [J]. PHYSICAL REVIEW E, 2004, 69 (04): : 10
  • [4] Enhancing the Robustness of Spiking Neural Networks with Stochastic Gating Mechanisms
    Ding, Jianhao
    Yu, Zhaofei
    Huang, Tiejun
    Liu, Jian K.
    [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 1, 2024, : 492 - 502
  • [5] Deep Residual Learning in Spiking Neural Networks
    Fang, Wei
    Yu, Zhaofei
    Chen, Yanqi
    Huang, Tiejun
    Masquelier, Timothee
    Tian, Yonghong
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [6] The geometry of robustness in spiking neural networks
    Calaim, Nuno
    Dehmelt, Florian A.
    Goncalves, Pedro J.
    Machens, Christian K.
    [J]. ELIFE, 2022, 11
  • [7] Stochasticity and robustness in spiking neural networks
    Olin-Ammentorp, Wilkie
    Beckmann, Karsten
    Schuman, Catherine D.
    Plank, James S.
    Cady, Nathaniel C.
    [J]. NEUROCOMPUTING, 2021, 419 : 23 - 36
  • [8] Reinforcement Learning in Spiking Neural Networks with Stochastic and Deterministic Synapses
    Yuan, Mengwen
    Wu, Xi
    Yan, Rui
    Tang, Huajin
    [J]. NEURAL COMPUTATION, 2019, 31 (12) : 2368 - 2389
  • [9] Learning in spiking neural networks by reinforcement of stochastic synaptic transmission
    Seung, HS
    [J]. NEURON, 2003, 40 (06) : 1063 - 1073
  • [10] DSNNs: learning transfer from deep neural networks to spiking neural networks
    Zhang, Lei
    Du, Zidong
    Li, Ling
    Chen, Yunji
    [J]. High Technology Letters, 2020, 26 (02): : 136 - 144