Direct training high-performance spiking neural networks for object recognition and detection

Cited by: 5
Authors
Zhang, Hong [1]
Li, Yang [1]
He, Bin [1]
Fan, Xiongfei [1]
Wang, Yue [1]
Zhang, Yu [1, 2]
Affiliations
[1] Zhejiang Univ, Coll Control Sci & Engn, State Key Lab Ind Control Technol, Hangzhou, Peoples R China
[2] Key Lab Collaborat Sensing & Autonomous Unmanned S, Hangzhou, Peoples R China
Keywords
spiking neural networks; gate residual learning; attention spike decoder; spiking RetinaNet; object recognition; object detection
DOI
10.3389/fnins.2023.1229951
Chinese Library Classification number
Q189 [Neuroscience]
Discipline classification code
071006
Abstract
Introduction: The spiking neural network (SNN) is a bionic model that is energy-efficient when implemented on neuromorphic hardware. The non-differentiability of spiking signals and the complicated neural dynamics make direct training of high-performance SNNs a great challenge. Numerous crucial issues remain to be explored for the deployment of direct-training SNNs, such as gradient vanishing and explosion, spiking signal decoding, and applications in upstream tasks.

Methods: To address gradient vanishing, we introduce a binary selection gate into the basic residual block and propose the spiking gate (SG) ResNet to implement residual learning in SNNs. We propose two appropriate representations of the gate signal and, by analyzing gradient backpropagation, verify that SG ResNet can overcome gradient vanishing or explosion. For spiking signal decoding, our attention spike decoder (ASD) achieves a better decoding scheme than rate coding by dynamically assigning weights to spiking signals along the temporal, channel, and spatial dimensions.

Results and discussion: The SG ResNet and ASD modules are evaluated on multiple object recognition datasets, including the static ImageNet, CIFAR-100, and CIFAR-10 datasets and the neuromorphic DVS-CIFAR10 dataset. Superior accuracy is demonstrated with a simulation time step of only four, specifically 94.52% top-1 accuracy on CIFAR-10 and 75.64% top-1 accuracy on CIFAR-100. Spiking RetinaNet, which uses SG ResNet as the backbone and the ASD module for information decoding, is proposed as the first direct-training hybrid SNN-ANN detector for RGB images. Spiking RetinaNet with an SG ResNet34 backbone achieves an mAP of 0.296 on the object detection dataset MSCOCO.
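The abstract contrasts rate coding with a decoder that assigns weights to spiking signals. As a rough illustration of that contrast along the temporal dimension only, the sketch below compares a plain time average with a weighted sum over time steps. It is not the authors' ASD: the function names, the softmax weighting, and the hand-picked `time_scores` are all assumptions made for this example, and the channel and spatial dimensions mentioned in the abstract are not modeled.

```python
import math

def rate_decode(spikes):
    """Rate coding: average each neuron's binary spikes over all time steps.

    spikes: list of T time steps, each a list of N values in {0, 1}.
    """
    num_steps = len(spikes)
    num_neurons = len(spikes[0])
    return [sum(step[n] for step in spikes) / num_steps
            for n in range(num_neurons)]

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def weighted_temporal_decode(spikes, time_scores):
    """Hypothetical decoder: weight each time step by softmax(time_scores).

    In a trained model the scores would be produced by a learned module;
    here they are passed in by hand (an assumption for illustration).
    """
    weights = softmax(time_scores)
    num_neurons = len(spikes[0])
    return [sum(w * step[n] for w, step in zip(weights, spikes))
            for n in range(num_neurons)]

# Four time steps (matching the simulation length in the abstract), three neurons.
spikes = [
    [1, 0, 1],
    [0, 0, 1],
    [1, 1, 1],
    [0, 0, 1],
]

rates = rate_decode(spikes)
uniform = weighted_temporal_decode(spikes, [0.0, 0.0, 0.0, 0.0])
skewed = weighted_temporal_decode(spikes, [2.0, 0.0, 0.0, 0.0])
```

With equal scores the weighted decoder reduces to rate coding, so rate coding can be read as the special case in which every time step is treated as equally informative. Unequal scores let early or late spikes count for more.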
Pages: 16
Related papers
50 in total
  • [41] High-Performance Scaphoid Fracture Recognition via Effectiveness Assessment of Artificial Neural Networks
    Tung, Yu-Cheng
    Su, Ja-Hwung
    Liao, Yi-Wen
    Chang, Ching-Di
    Cheng, Yu-Fan
    Chang, Wan-Ching
    Chen, Bo-Hong
    APPLIED SCIENCES-BASEL, 2021, 11 (18):
  • [42] A Spiking Neural Network for Tactile Form Based Object Recognition
    Ratnasingam, Sivalogeswaran
    McGinnity, T. M.
    2011 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2011, : 880 - 885
  • [43] Evolving Probabilistic Spiking Neural Networks for Spatio-temporal Pattern Recognition: A Preliminary Study on Moving Object Recognition
    Kasabov, Nikola
    Dhoble, Kshitij
    Nuntalid, Nuttapod
    Mohemmed, Ammar
    NEURAL INFORMATION PROCESSING, PT III, 2011, 7064 : 230 - 239
  • [44] Object detection and activity recognition in video surveillance using neural networks
    Payghode, Vishva
    Goyal, Ayush
    Bhan, Anupama
    Iyer, Sailesh Suryanarayan
    Dubey, Ashwani Kumar
    INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2023, 19 (3/4) : 123 - 138
  • [45] Object Recognition and Detection by Shape and Color Pattern Recognition Utilizing Artificial Neural Networks
    Cruz, Jerome Paul N.
    Lourdes Dimaala, Ma
    Francisco, Laurene Gaile L.
    Franco, Erica Joanna S.
    Bandala, Argel A.
    Dadios, Elmer P.
    2013 INTERNATIONAL CONFERENCE OF INFORMATION AND COMMUNICATION TECHNOLOGY (ICOICT), 2013, : 140 - 144
  • [46] QueryProp: Object Query Propagation for High-Performance Video Object Detection
    He, Fei
    Gao, Naiyu
    Jia, Jian
    Zhao, Xin
    Huang, Kaiqi
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 834 - 842
  • [47] Training Convolutional Neural Networks with Synthesized Data for Object Recognition in Industrial Manufacturing
    Li, Jason
    Gotvall, Per-Lage
    Provost, Julien
    Akesson, Knut
    2019 24TH IEEE INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES AND FACTORY AUTOMATION (ETFA), 2019, : 1544 - 1547
  • [48] Recognition of Arabic Characters using Spiking Neural Networks
    Humaidi, Amjad J.
    Kadhim, Thaer M.
    2017 INTERNATIONAL CONFERENCE ON CURRENT TRENDS IN COMPUTER, ELECTRICAL, ELECTRONICS AND COMMUNICATION (CTCEEC), 2017, : 7 - 11
  • [49] A Method to Automatic Create Dataset for Training Object Detection Neural Networks
    Zhou, Shi
    Yang, Zijun
    Zhu, Miaomiao
    Li, He
    Serikawa, Seiichi
    Mizumachi, Mitsunori
    Zhang, Lifeng
    IEEE Access, 2022, 10 : 80505 - 80517
  • [50] A Method to Automatic Create Dataset for Training Object Detection Neural Networks
    Zhou, Shi
    Yang, Zijun
    Zhu, Miaomiao
    Li, He L.
    Serikawa, Seiichi
    Mizumachi, Mitsunori
    Zhang, Lifeng
    IEEE ACCESS, 2022, 10 : 80505 - 80517