LoCS-Net: Localizing convolutional spiking neural network for fast visual place recognition

Cited by: 0
Authors
Akcal, Ugur [1, 2, 3]
Raikov, Ivan Georgiev [4]
Gribkova, Ekaterina Dmitrievna [3, 5]
Choudhuri, Anwesa [3, 6]
Kim, Seung Hyun [7]
Gazzola, Mattia [7]
Gillette, Rhanor [5, 8]
Soltesz, Ivan [4]
Chowdhary, Girish [2, 3, 9]
Affiliations
[1] Univ Illinois, Grainger Coll Engn, Dept Aerosp Engn, Urbana, IL 61820 USA
[2] Univ Illinois, Grainger Coll Engn, Siebel Sch Comp & Data Sci, Urbana, IL 61820 USA
[3] Univ Illinois, Coordinated Sci Lab, Urbana, IL 61820 USA
[4] Stanford Univ, Dept Neurosurg, Stanford, CA 94305 USA
[5] Univ Illinois, Ctr Artificial Intelligence Innovat, Neurosci Program, Urbana, IL USA
[6] Univ Illinois, Grainger Coll Engn, Dept Elect & Comp Engn, Urbana, IL USA
[7] Univ Illinois, Grainger Coll Engn, Mech Sci & Engn, Urbana, IL USA
[8] Univ Illinois, Dept Mol & Integrat Physiol, Urbana, IL USA
[9] Univ Illinois, Grainger Coll Engn, Coll Agr & Consumer Econ, Dept Agr & Biol Engn, Urbana, IL 61820 USA
Keywords
spiking neural networks; robotics; visual place recognition; localization; supervised learning; convolutional networks; FEATURES; LOCALIZATION; SCENE; MODEL;
DOI
10.3389/fnbot.2024.1490267
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Visual place recognition (VPR) is the ability to recognize locations in a physical environment based only on visual inputs. It is a challenging task due to perceptual aliasing, viewpoint and appearance variations, and the complexity of dynamic scenes. Despite promising demonstrations, many state-of-the-art (SOTA) VPR approaches based on artificial neural networks (ANNs) suffer from computational inefficiency. In contrast, spiking neural networks (SNNs) implemented on neuromorphic hardware are reported to have remarkable potential for computationally more efficient solutions. Still, training SOTA SNNs for VPR is often intractable on large and diverse datasets, and they typically demonstrate poor real-time performance. To address these shortcomings, we developed an end-to-end convolutional SNN model for VPR that leverages backpropagation for tractable training. Rate-based approximations of leaky integrate-and-fire (LIF) neurons are employed during training, which are then replaced with spiking LIF neurons during inference. The proposed method significantly outperforms existing SOTA SNNs on challenging datasets such as Nordland and Oxford RobotCar, achieving 78.6% precision at 100% recall on the Nordland dataset (compared to 73.0% from the current SOTA) and 45.7% on the Oxford RobotCar dataset (compared to 20.2% from the current SOTA). Our approach offers a simpler training pipeline while yielding significant improvements in both training and inference times compared to SOTA SNNs for VPR. Hardware-in-the-loop tests using Intel's neuromorphic USB form factor, Kapoho Bay, show that our on-chip spiking models for VPR trained via the ANN-to-SNN conversion strategy continue to outperform their SNN counterparts, despite a slight but noticeable decrease in performance when transitioning from off-chip to on-chip, while offering significant energy efficiency. The results highlight the outstanding rapid prototyping and real-world deployment capabilities of this approach, showing it to be a substantial step toward more prevalent SNN-based real-world robotics solutions.
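The abstract's idea of training with a rate-based LIF approximation and then swapping in spiking LIF neurons at inference can be illustrated with a single neuron. The following is a minimal sketch, not the authors' implementation: the function names, the unit firing threshold, and the values of `tau_rc`, `tau_ref`, and `dt` are all assumptions chosen for illustration. It compares the standard analytic steady-state LIF firing rate with the spike count from a simple forward-Euler simulation under constant input.

```python
import math


def lif_rate(current, tau_rc=0.02, tau_ref=0.002):
    """Analytic steady-state firing rate (Hz) of an LIF neuron.

    Assumes a firing threshold of 1 and reset to 0 (illustrative choices).
    Inputs at or below threshold never fire.
    """
    if current <= 1.0:
        return 0.0
    return 1.0 / (tau_ref + tau_rc * math.log1p(1.0 / (current - 1.0)))


def lif_spike_count(current, duration=1.0, dt=1e-4, tau_rc=0.02, tau_ref=0.002):
    """Count spikes of a forward-Euler LIF simulation with constant input."""
    refractory_steps = int(round(tau_ref / dt))
    voltage = 0.0
    refractory_left = 0
    spikes = 0
    for _ in range(int(round(duration / dt))):
        if refractory_left > 0:
            # Neuron is silent and held at reset during the refractory period.
            refractory_left -= 1
            continue
        # Leaky integration toward the input current.
        voltage += dt * (current - voltage) / tau_rc
        if voltage >= 1.0:
            spikes += 1
            voltage = 0.0
            refractory_left = refractory_steps
    return spikes


if __name__ == "__main__":
    for current in (0.5, 1.5, 2.0, 4.0):
        print(current, lif_rate(current), lif_spike_count(current))
```

Over a one-second window the spike count approximates the analytic rate, which is why a network trained on the smooth rate function can be run with spiking neurons afterwards; the small mismatch between the two is one plausible source of the accuracy drop that conversion approaches generally face.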
Pages: 16
Related papers
50 in total
  • [41] CSF images fast recognition model based on improved convolutional Neural Network
    Huang, Wenming
    Leng, Jinqiang
    Deng, Zhenrong
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON AUTOMATION, MECHANICAL CONTROL AND COMPUTATIONAL ENGINEERING, 2015, 124: 516-522
  • [42] Multimodal Deep Convolutional Neural Network for Audio-Visual Emotion Recognition
    Zhang, Shiqing
    Zhang, Shiliang
    Huang, Tiejun
    Gao, Wen
    ICMR'16: PROCEEDINGS OF THE 2016 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2016: 281-284
  • [43] SAM: A Rethinking of Prominent Convolutional Neural Network Architectures for Visual Object Recognition
    Wang, Zhenyang
    Deng, Zhidong
    Wang, Shiyao
    2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016: 1008-1014
  • [44] Visual Tea Leaf Disease Recognition Using a Convolutional Neural Network Model
    Chen, Jing
    Liu, Qi
    Gao, Lingwang
    SYMMETRY-BASEL, 2019, 11 (03)
  • [45] Chinese Traditional Visual Cultural Symbols Recognition Based on Convolutional Neural Network
    Tan, Xiao
    Wu, Xiaoyu
    Yang, Cheng
    PROCEEDINGS OF 2016 IEEE 7TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS 2016), 2016: 945-948
  • [46] Visual Recognition of Ancient Inscriptions Using Convolutional Neural Network and Fisher Vector
    Amato, Giuseppe
    Falchi, Fabrizio
    Vadicamo, Lucia
    ACM JOURNAL ON COMPUTING AND CULTURAL HERITAGE, 2016, 9 (04)
  • [47] Robust place recognition based on salient landmarks screening and convolutional neural network features
    Niu, Jie
    Qian, Kun
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2020, 17 (06)
  • [48] Visual Tracking with Convolutional Neural Network
    Zhang, Le
    Suganthan, P. N.
    2015 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2015): BIG DATA ANALYTICS FOR HUMAN-CENTRIC SYSTEMS, 2015: 2072-2077
  • [49] Leukocyte recognition with convolutional neural network
    Lin, Liqun
    Wang, Weixing
    Chen, Bolin
    JOURNAL OF ALGORITHMS & COMPUTATIONAL TECHNOLOGY, 2018, 13: 1-8
  • [50] An Odor Recognition Algorithm of Electronic Noses Based on Convolutional Spiking Neural Network for Spoiled Food Identification
    Xiong, Yizhou
    Chen, Yuantao
    Chen, Changming
    Wei, Xinwei
    Xue, Yingying
    Wan, Hao
    Wang, Ping
    JOURNAL OF THE ELECTROCHEMICAL SOCIETY, 2021, 168 (07)