LoCS-Net: Localizing convolutional spiking neural network for fast visual place recognition

被引：0

作者：

Akcal, Ugur ^{[1
,2
,3
]}

Raikov, Ivan Georgiev ^{[4
]}

Gribkova, Ekaterina Dmitrievna ^{[3
,5
]}

Choudhuri, Anwesa ^{[3
,6
]}

Kim, Seung Hyun ^{[7
]}

Gazzola, Mattia ^{[7
]}

Gillette, Rhanor ^{[5
,8
]}

Soltesz, Ivan ^{[4
]}

Chowdhary, Girish ^{[2
,3
,9
]}

机构：

[1] Univ Illinois, Grainger Coll Engn, Dept Aerosp Engn, Urbana, IL 61820 USA

[2] Univ Illinois, Grainger Coll Engn, Siebel Sch Comp & Data Sci, Urbana, IL 61820 USA

[3] Univ Illinois, Coordinated Sci Lab, Urbana, IL 61820 USA

[4] Stanford Univ, Dept Neurosurg, Stanford, CA 94305 USA

[5] Univ Illinois, Ctr Artificial Intelligence Innovat, Neurosci Program, Urbana, IL USA

[6] Univ Illinois, Grainger Coll Engn, Dept Elect & Comp Engn, Urbana, IL USA

[7] Univ Illinois, Grainger Coll Engn Mech Sci & Engn, Urbana, IL USA

[8] Univ Illinois, Dept Mol & Integrat Physiol, Urbana, IL USA

[9] Univ Illinois, Grainger Coll Engn, Coll Agr & Consumer Econ, Dept Agr & Biol Engn, Urbana, IL 61820 USA

来源：

FRONTIERS IN NEUROROBOTICS | 2025年 / 18卷

关键词：

spiking neural networks; robotics; visual place recognition; localization; supervised learning; convolutional networks; FEATURES; LOCALIZATION; SCENE; MODEL;

D O I：

10.3389/fnbot.2024.1490267

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Visual place recognition (VPR) is the ability to recognize locations in a physical environment based only on visual inputs. It is a challenging task due to perceptual aliasing, viewpoint and appearance variations and complexity of dynamic scenes. Despite promising demonstrations, many state-of-the-art (SOTA) VPR approaches based on artificial neural networks (ANNs) suffer from computational inefficiency. However, spiking neural networks (SNNs) implemented on neuromorphic hardware are reported to have remarkable potential for more efficient solutions computationally. Still, training SOTA SNNs for VPR is often intractable on large and diverse datasets, and they typically demonstrate poor real-time operation performance. To address these shortcomings, we developed an end-to-end convolutional SNN model for VPR that leverages backpropagation for tractable training. Rate-based approximations of leaky integrate-and-fire (LIF) neurons are employed during training, which are then replaced with spiking LIF neurons during inference. The proposed method significantly outperforms existing SOTA SNNs on challenging datasets like Nordland and Oxford RobotCar, achieving 78.6% precision at 100% recall on the Nordland dataset (compared to 73.0% from the current SOTA) and 45.7% on the Oxford RobotCar dataset (compared to 20.2% from the current SOTA). Our approach offers a simpler training pipeline while yielding significant improvements in both training and inference times compared to SOTA SNNs for VPR. Hardware-in-the-loop tests using Intel's neuromorphic USB form factor, Kapoho Bay, show that our on-chip spiking models for VPR trained via the ANN-to-SNN conversion strategy continue to outperform their SNN counterparts, despite a slight but noticeable decrease in performance when transitioning from off-chip to on-chip, while offering significant energy efficiency. The results highlight the outstanding rapid prototyping and real-world deployment capabilities of this approach, showing it to be a substantial step toward more prevalent SNN-based real-world robotics solutions.

引用

页数：16

共 50 条

[41] CSF images fast recognition model based on improved convolutional Neural Network
Huang, Wenming
Leng, Jinqiang
Deng, Zhenrong
PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON AUTOMATION, MECHANICAL CONTROL AND COMPUTATIONAL ENGINEERING, 2015, 124 : 516 - 522
[42] Multimodal Deep Convolutional Neural Network for Audio-Visual Emotion Recognition
Zhang, Shiqing
Zhang, Shiliang
Huang, Tiejun
Gao, Wen
ICMR'16: PROCEEDINGS OF THE 2016 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2016, : 281 - 284
[43] SAM: A Rethinking of Prominent Convolutional Neural Network Architectures for Visual Object Recognition
Wang, Zhenyang
Deng, Zhidong
Wang, Shiyao
2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 1008 - 1014
[44] Visual Tea Leaf Disease Recognition Using a Convolutional Neural Network Model
Chen, Jing
Liu, Qi
Gao, Lingwang
SYMMETRY-BASEL, 2019, 11 (03):
[45] Chinese Traditional Visual Cultural Symbols Recognition Based on Convolutional Neural Network
Tan, Xiao
Wu, Xiaoyu
Yang, Cheng
PROCEEDINGS OF 2016 IEEE 7TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS 2016), 2016, : 945 - 948
[46] Visual Recognition of Ancient Inscriptions Using Convolutional Neural Network and Fisher Vector
Amato, Giuseppe
Falchi, Fabrizio
Vadicamo, Lucia
ACM JOURNAL ON COMPUTING AND CULTURAL HERITAGE, 2016, 9 (04):
[47] Robust place recognition based on salient landmarks screening and convolutional neural network features
Niu, Jie
Qian, Kun
INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2020, 17 (06)
[48] Visual Tracking with Convolutional Neural Network
Zhang, Le
Suganthan, P. N.
2015 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2015): BIG DATA ANALYTICS FOR HUMAN-CENTRIC SYSTEMS, 2015, : 2072 - 2077
[49] Leukocyte recognition with convolutional neural network
Lin, Liqun
Wang, Weixing
Chen, Bolin
JOURNAL OF ALGORITHMS & COMPUTATIONAL TECHNOLOGY, 2018, 13 : 1 - 8
[50] An Odor Recognition Algorithm of Electronic Noses Based on Convolutional Spiking Neural Network for Spoiled Food Identification
Xiong, Yizhou
Chen, Yuantao
Chen, Changming
Wei, Xinwei
Xue, Yingying
Wan, Hao
Wang, Ping
JOURNAL OF THE ELECTROCHEMICAL SOCIETY, 2021, 168 (07)

← 1 2 3 4 5 →