Temporal Encoding and Multispike Learning Framework for Efficient Recognition of Visual Patterns

Cited by: 10

Authors:

Yu, Qiang ^{[1]}

Song, Shiming ^{[1]}

Ma, Chenxiang ^{[1]}

Wei, Jianguo ^{[2,3]}

Chen, Shengyong ^{[4]}

Tan, Kay Chen ^{[5,6]}

Affiliations:

[1] Tianjin Univ, Tianjin Key Lab Cognit Comp & Applicat, Coll Intelligence & Comp, Tianjin 300350, Peoples R China

[2] Tianjin Univ, Coll Intelligence & Comp, Tianjin 300350, Peoples R China

[3] Qinghai Univ Nationalities, Sch Comp Sci, Xining 810007, Qinghai, Peoples R China

[4] Tianjin Univ Technol, Sch Comp Sci & Engn, Tianjin 300072, Peoples R China

[5] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China

[6] City Univ Hong Kong, Shenzhen Res Inst, Hong Kong, Peoples R China

Source:

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2022 / Vol. 33 / Issue 08

Funding:

National Natural Science Foundation of China;

Keywords:

Encoding; Neurons; Image coding; Task analysis; Visualization; Feature extraction; Computational modeling; Image classification; multispike learning; neuromorphic computing; spiking neural networks (SNNs); temporal encoding; SPIKING NEURONS; NETWORK; ARCHITECTURE;

DOI:

10.1109/TNNLS.2021.3052804

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification code:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Biological systems, operating under parallel, spike-based computation, endow individuals with the ability to respond promptly and reliably to different stimuli. Spiking neural networks (SNNs) have thus been developed to emulate their efficiency and to explore principles of spike-based processing. However, designing a biologically plausible and efficient SNN for image classification remains a challenging task. Previous efforts can be broadly grouped into two major categories according to the coding scheme employed: rate and temporal. The rate-based schemes suffer from inefficiency, whereas the temporal-based ones typically end up with relatively poor accuracy. It is intriguing and important to develop an SNN that takes both efficiency and efficacy into account. In this article, we focus on the temporal-based approaches, aiming to advance their accuracy by a great margin while retaining their efficiency. A new temporal-based framework integrated with multispike learning is developed for efficient recognition of visual patterns. Different approaches to encoding and learning under our framework are evaluated on the MNIST and Fashion-MNIST data sets. Experimental results demonstrate the efficient and effective performance of our temporal-based approaches across a variety of conditions, improving accuracies to levels that are even comparable to rate-based ones but, importantly, with a lighter network structure and far fewer spikes. This article attempts to extend advanced multispike learning to the challenging task of image recognition and to bring the state of the art in temporal-based approaches to a new level. The experimental results could potentially be favorable to low-power and high-speed requirements in the field of artificial intelligence and help attract more efforts toward brain-like computing.


Pages: 3387 - 3399

Number of pages: 13
