Temporal Encoding and Multispike Learning Framework for Efficient Recognition of Visual Patterns

被引:10
|
作者
Yu, Qiang [1 ]
Song, Shiming [1 ]
Ma, Chenxiang [1 ]
Wei, Jianguo [2 ,3 ]
Chen, Shengyong [4 ]
Tan, Kay Chen [5 ,6 ]
机构
[1] Tianjin Univ, Tianjin Key Lab Cognit Comp & Applicat, Coll Intelligence & Comp, Tianjin 300350, Peoples R China
[2] Tianjin Univ, Coll Intelligence & Comp, Tianjin 300350, Peoples R China
[3] Qinghai Univ Nationalities, Sch Comp Sci, Xining 810007, Qinghai, Peoples R China
[4] Tianjin Univ Technol, Sch Comp Sci & Engn, Tianjin 300072, Peoples R China
[5] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[6] City Univ Hong Kong, Shenzhen Res Inst, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Encoding; Neurons; Image coding; Task analysis; Visualization; Feature extraction; Computational modeling; Image classification; multispike learning; neuromorphic computing; spiking neural networks (SNNs); temporal encoding; SPIKING NEURONS; NETWORK; ARCHITECTURE;
D O I
10.1109/TNNLS.2021.3052804
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Biological systems under a parallel and spike-based computation endow individuals with abilities to have prompt and reliable responses to different stimuli. Spiking neural networks (SNNs) have thus been developed to emulate their efficiency and to explore principles of spike-based processing. However, the design of a biologically plausible and efficient SNN for image classification still remains as a challenging task. Previous efforts can be generally clustered into two major categories in terms of coding schemes being employed: rate and temporal. The rate-based schemes suffer inefficiency, whereas the temporal-based ones typically end with a relatively poor performance in accuracy. It is intriguing and important to develop an SNN with both efficiency and efficacy being considered. In this article, we focus on the temporal-based approaches in a way to advance their accuracy performance by a great margin while keeping the efficiency on the other hand. A new temporal-based framework integrated with the multispike learning is developed for efficient recognition of visual patterns. Different approaches of encoding and learning under our framework are evaluated with the MNIST and Fashion-MNIST data sets. Experimental results demonstrate the efficient and effective performance of our temporal-based approaches across a variety of conditions, improving accuracies to higher levels that are even comparable to rate-based ones but importantly with a lighter network structure and far less number of spikes. This article attempts to extend the advanced multispike learning to the challenging task of image recognition and bring state of the arts in temporal-based approaches to a novel level. The experimental results could be potentially favorable to low-power and high-speed requirements in the field of artificial intelligence and contribute to attract more efforts toward brain-like computing.
引用
收藏
页码:3387 / 3399
页数:13
相关论文
共 50 条
  • [21] PHONOLOGICAL ENCODING IN VISUAL WORD RECOGNITION
    SPOEHR, KT
    JOURNAL OF VERBAL LEARNING AND VERBAL BEHAVIOR, 1978, 17 (02): : 127 - 141
  • [22] PosMLP-Video: Spatial and Temporal Relative Position Encoding for Efficient Video Recognition
    Hao, Yanbin
    Zhou, Diansong
    Wang, Zhicai
    Ngo, Chong-Wah
    Wang, Meng
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (12) : 5820 - 5840
  • [23] Efficient Visual Recognition
    Li Liu
    Matti Pietikäinen
    Jie Qin
    Wanli Ouyang
    Luc Van Gool
    International Journal of Computer Vision, 2020, 128 : 1997 - 2001
  • [24] Efficient Visual Recognition
    Liu, Li
    Pietikainen, Matti
    Qin, Jie
    Ouyang, Wanli
    Van Gool, Luc
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2020, 128 (8-9) : 1997 - 2001
  • [25] Learning action patterns in difference images for efficient action recognition
    Lu, Guoliang
    Kudo, Mineichi
    NEUROCOMPUTING, 2014, 123 : 328 - 336
  • [26] Action Recognition with Temporal Scale-Invariant Deep Learning Framework
    Huafeng Chen
    Jun Chen
    Ruimin Hu
    Chen Chen
    Zhongyuan Wang
    中国通信, 2017, 14 (02) : 163 - 172
  • [27] Action Recognition with Temporal Scale-Invariant Deep Learning Framework
    Chen, Huafeng
    Chen, Jun
    Hu, Ruimin
    Chen, Chen
    Wang, Zhongyuan
    CHINA COMMUNICATIONS, 2017, 14 (02) : 163 - 172
  • [28] ROLE OF INFERIOR TEMPORAL NEURONS IN VISUAL-PATTERN RECOGNITION .1. TEMPORAL ENCODING OF CURRENT AND RECALLED INFORMATION
    ESKANDAR, EN
    RICHMOND, BJ
    OPTICAN, LM
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 1991, 32 (04) : 1036 - 1036
  • [29] Explainable machine learning framework for cataracts recognition using visual features
    Wu, Xiao
    Hu, Lingxi
    Xiao, Zunjie
    Zhang, Xiaoqing
    Higashita, Risa
    Liu, Jiang
    VISUAL COMPUTING FOR INDUSTRY BIOMEDICINE AND ART, 2025, 8 (01)
  • [30] SMaTE: A Segment-Level Feature Mixing and Temporal Encoding Framework for Facial Expression Recognition
    Kim, Nayeon
    Cho, Sukhee
    Bae, Byungjun
    SENSORS, 2022, 22 (15)