A Spiking Neural Network Framework for Robust Sound Classification

被引:75
|
作者
Wu, Jibin [1 ]
Chua, Yansong [2 ]
Zhang, Malu [1 ]
Li, Haizhou [1 ,2 ]
Tan, Kay Chen [3 ]
机构
[1] Natl Univ Singapore, Dept Elect & Comp Engn, Singapore, Singapore
[2] ASTAR, Inst Infocomm Res, Singapore, Singapore
[3] City Univ Hong Kong, Dept Comp Sci, Kowloon Tong, Hong Kong, Peoples R China
关键词
spiking neural network; self-organizing map; automatic sound classification; maximum-margin Tempotron classifier; noise robust multi-condition training; HUMAN AUDITORY-CORTEX; EVENT CLASSIFICATION; RECOGNITION; NOISE;
D O I
10.3389/fnins.2018.00836
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
Environmental sounds form part of our daily life. With the advancement of deep learning models and the abundance of training data, the performance of automatic sound classification (ASC) systems has improved significantly in recent years. However, the high computational cost, hence high power consumption, remains a major hurdle for large-scale implementation of ASC systems on mobile and wearable devices. Motivated by the observations that humans are highly effective and consume little power whilst analyzing complex audio scenes, we propose a biologically plausible ASC framework, namely SOM-SNN. This framework uses the unsupervised self-organizing map (SOM) for representing frequency contents embedded within the acoustic signals, followed by an event-based spiking neural network (SNN) for spatiotemporal spiking pattern classification. We report experimental results on the RWCP environmental sound and TIDIGITS spoken digits datasets, which demonstrate competitive classification accuracies over other deep learning and SNN-based models. The SOM-SNN framework is also shown to be highly robust to corrupting noise after multi-condition training, whereby the model is trained with noise-corrupted sound samples. Moreover, we discover the early decision making capability of the proposed framework: an accurate classification can be made with an only partial presentation of the input.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Development of a Library for Sound Classification Using Spiking Neural Network
    Tomas Molas, Jose
    Peralta, Ivan
    Martinez, Cesar
    Leonardo Rufiner, Hugo
    [J]. VI LATIN AMERICAN CONGRESS ON BIOMEDICAL ENGINEERING (CLAIB 2014), 2014, 49 : 651 - 654
  • [2] A Spiking Neural Network with Distributed Keypoint Encoding for Robust Sound Recognition
    Yao, Yanli
    Yu, Qiang
    Wang, Longbiao
    Dang, Jianwu
    [J]. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [3] COMBINING ROBUST SPIKE CODING WITH SPIKING NEURAL NETWORKS FOR SOUND EVENT CLASSIFICATION
    Dennis, Jonathan
    Tran Huy Dat
    Li, Haizhou
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 176 - 180
  • [4] Noise robust sound event classification with convolutional neural network
    Ozer, Ilyas
    Ozer, Zeynep
    Findik, Oguz
    [J]. NEUROCOMPUTING, 2018, 272 : 505 - 512
  • [5] Spiking Echo State Convolutional Neural Network for Robust Time Series Classification
    Zhang, Anguo
    Zhu, Wei
    Li, Junyu
    [J]. IEEE ACCESS, 2019, 7 : 4927 - 4935
  • [6] A Spiking Neural Network Model for Sound Recognition
    Xiao, Rong
    Yan, Rui
    Tang, Huajin
    Tan, Kay Chen
    [J]. COGNITIVE SYSTEMS AND SIGNAL PROCESSING, ICCSIP 2016, 2017, 710 : 584 - 594
  • [7] On the use of spiking neural network for EEG classification
    Goel, Piyush
    Liu, Honghai
    Brown, David
    Datta, Avijit
    [J]. INTERNATIONAL JOURNAL OF KNOWLEDGE-BASED AND INTELLIGENT ENGINEERING SYSTEMS, 2008, 12 (04) : 295 - 304
  • [8] Sound classification and function approximation using spiking neural networks
    Amin, HH
    Fujii, RH
    [J]. ADVANCES IN INTELLIGENT COMPUTING, PT 1, PROCEEDINGS, 2005, 3644 : 621 - 630
  • [9] Robust technique for environmental sound classification using convolutional recurrent neural network
    Anam Bansal
    Naresh Kumar Garg
    [J]. Multimedia Tools and Applications, 2024, 83 : 54755 - 54772
  • [10] Convolutional neural network based traffic sound classification robust to environmental noise
    Lee, Jaejun
    Kim, Wansoo
    Lee, Kyogu
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2018, 37 (06): : 469 - 474