Environment sound classification using an attention-based residual neural network

被引:30
|
作者
Tripathi, Achyut Mani [1 ]
Mishra, Aakansha [1 ]
机构
[1] Indian Inst Technol, Dept Comp Sci & Engn, Gauhati 781039, Assam, India
关键词
Attention mechanism; Convolutional neural network; Explainable; Environmental sound classification; Residual network; TEMPORAL RELATIONS; RECOGNITION;
D O I
10.1016/j.neucom.2021.06.031
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Complexity of environmental sounds impose numerous challenges for their classification. The performance of Environmental Sound Classification (ESC) depends greatly on how good the feature extraction technique employed to extract generic and prototypical features from a sound is. The presence of silent and semantically irrelevant frames is ubiquitous during the classification of environmental sounds. To deal with such issues that persist in environmental sound classification, we introduce a novel attention-based deep model that supports focusing on semantically relevant frames. The proposed attention guided deep model efficiently learns spatio-temporal relationships that exist in the spectrogram of a signal. The efficacy of the proposed method is evaluated on two widely used Environmental Sound Classification datasets: ESC-10 and DCASE 2019 Task-1(A) datasets. The experiments performed and their results demonstrate that the proposed method yields comparable performance to state-of-the-art techniques. We obtained improvements of 11.50% and 19.50% in accuracy as compared to the accuracy of the baseline models of the ESC-10 and DCASE 2019 Task-1(A) datasets respectively. To support the attention outcomes that have focused on relevant regions, visual analysis of the attention feature map has also been presented. The resultant attention feature map conveys that the model focuses only on the spectrogram's semantically relevant regions while skipping the irrelevant regions. (c) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页码:409 / 423
页数:15
相关论文
共 50 条
  • [1] Attention-Based Domain Adaptation Using Residual Network for Hyperspectral Image Classification
    Mdrafi, Robiulhossain
    Du, Qian
    Gurbuz, Ali Cafer
    Tang, Bo
    Ma, Li
    Younan, Nicolas H.
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2020, 13 : 6424 - 6433
  • [2] HEART SOUND CLASSIFICATION USING RESIDUAL NEURAL NETWORK AND CONVOLUTION BLOCK ATTENTION MODULE
    Frimpong, Enoch Adjei
    Qin Zhiguang
    Kwadwo, Tenagyei Edwin
    Rutherford, Patamia Agbeshi
    Baagyere, Edward Y.
    Turkson, Regina Esi
    2022 19TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2022,
  • [3] Environment Sound Classification using Multiple Feature Channels and Attention based Deep Convolutional Neural Network
    Sharma, Jivitesh
    Granmo, Ole-Christoffer
    Goodwin, Morten
    INTERSPEECH 2020, 2020, : 1186 - 1190
  • [4] Handwritten/Printed Receipt Classification using Attention-Based Convolutional Neural Network
    Yang, Fan
    Jin, Lianwen
    Yang, Weixin
    Feng, Ziyong
    Zhang, Shuye
    PROCEEDINGS OF 2016 15TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2016, : 384 - 389
  • [5] Deep attention-based neural networks for explainable heart sound classification
    Ren, Zhao
    Qian, Kun
    Dong, Fengquan
    Dai, Zhenyu
    Nejdl, Wolfgang
    Yamamoto, Yoshiharu
    Schuller, Bjoern W.
    MACHINE LEARNING WITH APPLICATIONS, 2022, 9
  • [6] Attention-based Sound Classification Pipeline with Sound Spectrum
    Tan, Ki In
    Yean, Seanglidet
    Lee, Bu Sung
    2023 IEEE SENSORS APPLICATIONS SYMPOSIUM, SAS, 2023,
  • [7] Attention-Based Convolutional Neural Network for Earthquake Event Classification
    Ku, Bonhwa
    Kim, Gwantae
    Ahn, Jae-Kwang
    Lee, Jimin
    Ko, Hanseok
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2021, 18 (12) : 2057 - 2061
  • [8] Attention-Based Recurrent Neural Network for Plant Disease Classification
    Lee, Sue Han
    Goeau, Herve
    Bonnet, Pierre
    Joly, Alexis
    FRONTIERS IN PLANT SCIENCE, 2020, 11
  • [9] Attention-Based Hierarchical Recurrent Neural Network for Phenotype Classification
    Xu, Nan
    Shen, Yanyan
    Zhu, Yanmin
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2019, PT I, 2019, 11439 : 465 - 476
  • [10] Attention-based Neural Network for Driving Environment Complexity Perception
    Zhang, Ce
    Eskandarian, Azim
    Du, Xuelai
    2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021, : 2781 - 2787