Environment sound classification using an attention-based residual neural network

被引:30
|
作者
Tripathi, Achyut Mani [1 ]
Mishra, Aakansha [1 ]
机构
[1] Indian Inst Technol, Dept Comp Sci & Engn, Gauhati 781039, Assam, India
关键词
Attention mechanism; Convolutional neural network; Explainable; Environmental sound classification; Residual network; TEMPORAL RELATIONS; RECOGNITION;
D O I
10.1016/j.neucom.2021.06.031
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Complexity of environmental sounds impose numerous challenges for their classification. The performance of Environmental Sound Classification (ESC) depends greatly on how good the feature extraction technique employed to extract generic and prototypical features from a sound is. The presence of silent and semantically irrelevant frames is ubiquitous during the classification of environmental sounds. To deal with such issues that persist in environmental sound classification, we introduce a novel attention-based deep model that supports focusing on semantically relevant frames. The proposed attention guided deep model efficiently learns spatio-temporal relationships that exist in the spectrogram of a signal. The efficacy of the proposed method is evaluated on two widely used Environmental Sound Classification datasets: ESC-10 and DCASE 2019 Task-1(A) datasets. The experiments performed and their results demonstrate that the proposed method yields comparable performance to state-of-the-art techniques. We obtained improvements of 11.50% and 19.50% in accuracy as compared to the accuracy of the baseline models of the ESC-10 and DCASE 2019 Task-1(A) datasets respectively. To support the attention outcomes that have focused on relevant regions, visual analysis of the attention feature map has also been presented. The resultant attention feature map conveys that the model focuses only on the spectrogram's semantically relevant regions while skipping the irrelevant regions. (c) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页码:409 / 423
页数:15
相关论文
共 50 条
  • [21] Malware Classification Using Attention-Based Transductive Learning Network
    Deng, Liting
    Wen, Hui
    Xin, Mingfeng
    Sun, Yue
    Sun, Limin
    Zhu, Hongsong
    SECURITY AND PRIVACY IN COMMUNICATION NETWORKS (SECURECOMM 2020), PT II, 2020, 336 : 403 - 418
  • [22] MAGNET: Multi-Label Text Classification using Attention-based Graph Neural Network
    Pal, Ankit
    Selvakumar, Muru
    Sankarasubbu, Malaikannan
    ICAART: PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 2, 2020, : 494 - 505
  • [23] Elimination of stripe artifacts in light sheet fluorescence microscopy using an attention-based residual neural network
    Wei, Zechen
    Wu, Xiangjun
    Tong, Wei
    Zhang, Suhui
    Yang, Xin
    Tian, Jie
    Hui, Hui
    BIOMEDICAL OPTICS EXPRESS, 2022, 13 (03) : 1292 - 1311
  • [24] Attention Based Convolutional Neural Network with Multi-frequency Resolution Feature for Environment Sound Classification
    Minze Li
    Wu Huang
    Tao Zhang
    Neural Processing Letters, 2023, 55 : 4291 - 4306
  • [25] Attention Based Convolutional Neural Network with Multi-frequency Resolution Feature for Environment Sound Classification
    Li, Minze
    Huang, Wu
    Zhang, Tao
    NEURAL PROCESSING LETTERS, 2023, 55 (04) : 4291 - 4306
  • [26] Multivariate Time Series Classification With An Attention-Based Multivariate Convolutional Neural Network
    Tripathi, Achyut Mani
    Baruah, Rashmi Dutta
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [27] A Novel Attention-based Neural Network for Video Scene Classification in Complex Background
    Fu, Yan
    Xin, Ru
    Ye, Ou
    PROCEEDINGS OF THE 32ND INTERNATIONAL CONFERENCE ON COMPUTER ANIMATION AND SOCIAL AGENTS (CASA 2019), 2019, : 85 - 88
  • [28] Convolution- and Attention-Based Neural Network for Automated Sleep Stage Classification
    Zhu, Tianqi
    Luo, Wei
    Yu, Feng
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2020, 17 (11) : 1 - 13
  • [29] Attention based convolutional recurrent neural network for environmental sound classification
    Zhang, Zhichao
    Xu, Shugong
    Zhang, Shunqing
    Qiao, Tianhao
    Cao, Shan
    NEUROCOMPUTING, 2021, 453 (453) : 896 - 903
  • [30] Environmental Sound Classification Based on Attention Feature Fusion and Improved Residual Network
    Liu, Yuxing
    Wang, Mengjiao
    Zhang, Xin
    AUTOMATIC CONTROL AND COMPUTER SCIENCES, 2023, 57 (04) : 371 - 379