A MULTI-CHANNEL TEMPORAL ATTENTION CONVOLUTIONAL NEURAL NETWORK MODEL FOR ENVIRONMENTAL SOUND CLASSIFICATION

被引:24
|
作者
Wang, You [1 ]
Feng, Chuyao [1 ]
Anderson, David, V [1 ]
机构
[1] Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA 30332 USA
关键词
Environmental sound classification; convolutional neural network; temporal attention; multi-channel;
D O I
10.1109/ICASSP39728.2021.9413498
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Recently, many attention-based deep neural networks have emerged and achieved state-of-the-art performance in environmental sound classification. The essence of attention mechanism is assigning contribution weights on different parts of features, namely channels, spectral or spatial contents, and temporal frames. In this paper, we propose an effective convolutional neural network structure with a multi-channel temporal attention (MCTA) block, which applies a temporal attention mechanism within each channel of the embedded features to extract channel-wise relevant temporal information. This multi-channel temporal attention structure will result in a distinct attention vector for each channel, which enables the network to fully exploit the relevant temporal information in different channels. The datasets used to test our model include ESC-50 and its subset ESC-10, along with development sets of DCASE 2018 and 2019. In our experiments, MCTA performed better than the single-channel temporal attention model and the non-attention model with the same number of parameters. Furthermore, we compared our model with some successful attention-based models and obtained competitive results with a relatively lighter network.
引用
收藏
页码:930 / 934
页数:5
相关论文
共 50 条
  • [1] A multi-channel attention graph convolutional neural network for node classification
    Zhai, Rui
    Zhang, Libo
    Wang, Yingqi
    Song, Yalin
    Yu, Junyang
    [J]. JOURNAL OF SUPERCOMPUTING, 2023, 79 (04): : 3561 - 3579
  • [2] A multi-channel attention graph convolutional neural network for node classification
    Rui Zhai
    Libo Zhang
    Yingqi Wang
    Yalin Song
    Junyang Yu
    [J]. The Journal of Supercomputing, 2023, 79 : 3561 - 3579
  • [3] Multi-channel Convolutional Neural Networks with Multi-level Feature Fusion for Environmental Sound Classification
    Chong, Dading
    Zou, Yuexian
    Wang, Wenwu
    [J]. MULTIMEDIA MODELING, MMM 2019, PT II, 2019, 11296 : 157 - 168
  • [4] Multi-Channel Embedding Convolutional Neural Network Model for Arabic Sentiment Classification
    Dahou, Abdelghani
    Xiong, Shengwu
    Zhou, Junwei
    Abd Elaziz, Mohamed
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2019, 18 (04)
  • [5] Environmental sound classification using temporal-frequency attention based convolutional neural network
    Wenjie Mu
    Bo Yin
    Xianqing Huang
    Jiali Xu
    Zehua Du
    [J]. Scientific Reports, 11
  • [6] Environmental sound classification using temporal-frequency attention based convolutional neural network
    Mu, Wenjie
    Yin, Bo
    Huang, Xianqing
    Xu, Jiali
    Du, Zehua
    [J]. SCIENTIFIC REPORTS, 2021, 11 (01)
  • [7] Multi-channel lung sound classification with convolutional recurrent neural networks
    Messner, Elmar
    Fediuk, Melanie
    Swatek, Paul
    Scheidl, Stefan
    Smolle-Juettner, Freyja-Maria
    Olschewski, Horst
    Pernkopf, Franz
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2020, 122
  • [8] Multi-channel Convolutional Neural Network for Precise Meme Classification
    Sherratt, Victoria
    Pimbblet, Kevin
    Dethlefs, Nina
    [J]. PROCEEDINGS OF THE 2023 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2023, 2023, : 190 - 198
  • [9] Attention based convolutional recurrent neural network for environmental sound classification
    Zhang, Zhichao
    Xu, Shugong
    Zhang, Shunqing
    Qiao, Tianhao
    Cao, Shan
    [J]. NEUROCOMPUTING, 2021, 453 (453) : 896 - 903
  • [10] Classification of Hyperspectral Data Using a Multi-Channel Convolutional Neural Network
    Chen, Chen
    Zhang, Jing-Jing
    Zheng, Chun-Hou
    Yan, Qing
    Xun, Li-Na
    [J]. INTELLIGENT COMPUTING METHODOLOGIES, ICIC 2018, PT III, 2018, 10956 : 81 - 92