HIGH-RESOLUTION ATTENTION NETWORK WITH ACOUSTIC SEGMENT MODEL FOR ACOUSTIC SCENE CLASSIFICATION

被引:0
|
作者
Bai, Xue [1 ]
Du, Jun [1 ]
Pan, Jia [1 ]
Zhou, Heng-shun [1 ]
Tu, Yan-Hui [1 ]
Lee, Chin-Hui [2 ]
机构
[1] Univ Sci & Technol China, Hefei, Peoples R China
[2] Georgia Inst Technol, Atlanta, GA 30332 USA
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Acoustic scene classification; attention mechanism; acoustic segment model; fully convolutional neural network;
D O I
10.1109/icassp40776.2020.9053519
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The spectral information of acoustic scenes is diverse and complex, which poses challenges for acoustic scene tasks. To improve the classification performance, a variety of convolutional neural networks (CNNs) are proposed to extract richer semantic information of scene utterances. However, the different regions of the features extracted from CNN-based encoder have different importance. In this paper, we propose a novel strategy for acoustic scene classification, namely high-resolution attention network with acoustic segment model (HRAN-ASM). In this approach, we utilize fully CNN to obtain high-level semantic information and then adopt two-stage attention strategy to select the relevant acoustic scene segments. Besides, the acoustic segment model (ASM) proposed in our recent work provides embedding vectors for this attention mechanism. The performance is evaluated on DCASE 2018 Task 1a, showing 70:5% good classification accuracy under single system and no data expansion, which is superior to CNN-based self-attention mechanism and highly competitive.
引用
收藏
页码:656 / 660
页数:5
相关论文
共 50 条
  • [1] Deep Segment Model for Acoustic Scene Classification
    Wang, Yajian
    Du, Jun
    Chen, Hang
    Wang, Qing
    Lee, Chin-Hui
    [J]. INTERSPEECH 2022, 2022, : 4177 - 4181
  • [2] Deep mutual attention network for acoustic scene classification
    Xie, Wei
    He, Qianhua
    Yu, Zitong
    Li, Yanxiong
    [J]. Digital Signal Processing: A Review Journal, 2022, 123
  • [3] Deep mutual attention network for acoustic scene classification
    Xie, Wei
    He, Qianhua
    Yu, Zitong
    Li, Yanxiong
    [J]. DIGITAL SIGNAL PROCESSING, 2022, 123
  • [4] An Acoustic Segment Model Based Segment Unit Selection Approach to Acoustic Scene Classification with Partial Utterances
    Hu, Hu
    Siniscalchi, Sabato Marco
    Wang, Yannan
    Bai, Xue
    Du, Jun
    Lee, Chin-Hui
    [J]. INTERSPEECH 2020, 2020, : 1201 - 1205
  • [5] Wavelet Attention ResNeXt Network for High-resolution Remote Sensing Scene Classification
    Song, Wanying
    Cong, Yifan
    Zhang, Yingying
    Zhang, Shiru
    [J]. 2022 17TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV), 2022, : 330 - 333
  • [6] An Investigation of High-Resolution Modeling Units of Deep Neural Networks for Acoustic Scene Classification
    Bao, Xiao
    Gao, Tian
    Du, Jun
    Dai, Li-Rong
    [J]. 2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 3028 - 3035
  • [7] Attention based Residual Network for High-Resolution Remote Sensing Imagery Scene Classification
    Fan, Runyu
    Wang, Lizhe
    Feng, Ruyi
    Zhu, Yingqian
    [J]. 2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 1346 - 1349
  • [8] High Performance Neural Network based Acoustic Scene Classification
    Prakruthi, U. S.
    Kiran, Divya
    Ramasangu, Hariharan
    [J]. PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON INVENTIVE SYSTEMS AND CONTROL (ICISC 2018), 2018, : 781 - 784
  • [9] An Integrated Convolutional Neural Network with a Fusion Attention Mechanism for Acoustic Scene Classification
    Jiang, Pengxu
    Xie, Yue
    Zou, Cairong
    Zhao, Li
    Wang, Qingyun
    [J]. IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2023, E106A (08) : 1057 - 1061
  • [10] SE-HRNET: A DEEP HIGH-RESOLUTION NETWORK WITH ATTENTION FOR REMOTE SENSING SCENE CLASSIFICATION
    Li, Lingling
    Tian, Tian
    Li, Hang
    Wang, Lizhe
    [J]. IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 533 - 536