Improving sound event detection through enhanced feature extraction and attention mechanisms

Cited by: 0
Authors
Dongping Zhang [1]
Siyi Wu [1]
Zhanhong Lu [2]
Zhehao Zhang [3]
Haimiao Hu [4]
Jiabin Yu [1]
Affiliations
[1] China Jiliang University, College of Information Engineering
[2] Hangzhou Hikvision Digital Technology Co., Ltd, Hangzhou Innovation Institute
[3] Hangzhou Aihua Intelligent Technology Co., Ltd
[4] Beihang University
DOI
10.1007/s11704-025-41108-7
Related papers
50 items in total
  • [31] Enhanced mechanisms of pooling and channel attention for deep learning feature maps
    Li H.
    Yue X.
    Meng L.
    PeerJ Computer Science, 2022, 8
  • [32] Sound Event Detection: A Journey Through DCASE Challenge Series
    Khandelwal, Tanmay
    Das, Rohan Kumar
    Chng, Eng Siong
    APSIPA TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING, 2024, 13 (01) : 1 - 63
  • [33] Joining Sound Event Detection and Localization Through Spatial Segregation
    Trowitzsch, Ivo
    Schymura, Christopher
    Kolossa, Dorothea
    Obermayer, Klaus
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 487 - 502
  • [34] CNN-TRANSFORMER WITH SELF-ATTENTION NETWORK FOR SOUND EVENT DETECTION
    Wakayama, Keigo
    Saito, Shoichiro
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022 : 806 - 810
  • [35] Polyphonic sound event localization and detection based on Multiple Attention Fusion ResNet
    Zhang S.
    Zhang Y.
    Liao Y.
    Pang K.
    Wan Z.
    Zhou S.
    Mathematical Biosciences and Engineering, 2024, 21 (02) : 2004 - 2023
  • [37] Sound Event Localization and Detection Using Parallel Multi-attention Enhancement
    Chen, Zhengyu
    Huang, Qinghua
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2024, 43 (01) : 545 - 567
  • [38] A capsule network with pixel-based attention and BGRU for sound event detection
    Meng, Jiaxiang
    Wang, Xingmei
    Wang, Jinli
    Teng, Xuyang
    Xu, Yuezhu
    DIGITAL SIGNAL PROCESSING, 2022, 123
  • [39] SPARSE SELF-ATTENTION FOR SEMI-SUPERVISED SOUND EVENT DETECTION
    Guan, Yadong
    Xue, Jiabin
    Zheng, Guibin
    Han, Jiqing
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022 : 821 - 825
  • [40] A REGION BASED ATTENTION METHOD FOR WEAKLY SUPERVISED SOUND EVENT DETECTION AND CLASSIFICATION
    Yan, Jie
    Song, Yan
    Guo, Wu
    Dai, Li-Rong
    McLoughlin, Ian
    Chen, Liang
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019 : 755 - 759