Dual-branch attention module-based network with parameter sharing for joint sound event detection and localization

被引:2
|
作者
Zhou, Yuting [1 ]
Wan, Hongjie [1 ]
机构
[1] Beijing Univ Chem Technol, Informat Engn Dept, 15 North Third Ring Rd East, Beijing 100029, Peoples R China
关键词
Sound event detection and localization; Conformer; Attention mechanism; Multi-task learning; Soft parameter sharing;
D O I
10.1186/s13636-023-00292-9
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The goal of sound event detection and localization (SELD) is to identify each individual sound event class and its activity time from a piece of audio, while estimating its spatial location at the time of activity. Conformer combines the advantages of convolutional layers and Transformer, which is effective in tasks such as speech recognition. However, it achieves high performance relying on complex network structure and a large number of computations. In the SELD task of this paper, we propose to use an encoder with a simpler network structure, called the dual-branch attention module (DBAM). The module is improved based on the conformer using two parallel branches of attention and convolution, which can model both global and local contextual information. We also blend low-level and high-level features of the localization task. In addition, we add soft parameter sharing to the joint SELD network, which can efficiently exploit the potential relationship between the two subtasks, SED and DOA. The proposed method can effectively detect two sound events with overlapping occurrence in the same time period. We experimented with the open dataset DCASE 2020 task 3 proving that the proposed method achieves better SELD performance than the baseline model. Furthermore, we conducted ablation experiments for verifying the effectiveness of the dual-branch attention module and soft parameter sharing.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Dual-branch attention module-based network with parameter sharing for joint sound event detection and localization
    Yuting Zhou
    Hongjie Wan
    EURASIP Journal on Audio, Speech, and Music Processing, 2023
  • [2] A Method Based on Dual Cross-Modal Attention and Parameter Sharing for Polyphonic Sound Event Localization and Detection
    Lee, Sang-Hoon
    Hwang, Jung-Wook
    Song, Min-Hwan
    Park, Hyung-Min
    APPLIED SCIENCES-BASEL, 2022, 12 (10):
  • [3] Sound Event Localization and Detection Based on Dual Attention
    Xu, Chundong
    Liu, Hao
    Min, Yuan
    Zhen, Yadi
    Computer Engineering and Applications, 2023, 59 (19) : 99 - 105
  • [4] A dual-branch joint learning network for underwater object detection
    Wang, Bowen
    Wang, Zhi
    Guo, Wenhui
    Wang, Yanjiang
    KNOWLEDGE-BASED SYSTEMS, 2024, 293
  • [5] An attention-based RGBD dual-branch gesture recognition network
    Chen, Bo
    Xie, Pengwei
    Hao, Nan
    2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 8022 - 8027
  • [6] Hyperspectral unmixing method based on dual-branch multiscale residual attention network
    Chen, Congping
    Xu, Zhiwei
    Lu, Peng
    Cao, Nuo
    OPTICAL ENGINEERING, 2023, 62 (09)
  • [7] Hyperspectral Image Classification Based on Dual-Branch Spectral Multiscale Attention Network
    Shi, Cuiping
    Liao, Diling
    Xiong, Yi
    Zhang, Tianyu
    Wang, Liguo
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 : 10450 - 10467
  • [8] Text Detection on Industrial Barrel Label with Convolutional Attention and Dual-Branch Feature Network
    School of Computer Science and Technology, Changchun University of Science and Technology, Changchun, Jilin
    130022, China
    IEEJ Trans. Electr. Electron. Eng.,
  • [9] Text Detection on Industrial Barrel Label with Convolutional Attention and Dual-Branch Feature Network
    Wang, Ling
    Zhang, Jing
    Wang, Peng
    Bai, Yane
    IEEJ TRANSACTIONS ON ELECTRICAL AND ELECTRONIC ENGINEERING, 2025, 20 (04) : 526 - 536
  • [10] Improving Deep Subdomain Adaptation by Dual-Branch Network Embedding Attention Module for SAR Ship Classification
    Zhao, Shuangmei
    Lang, Haitao
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 : 8038 - 8048