Sound event localization and detection based on deep learning

被引:0
|
作者
Zhao, Dada [1 ,2 ]
Ding, Kai [2 ]
Qi, Xiaogang [1 ]
Chen, Yu [2 ]
Feng, Hailin [1 ]
机构
[1] Xidian Univ, Sch Math & Stat, Xian 710071, Peoples R China
[2] Sci & Technol Near Surface Detect Lab, Wuxi 214035, Peoples R China
基金
中国国家自然科学基金;
关键词
sound event localization and detection (SELD); deep learning; convolutional recursive neural network (CRNN); chan- nel attention mechanism; DATA AUGMENTATION; NEURAL-NETWORKS; SPECTRUM;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Acoustic source localization (ASL) and sound event detection (SED) are two widely pursued independent research fields. In recent years, in order to achieve a more complete spatial and temporal representation of sound field, sound event localization and detection (SELD) has become a very active research topic. This paper presents a deep learning -based multioverlapping sound event localization and detection algorithm in three-dimensional space. Log -Mel spectrum and generalized cross -correlation spectrum are joined together in channel dimension as input features. These features are classified and regressed in parallel after training by a neural network to obtain sound recognition and localization results respectively. The channel attention mechanism is also introduced in the network to selectively enhance the features containing essential information and suppress the useless features. Finally, a thourough comparison confirms the efficiency and effectiveness of the proposed SELD algorithm. Field experiments show that the proposed algorithm is robust to reverberation and environment and can achieve higher recognition and localization accuracy compared with the baseline method.
引用
收藏
页码:294 / 301
页数:8
相关论文
共 50 条
  • [1] Sound event localization and detection based on deep learning
    ZHAO Dada
    DING Kai
    QI Xiaogang
    CHEN Yu
    FENG Hailin
    Journal of Systems Engineering and Electronics, 2024, 35 (02) : 294 - 301
  • [2] Sound Event Localization and Detection Based on Deep Learning
    Zhao, Dada
    Ding, Kai
    Qi, Xiaogang
    Chen, Yu
    Feng, Hailin
    Journal of Systems Engineering and Electronics, 2024, 35 (02) : 294 - 301
  • [3] A survey of Deep Learning for Polyphonic Sound event detection
    Dang, An
    Vu, Toan H.
    Wang, Jia-Ching
    PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON ORANGE TECHNOLOGIES (ICOT), 2017, : 75 - 78
  • [4] Filterbank Learning for Deep Neural Network Based Polyphonic Sound Event Detection
    Cakir, Emre
    Ozan, Ezgi Can
    Virtanen, Tuomas
    2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 3399 - 3406
  • [5] Sound Event Localization and Detection Based on Dual Attention
    Xu, Chundong
    Liu, Hao
    Min, Yuan
    Zhen, Yadi
    Computer Engineering and Applications, 2023, 59 (19) : 99 - 105
  • [6] Sound Event Localization and Detection Based on Multiple DOA Beamforming and Multi-task Learning
    Xue, Wei
    Tong, Ying
    Zhang, Chao
    Ding, Guohong
    He, Xiaodong
    Zhou, Bowen
    INTERSPEECH 2020, 2020, : 5091 - 5095
  • [7] Rare Sound Event Detection Using Deep Learning and Data Augmentation
    Chen, Yanping
    Jin, Hongxia
    INTERSPEECH 2019, 2019, : 619 - 623
  • [8] Abnormal Event Detection Based on Deep Learning
    Wen J.
    Wang H.-J.
    Deng J.
    Liu P.-F.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2020, 48 (02): : 308 - 313
  • [9] Robot Detection and Localization Based on Deep Learning
    Luo, Sha
    Lu, Huimin
    Xiao, Junhao
    Yu, Qinghua
    Zheng, Zhiqiang
    2017 CHINESE AUTOMATION CONGRESS (CAC), 2017, : 7091 - 7095
  • [10] SSLIDE: SOUND SOURCE LOCALIZATION FOR INDOORS BASED ON DEEP LEARNING
    Wu, Yifan
    Ayyalasomayajula, Roshan
    Bianco, Michael J.
    Bharadia, Dinesh
    Gerstoft, Peter
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 4680 - 4684