Sound Event Localization and Detection Using Convolutional Recurrent Neural Networks and Gated Linear Units

被引:0
|
作者
Komatsu, Tatsuya [1 ]
Togami, Masahito [1 ]
Takahashi, Tsubasa [1 ]
机构
[1] Line Corp, Res Labs, Tokyo, Japan
关键词
Sound Event Localization and Detection; Recurrent Convolutional Neural Network; Gated Linear Unit; SURVEILLANCE;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper proposes a sound event localization and detection (SELD) method using a convolutional recurrent neural network (CRNN) with gated linear units (GLUs). The proposed method introduces to employ GLUs with convolutional neural network (CNN) layers of the CRNN to extract adequate spectral features from amplitude and phase spectra. When the CNNs extract features of high-dimensional dependencies of frequency bins, the GLUs weight the extracted features based on the importance of the bins, like attention mechanism. Extracted features from bins where sounds are absent, which is not informative and degrade the SELD performance, are weighted to 0 and ignored by GLUs. Only the features extracted from informative bins are used for the CNN output for better SELD performance. Obtained CNN outputs are fed to consecutive bidirectional gated recurrent units (GRUs), which capture temporal information. Finally, the GRU output are shared by two task-specific layers, which are sound event detection (SED) layers and direction of arrival (DoA) estimation layers, to obtain SELD results. Evaluation results using the TAU Spatial Sound Events 2019 - Ambisonic dataset show the effectiveness of GLUs in the proposed method, and it improves SELD performance up to 0:10 in F1-score, 0.15 in error rate, 16.4 degrees in DoA estimation error comparing to a CRNN baseline method.
引用
收藏
页码:41 / 45
页数:5
相关论文
共 50 条
  • [1] Sound Event Localization and Detection of Overlapping Sources Using Convolutional Recurrent Neural Networks
    Adavanne, Sharath
    Politis, Archontis
    Nikunen, Joonas
    Virtanen, Tuomas
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2019, 13 (01) : 34 - 48
  • [2] Convolutional Recurrent Neural Networks for Polyphonic Sound Event Detection
    Cakir, Emre
    Parascandolo, Giambattista
    Heittola, Toni
    Huttunen, Heikki
    Virtanen, Tuomas
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (06) : 1291 - 1303
  • [3] SOUND EVENT DETECTION VIA DILATED CONVOLUTIONAL RECURRENT NEURAL NETWORKS
    Li, Yanxiong
    Liu, Mingle
    Drossos, Konstantinos
    Virtanen, Tuomas
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 286 - 290
  • [4] Sound event localization and detection using element-wise attention gate and asymmetric convolutional recurrent neural networks
    Yan, Lean
    Guo, Min
    Li, Zhiqiang
    [J]. AI COMMUNICATIONS, 2023, 36 (02) : 147 - 157
  • [5] Review Helpfulness Prediction Using Convolutional Neural Networks and Gated Recurrent Units
    Basiri, Mohammad Ehsan
    Habibi, Shirin
    [J]. 2020 6TH INTERNATIONAL CONFERENCE ON WEB RESEARCH (ICWR), 2020, : 191 - 196
  • [6] POLYPHONIC SOUND EVENT DETECTION USING TRANSPOSED CONVOLUTIONAL RECURRENT NEURAL NETWORK
    Chatterjee, Chandra Churh
    Mulimani, Manjunath
    Koolagudi, Shashidhar G.
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 661 - 665
  • [7] SOUND EVENT DETECTION USING SPATIAL FEATURES AND CONVOLUTIONAL RECURRENT NEURAL NETWORK
    Adavanne, Sharath
    Pertila, Pasi
    Virtanen, Tuomas
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 771 - 775
  • [8] Dysarthria Speech Detection Using Convolutional Neural Networks with Gated Recurrent Unit
    Shih, Dong-Her
    Liao, Ching-Hsien
    Wu, Ting-Wei
    Xu, Xiao-Yin
    Shih, Ming-Hung
    [J]. HEALTHCARE, 2022, 10 (10)
  • [9] Technical Sound Event Classification Applying Recurrent and Convolutional Neural Networks
    Rieder, Constantin
    Germann, Markus
    Mezger, Samuel
    Scherer, Klaus
    [J]. PROCEEDINGS OF THE 1ST INTERNATIONAL CONFERENCE ON DEEP LEARNING THEORY AND APPLICATIONS (DELTA), 2020, : 84 - 88
  • [10] Fault Detection and Localization in Distributed Systems Using Recurrent Convolutional Neural Networks
    Qi, Guangyang
    Yao, Lina
    Uzunov, Anton V.
    [J]. ADVANCED DATA MINING AND APPLICATIONS, ADMA 2017, 2017, 10604 : 33 - 48