Relational recurrent neural networks for polyphonic sound event detection

被引:0
|
作者
Junbo Ma
Ruili Wang
Wanting Ji
Hao Zheng
En Zhu
Jianping Yin
机构
[1] Massey University,School of Computer
[2] National University of Defense Technology,College of information engineering
[3] Zhejiang Gongshang University,School of Computer
[4] Nanjing Xiaozhuang University,undefined
[5] National University of Defense Technology,undefined
[6] Dongguan University of Technology,undefined
来源
关键词
Internet of Things; smart environment; deep neural networks; recurrent neural networks; sound event detection;
D O I
暂无
中图分类号
学科分类号
摘要
A smart environment is one of the application scenarios of the Internet of Things (IoT). In order to provide a ubiquitous smart environment for humans, a variety of technologies are developed. In a smart environment system, sound event detection is one of the fundamental technologies, which can automatically sense sound changes in the environment and detect sound events that cause changes. In this paper, we propose the use of Relational Recurrent Neural Network (RRNN) for polyphonic sound event detection, called RRNN-SED, which utilized the strength of RRNN in long-term temporal context extraction and relational reasoning across a polyphonic sound signal. Different from previous sound event detection methods, which rely heavily on convolutional neural networks or recurrent neural networks, the proposed RRNN-SED method can solve long-lasting and overlapping problems in polyphonic sound event detection. Specifically, since the historical information memorized inside RRNNs is capable of interacting with each other across a polyphonic sound signal, the proposed RRNN-SED method is effective and efficient in extracting temporal context information and reasoning the unique relational characteristic of the target sound events. Experimental results on two public datasets show that the proposed method achieved better sound event detection results in terms of segment-based F-score and segment-based error rate.
引用
收藏
页码:29509 / 29527
页数:18
相关论文
共 50 条
  • [41] SoundDet: Polyphonic Sound Event Detection and Localization from Raw Waveform
    He, Yuhang
    Trigoni, Niki
    Markham, Andrew
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [42] EVALUATION OF POST-PROCESSING ALGORITHMS FOR POLYPHONIC SOUND EVENT DETECTION
    Cances, Leo
    Guyot, Patrice
    Pellegrini, Thomas
    2019 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2019, : 318 - 322
  • [43] SOUND EVENT DETECTION USING SPATIAL FEATURES AND CONVOLUTIONAL RECURRENT NEURAL NETWORK
    Adavanne, Sharath
    Pertila, Pasi
    Virtanen, Tuomas
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 771 - 775
  • [44] DRUM TRANSCRIPTION FROM POLYPHONIC MUSIC WITH RECURRENT NEURAL NETWORKS
    Vogl, Richard
    Dorfer, Matthias
    Knees, Peter
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 201 - 205
  • [45] Sound Event Detection Using Derivative Features in Deep Neural Networks
    Kwak, Jin-Yeol
    Chung, Yong-Joo
    APPLIED SCIENCES-BASEL, 2020, 10 (14):
  • [46] SOUND EVENT DETECTION BY CONSISTENCY TRAINING AND PSEUDO-LABELING WITH FEATURE-PYRAMID CONVOLUTIONAL RECURRENT NEURAL NETWORKS
    Koh, Chih-Yuan
    Chen, You-Siang
    Liu, Yi-Wen
    Bai, Mingsian R.
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 376 - 380
  • [47] Sound event localization and detection using element-wise attention gate and asymmetric convolutional recurrent neural networks
    Yan, Lean
    Guo, Min
    Li, Zhiqiang
    AI COMMUNICATIONS, 2023, 36 (02) : 147 - 157
  • [48] Chinese Event Detection Combining BERT Model with Recurrent Neural Networks
    Zhang Wei
    Wang Yongli
    2020 5TH INTERNATIONAL CONFERENCE ON MECHANICAL, CONTROL AND COMPUTER ENGINEERING (ICMCCE 2020), 2020, : 1625 - 1629
  • [49] RED: Deep Recurrent Neural Networks for Sleep EEG Event Detection
    Tapia, Nicolas, I
    Estevez, Pablo A.
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [50] Event Nugget Detection with Forward-Backward Recurrent Neural Networks
    Ghaeini, Reza
    Fern, Xiaoli Z.
    Huang, Liang
    Tadepalli, Prasad
    PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2016), VOL 2, 2016, : 369 - 373