Relational recurrent neural networks for polyphonic sound event detection

被引:0
|
作者
Junbo Ma
Ruili Wang
Wanting Ji
Hao Zheng
En Zhu
Jianping Yin
机构
[1] Massey University,School of Computer
[2] National University of Defense Technology,College of information engineering
[3] Zhejiang Gongshang University,School of Computer
[4] Nanjing Xiaozhuang University,undefined
[5] National University of Defense Technology,undefined
[6] Dongguan University of Technology,undefined
来源
关键词
Internet of Things; smart environment; deep neural networks; recurrent neural networks; sound event detection;
D O I
暂无
中图分类号
学科分类号
摘要
A smart environment is one of the application scenarios of the Internet of Things (IoT). In order to provide a ubiquitous smart environment for humans, a variety of technologies are developed. In a smart environment system, sound event detection is one of the fundamental technologies, which can automatically sense sound changes in the environment and detect sound events that cause changes. In this paper, we propose the use of Relational Recurrent Neural Network (RRNN) for polyphonic sound event detection, called RRNN-SED, which utilized the strength of RRNN in long-term temporal context extraction and relational reasoning across a polyphonic sound signal. Different from previous sound event detection methods, which rely heavily on convolutional neural networks or recurrent neural networks, the proposed RRNN-SED method can solve long-lasting and overlapping problems in polyphonic sound event detection. Specifically, since the historical information memorized inside RRNNs is capable of interacting with each other across a polyphonic sound signal, the proposed RRNN-SED method is effective and efficient in extracting temporal context information and reasoning the unique relational characteristic of the target sound events. Experimental results on two public datasets show that the proposed method achieved better sound event detection results in terms of segment-based F-score and segment-based error rate.
引用
收藏
页码:29509 / 29527
页数:18
相关论文
共 50 条
  • [31] POLYPHONIC SOUND EVENT AND SOUND ACTIVITY DETECTION: A MULTI-TASK APPROACH
    Pankajakshan, Arjun
    Bear, Helen L.
    Benetos, Emmanouil
    2019 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2019, : 323 - 327
  • [32] A SEQUENCE MATCHING NETWORK FOR POLYPHONIC SOUND EVENT LOCALIZATION AND DETECTION
    Thi Ngoc Tho Nguyen
    Jones, Douglas L.
    Gan, Woon-Seng
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 71 - 75
  • [33] Complex Activity Recognition Using Polyphonic Sound Event Detection
    Kang, Jaewoong
    Kim, Jooyeong
    Kim, Kunyoung
    Sohn, Mye
    INNOVATIVE MOBILE AND INTERNET SERVICES IN UBIQUITOUS COMPUTING, IMIS-2018, 2019, 773 : 675 - 684
  • [34] AN IMPROVED EVENT-INDEPENDENT NETWORK FOR POLYPHONIC SOUND EVENT LOCALIZATION AND DETECTION
    Gao, Yin
    Iqbal, Turab
    Kong, Qiuqiang
    An, Fengyan
    Wang, Wenwu
    Plumbley, Mark D.
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 885 - 889
  • [35] POLYPHONIC PIANO NOTE TRANSCRIPTION WITH RECURRENT NEURAL NETWORKS
    Boeck, Sebastian
    Schedl, Markus
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 121 - 124
  • [36] MULTI-SCALE RECURRENT NEURAL NETWORK FOR SOUND EVENT DETECTION
    Lu, Rui
    Duan, Zhiyao
    Zhang, Changshui
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 131 - 135
  • [37] Dataset for polyphonic sound event detection tasks in urban soundscapes: The synthetic polyphonic ambient sound source (SPASS) dataset
    Viveros-Munoz, Rhoddy
    Huijse, Pablo
    Vargas, Victor
    Espejo, Diego
    Poblete, Victor
    Arenas, Jorge P.
    Vernier, Matthieu
    Vergara, Diego
    Suarez, Enrique
    DATA IN BRIEF, 2023, 50
  • [38] Weakly Labeled Semi-Supervised Sound Event Detection Based on Convolutional Independent Recurrent Neural Networks
    Yu, Changgeng
    Yang, Dewang
    Liu, Xuanyu
    OPTICAL MEMORY AND NEURAL NETWORKS, 2022, 31 (03) : 266 - 276
  • [39] Weakly Labeled Semi-Supervised Sound Event Detection Based on Convolutional Independent Recurrent Neural Networks
    Dewang Changgeng Yu
    Xuanyu Yang
    Optical Memory and Neural Networks, 2022, 31 : 266 - 276
  • [40] Fully Convolutional Dense Net based polyphonic sound event detection
    Zhe, He
    Ying, Li
    2018 INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, BIG DATA AND BLOCKCHAIN (ICCBB 2018), 2018, : 191 - 196