Relational recurrent neural networks for polyphonic sound event detection

被引:0
|
作者
Junbo Ma
Ruili Wang
Wanting Ji
Hao Zheng
En Zhu
Jianping Yin
机构
[1] Massey University,School of Computer
[2] National University of Defense Technology,College of information engineering
[3] Zhejiang Gongshang University,School of Computer
[4] Nanjing Xiaozhuang University,undefined
[5] National University of Defense Technology,undefined
[6] Dongguan University of Technology,undefined
来源
关键词
Internet of Things; smart environment; deep neural networks; recurrent neural networks; sound event detection;
D O I
暂无
中图分类号
学科分类号
摘要
A smart environment is one of the application scenarios of the Internet of Things (IoT). In order to provide a ubiquitous smart environment for humans, a variety of technologies are developed. In a smart environment system, sound event detection is one of the fundamental technologies, which can automatically sense sound changes in the environment and detect sound events that cause changes. In this paper, we propose the use of Relational Recurrent Neural Network (RRNN) for polyphonic sound event detection, called RRNN-SED, which utilized the strength of RRNN in long-term temporal context extraction and relational reasoning across a polyphonic sound signal. Different from previous sound event detection methods, which rely heavily on convolutional neural networks or recurrent neural networks, the proposed RRNN-SED method can solve long-lasting and overlapping problems in polyphonic sound event detection. Specifically, since the historical information memorized inside RRNNs is capable of interacting with each other across a polyphonic sound signal, the proposed RRNN-SED method is effective and efficient in extracting temporal context information and reasoning the unique relational characteristic of the target sound events. Experimental results on two public datasets show that the proposed method achieved better sound event detection results in terms of segment-based F-score and segment-based error rate.
引用
下载
收藏
页码:29509 / 29527
页数:18
相关论文
共 50 条
  • [1] Relational recurrent neural networks for polyphonic sound event detection
    Ma, Junbo
    Wang, Ruili
    Ji, Wanting
    Zheng, Hao
    Zhu, En
    Yin, Jianping
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (20) : 29509 - 29527
  • [2] Convolutional Recurrent Neural Networks for Polyphonic Sound Event Detection
    Cakir, Emre
    Parascandolo, Giambattista
    Heittola, Toni
    Huttunen, Heikki
    Virtanen, Tuomas
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (06) : 1291 - 1303
  • [3] RECURRENT NEURAL NETWORKS FOR POLYPHONIC SOUND EVENT DETECTION IN REAL LIFE RECORDINGS
    Parascandolo, Giambattista
    Huttunen, Heikki
    Virtanen, Tuomas
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 6440 - 6444
  • [4] Polyphonic Sound Event Detection by Using Capsule Neural Networks
    Vesperini, Fabio
    Gabrielli, Leonardo
    Principi, Emanuele
    Squartini, Stefano
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2019, 13 (02) : 310 - 322
  • [5] POLYPHONIC SOUND EVENT DETECTION USING TRANSPOSED CONVOLUTIONAL RECURRENT NEURAL NETWORK
    Chatterjee, Chandra Churh
    Mulimani, Manjunath
    Koolagudi, Shashidhar G.
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 661 - 665
  • [6] Polyphonic Sound Event Detection Using Multi Label Deep Neural Networks
    Cakir, Emre
    Heittola, Toni
    Huttunen, Heikki
    Virtanen, Tuomas
    2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,
  • [7] Sound Event Detection via Conformer Recurrent Neural Networks
    Gao, Fangqing
    Li, Xin
    Wei, Xiukun
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 4749 - 4754
  • [8] Convolutional Neural Networks with Multi-task Loss for Polyphonic Sound Event Detection
    Liu, Huang
    Wang, Xiu
    Guan, Fa-Qian
    Hu, Jin-Sen
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND APPLICATION ENGINEERING (CSAE2018), 2018,
  • [9] SOUND EVENT DETECTION VIA DILATED CONVOLUTIONAL RECURRENT NEURAL NETWORKS
    Li, Yanxiong
    Liu, Mingle
    Drossos, Konstantinos
    Virtanen, Tuomas
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 286 - 290
  • [10] Metrics for Polyphonic Sound Event Detection
    Mesaros, Annamaria
    Heittola, Toni
    Virtanen, Tuomas
    APPLIED SCIENCES-BASEL, 2016, 6 (06):