Augmented Strategy For Polyphonic Sound Event Detection

被引:0
|
作者
Wang, Bolun [1 ]
Fu, Zhong-Hua [1 ,2 ]
Wu, Hao [1 ]
机构
[1] Northwestern Polytech Univ, Sch Comp Sci, Xian, Peoples R China
[2] Xian IFLYTEK Hyper Brain Informat Technol Co Ltd, Xian, Peoples R China
关键词
Sound event detection; Data augmentation; Model fusion; ACOUSTIC SCENES; CLASSIFICATION;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Sound event detection is an important issue for many applications like audio content retrieval, intelligent monitoring, and scene-based interaction. The traditional studies on this topic are mainly focusing on identification of single sound event class. However, in real applications, several sound events usually happen concurrently and with different durations. That leads to a new detection task on polyphonic sound event classification along with event time boundaries. In this paper, we propose an augmented strategy for this task, which faces challenges of a large amount of unbalanced and weakly labelled training data. Specifically, the strategy includes data augmentation to enrich training set to eliminate data unbalance, a new loss function that combines cross entropy and F-score, and model fusion to integrate the powers of different classifiers. The performance of the strategy is validated on DCASE2019 dataset, and both the event and segment detections are significantly improved over the baseline system.
引用
收藏
页码:1496 / 1500
页数:5
相关论文
共 50 条
  • [11] A SEQUENCE MATCHING NETWORK FOR POLYPHONIC SOUND EVENT LOCALIZATION AND DETECTION
    Thi Ngoc Tho Nguyen
    Jones, Douglas L.
    Gan, Woon-Seng
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 71 - 75
  • [12] Complex Activity Recognition Using Polyphonic Sound Event Detection
    Kang, Jaewoong
    Kim, Jooyeong
    Kim, Kunyoung
    Sohn, Mye
    INNOVATIVE MOBILE AND INTERNET SERVICES IN UBIQUITOUS COMPUTING, IMIS-2018, 2019, 773 : 675 - 684
  • [13] Relational recurrent neural networks for polyphonic sound event detection
    Ma, Junbo
    Wang, Ruili
    Ji, Wanting
    Zheng, Hao
    Zhu, En
    Yin, Jianping
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (20) : 29509 - 29527
  • [14] AN IMPROVED EVENT-INDEPENDENT NETWORK FOR POLYPHONIC SOUND EVENT LOCALIZATION AND DETECTION
    Gao, Yin
    Iqbal, Turab
    Kong, Qiuqiang
    An, Fengyan
    Wang, Wenwu
    Plumbley, Mark D.
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 885 - 889
  • [15] Polyphonic Sound Event Detection by Using Capsule Neural Networks
    Vesperini, Fabio
    Gabrielli, Leonardo
    Principi, Emanuele
    Squartini, Stefano
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2019, 13 (02) : 310 - 322
  • [16] Relational recurrent neural networks for polyphonic sound event detection
    Junbo Ma
    Ruili Wang
    Wanting Ji
    Hao Zheng
    En Zhu
    Jianping Yin
    Multimedia Tools and Applications, 2019, 78 : 29509 - 29527
  • [17] Convolutional Recurrent Neural Networks for Polyphonic Sound Event Detection
    Cakir, Emre
    Parascandolo, Giambattista
    Heittola, Toni
    Huttunen, Heikki
    Virtanen, Tuomas
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (06) : 1291 - 1303
  • [18] Dataset for polyphonic sound event detection tasks in urban soundscapes: The synthetic polyphonic ambient sound source (SPASS) dataset
    Viveros-Munoz, Rhoddy
    Huijse, Pablo
    Vargas, Victor
    Espejo, Diego
    Poblete, Victor
    Arenas, Jorge P.
    Vernier, Matthieu
    Vergara, Diego
    Suarez, Enrique
    DATA IN BRIEF, 2023, 50
  • [19] Fully Convolutional Dense Net based polyphonic sound event detection
    Zhe, He
    Ying, Li
    2018 INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, BIG DATA AND BLOCKCHAIN (ICCBB 2018), 2018, : 191 - 196
  • [20] SoundDet: Polyphonic Sound Event Detection and Localization from Raw Waveform
    He, Yuhang
    Trigoni, Niki
    Markham, Andrew
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139