Frame-wise dynamic threshold based polyphonic acoustic event detection

Cited by: 8
Authors
Xia, Xianjun [1]
Togneri, Roberto [1]
Sohel, Ferdous [2]
Huang, David [1]
Affiliations
[1] Univ Western Australia, Sch Elect Elect & Comp Engn, Nedlands, WA, Australia
[2] Murdoch Univ, Sch Engn & Informat Technol, Murdoch, WA, Australia
Keywords
acoustic event detection; multi-label classification; dynamic threshold; neural networks
DOI
10.21437/Interspeech.2017-746
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Acoustic event detection, the determination of the acoustic event type and the localisation of the event, has been applied in many real-world settings. Many works adopt multi-label classification techniques to perform polyphonic acoustic event detection, using a global threshold to detect the active acoustic events. However, the global threshold has to be set manually and is highly dependent on the database being tested. To deal with this, we replace the fixed threshold method with a frame-wise dynamic threshold approach in this paper. Two novel approaches, namely contour-based and regressor-based dynamic threshold approaches, are proposed in this work. Experimental results on the popular TUT Acoustic Scenes 2016 database of polyphonic events demonstrate the superior performance of the proposed approaches.
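The difference between a global threshold and a frame-wise threshold can be illustrated with a minimal Python sketch. It is not the paper's method: the abstract does not describe how the contour-based or regressor-based thresholds are computed, so the `mean_thresholds` rule below (each frame's threshold is the mean of its class scores) is a hypothetical stand-in, and the function names and the example scores are likewise assumptions made for illustration.

```python
from typing import List

Frame = List[float]  # per-class scores (e.g. classifier outputs) for one frame


def detect_global(probs: List[Frame], threshold: float) -> List[List[int]]:
    """Mark a class active in a frame when its score reaches one fixed threshold."""
    return [[int(p >= threshold) for p in frame] for frame in probs]


def detect_framewise(probs: List[Frame], thresholds: List[float]) -> List[List[int]]:
    """Mark a class active when its score reaches that frame's own threshold."""
    if len(probs) != len(thresholds):
        raise ValueError("need exactly one threshold per frame")
    return [[int(p >= th) for p in frame] for frame, th in zip(probs, thresholds)]


def mean_thresholds(probs: List[Frame]) -> List[float]:
    """Hypothetical stand-in rule (NOT the paper's contour or regressor method):
    use the mean class score of each frame as that frame's threshold."""
    return [sum(frame) / len(frame) for frame in probs]


# Made-up scores for 3 frames and 3 event classes.
scores = [
    [0.90, 0.80, 0.10],  # two clearly active events
    [0.30, 0.25, 0.05],  # same two events, but with weak scores
    [0.60, 0.10, 0.10],  # a single active event
]

global_labels = detect_global(scores, threshold=0.5)
framewise_labels = detect_framewise(scores, mean_thresholds(scores))
```

In this made-up example the fixed threshold of 0.5 finds no events in the second frame, while the per-frame threshold still marks the two weaker events there. The sketch only shows how the decision rule changes once the threshold is allowed to vary per frame.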
Pages: 474 - 478
Page count: 5
Related papers
50 in total
  • [41] Class-wise Centroid Distance Metric Learning for Acoustic Event Detection
    Lu, Xugang
    Shen, Peng
    Li, Sheng
    Tsao, Yu
    Kawai, Hisashi
    INTERSPEECH 2019, 2019, : 3614 - 3618
  • [42] Multi-Task Learning for Acoustic Event Detection Using Event and Frame Position Information
    Xia, Xianjun
    Togneri, Roberto
    Sohel, Ferdous
    Zhao, Yuanjun
    Huang, Defeng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (03) : 569 - 578
  • [43] Threshold-Based Widespread Event Detection
    Zhou, You
    Zhou, Yian
    Chen, Shigang
    2019 39TH IEEE INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2019), 2019, : 399 - 408
  • [44] Improved Frame-Wise Segmentation of Audio Signals for Smart Hearing Aid Using Particle Swarm Optimization-Based Clustering
    Mehrotra, Tushar
    Shukla, Neha
    Chaudhary, Tarunika
    Rajput, Gaurav Kumar
    Altuwairiqi, Majid
    Asif Shah, Mohd
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022
  • [45] Filterbank Learning for Deep Neural Network Based Polyphonic Sound Event Detection
    Cakir, Emre
    Ozan, Ezgi Can
    Virtanen, Tuomas
    2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 3399 - 3406
  • [46] Fingerprint feature and dynamic threshold mechanism based on acoustic emission for bearing fault detection
    Wang, Cuiping
    Qi, Hongyuan
    Hou, Dongming
    Han, Defu
    Yang, Jiangtian
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2023, 199
  • [47] Polyphonic sound event localization and detection based on Multiple Attention Fusion ResNet
    Zhang S.
    Zhang Y.
    Liao Y.
    Pang K.
    Wan Z.
    Zhou S.
    Mathematical Biosciences and Engineering, 2024, 21 (02) : 2004 - 2023
  • [48] Dynamic Threshold Based Keyframe Detection
    Yao, W.
    Rahardja, S.
    ICIEA 2010: PROCEEDINGS OF THE 5TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, VOL 4, 2010, : 401 - 405
  • [49] No need for frame-wise attenuation correction in dynamic Rubidium-82 PET for myocardial blood flow quantification (vol 26, pg 738, 2019)
    van Dijk, J. D.
    Jager, P. L.
    Ottervanger, J. P.
    Slump, C. H.
    van Dalen, J. A.
    JOURNAL OF NUCLEAR CARDIOLOGY, 2019, 26 (03) : 746 - 746
  • [50] Frame-Based Representation for Event Detection on Twitter
    Qin, Yanxia
    Zhang, Yue
    Zhang, Min
    Zheng, Dequan
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2018, E101D (04) : 1180 - 1188