Frame-wise dynamic threshold based polyphonic acoustic event detection

被引:8
|
作者
Xia, Xianjun [1 ]
Togneri, Roberto [1 ]
Sohel, Ferdous [2 ]
Huang, David [1 ]
机构
[1] Univ Western Australia, Sch Elect Elect & Comp Engn, Nedlands, WA, Australia
[2] Murdoch Univ, Sch Engn & Informat Technol, Murdoch, WA, Australia
关键词
acoustic event detection; multi-label classification; dynamic threshold; NEURAL-NETWORKS;
D O I
10.21437/Interspeech.2017-746
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Acoustic event detection, the determination of the acoustic event type and the localisation of the event, has been widely applied in many real-world applications. Many works adopt multi-label classification techniques to perform the polyphonic acoustic event detection with a global threshold to detect the active acoustic events. However, the global threshold has to be set manually and is highly dependent on the database being tested. To deal with this, we replaced the fixed threshold method with a frame-wise dynamic threshold approach in this paper. Two novel approaches, namely contour and regressor based dynamic threshold approaches are proposed in this work. Experimental results on the popular TUT Acoustic Scenes 2016 database of polyphonic events demonstrated the superior performance of the proposed approaches.
引用
收藏
页码:474 / 478
页数:5
相关论文
共 50 条
  • [1] CONTRASTIVE LOSS BASED FRAME-WISE FEATURE DISENTANGLEMENT FOR POLYPHONIC SOUND EVENT DETECTION
    Guan, Yadong
    Han, Jiqing
    Song, Hongwei
    Song, Wenjie
    Zheng, Guibin
    Zheng, Tieran
    He, Yongjun
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 1021 - 1025
  • [2] CNN-based Discriminative Training for Domain Compensation in Acoustic Event Detection with Frame-wise Classifier
    Tang, Tiantian
    Zhou, Xinyuan
    Long, Yanhua
    Li, Yijie
    Liang, Jiaen
    2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 939 - 944
  • [3] Song wave retrieval based on frame-wise phoneme recognition
    Yaguchi, Y
    Oka, R
    INFORMATION RETRIEVAL TECHNOLOGY, PROCEEDINGS, 2005, 3689 : 503 - 509
  • [4] Frame-Wise Action Recognition Training Framework for Skeleton-Based Anomaly Behavior Detection
    Tani, Hiroaki
    Shibata, Tomoyuki
    IMAGE ANALYSIS AND PROCESSING, ICIAP 2022, PT III, 2022, 13233 : 312 - 323
  • [5] Compression of CTC-Trained Acoustic Models by Dynamic Frame-Wise Distillation Or Segment-Wise N-Best Hypotheses Imitation
    Ding, Haisong
    Chen, Kai
    Huo, Qiang
    INTERSPEECH 2019, 2019, : 3218 - 3222
  • [6] Blind source separation using frame-wise DOA estimates based on duet
    Iwasaki, Nobuo
    Nishimura, Shunya
    Inoue, Katsuhiro
    Gotanda, Hiromu
    ICIC Express Letters, Part B: Applications, 2015, 6 (03): : 877 - 886
  • [7] The filter diagonalisation method for music signal analysis: frame-wise vibrato detection and estimation
    Yang, Luwei
    Rajab, Khalid Z.
    Chew, Elaine
    JOURNAL OF MATHEMATICS AND MUSIC, 2017, 11 (01) : 42 - 60
  • [8] Frame-Wise CNN-Based Filtering for Intra-Frame Quality Enhancement of HEVC Videos
    Huang, Hongyue
    Schiopu, Ionut
    Munteanu, Adrian
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (06) : 2100 - 2113
  • [9] UNSUPERVISED TRAINING OF DETECTION THRESHOLD FOR POLYPHONIC MUSICAL NOTE TRACKING BASED ON EVENT PERIODICITY
    Tavares, Tiago Fernandes
    Arnal Barbedo, Jayme Garcia
    Attux, Romis
    Lopes, Amauri
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 21 - 25
  • [10] A TRACK-WISE ENSEMBLE EVENT INDEPENDENT NETWORK FOR POLYPHONIC SOUND EVENT LOCALIZATION AND DETECTION
    Hu, Jinbo
    Cao, Yin
    Wu, Ming
    Kong, Qiuqiang
    Yang, Feiran
    Plumbley, Mark D.
    Yang, Jun
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 9196 - 9200