CONFIDENCE BASED ACOUSTIC EVENT DETECTION

被引:0
|
作者
Xia, Xianjun [1 ]
Togneri, Roberto [1 ]
Sohel, Ferdous [2 ]
Huang, David [1 ]
机构
[1] Univ Western Australia, Sch Elect Elect & Comp Engn, Nedlands, WA, Australia
[2] Murdoch Univ, Sch Engn & Informat Technol, Murdoch, WA, Australia
关键词
acoustic event detection; multi-label classification; confidence; multi-variable regression;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Acoustic event detection, the determination of the acoustic event type and the localisation of the event, has been widely applied in many real-world applications. Many works adopt the multi-label classification technique to perform the polyphonic acoustic event detection with a global threshold to detect the active acoustic events. However, the manually labeled boundaries are error-prone and cannot always be accurate, especially when the frame length is too short to be accurately labeled by human annotators. To deal with this, a confidence is assigned to each frame and acoustic event detection is performed using a multi-variable regression approach in this paper. Experimental results on the latest TUT sound event 2017 database of polyphonic events demonstrate the superior performance of the proposed approach compared to the multi-label classification based AED method.
引用
收藏
页码:306 / 310
页数:5
相关论文
共 50 条
  • [31] SPECTROGRAM PATCH BASED ACOUSTIC EVENT DETECTION AND CLASSIFICATION IN SPEECH OVERLAPPING CONDITIONS
    Espi, Miquel
    Fujimoto, Masakiyo
    Kubo, Yotaro
    Nakatani, Tomohiro
    2014 4TH JOINT WORKSHOP ON HANDS-FREE SPEECH COMMUNICATION AND MICROPHONE ARRAYS (HSCMA), 2014, : 117 - 121
  • [32] Frame-wise dynamic threshold based polyphonic acoustic event detection
    Xia, Xianjun
    Togneri, Roberto
    Sohel, Ferdous
    Huang, David
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 474 - 478
  • [33] A Blind Segmentation Approach to Acoustic Event Detection Based on I-Vector
    Huang, Zhen
    Cheng, You-Chi
    Li, Kehuang
    Hautamaki, Ville
    Lee, Chin-Hui
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2281 - 2285
  • [34] TUT acoustic event detection system 2007
    Heittola, Toni
    Klapuri, Anssi
    MULTIMODAL TECHNOLOGIES FOR PERCEPTION OF HUMANS, 2008, 4625 : 364 - 370
  • [35] An Acoustic Surveillance System for Critical Event Detection
    Guneren, Hilal
    Unal, Erdem
    Bahadirlar, Yildirim
    Guler, Emin Cagatay
    2016 24TH SIGNAL PROCESSING AND COMMUNICATION APPLICATION CONFERENCE (SIU), 2016, : 1413 - 1416
  • [36] On Learning Disentangled Representation for Acoustic Event Detection
    Gao, Lijian
    Mao, Qirong
    Dong, Ming
    Jing, Yu
    Chinnam, Ratna
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 2006 - 2014
  • [37] Feature analysis and selection for acoustic event detection
    Zhuang, Xiaodan
    Zhou, Xi
    Huang, Thomas S.
    Hasegawa-Johnson, Mark
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 17 - 20
  • [38] ACOUSTIC EVENT DETECTION IN REAL LIFE RECORDINGS
    Mesaros, Annamaria
    Heittola, Toni
    Eronen, Antti
    Virtanen, Tuomas
    18TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2010), 2010, : 1267 - 1271
  • [39] Event Recognition with Automatic Album Detection based on Sequential Grouping of Confidence Scores and Neural Attention
    Savchenko, Andrey
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [40] Real-world acoustic event detection
    Zhuang, Xiaodan
    Zhou, Xi
    Hasegawa-Johnson, Mark A.
    Huang, Thomas S.
    PATTERN RECOGNITION LETTERS, 2010, 31 (12) : 1543 - 1551