TEACHER-STUDENT TRAINING FOR ACOUSTIC EVENT DETECTION USING AUDIOSET

被引:0
|
作者
Shi, Ruibo [1 ]
Ng, Raymond W. M. [1 ]
Swietojanski, Pawel [2 ]
机构
[1] Emotech Labs, London, England
[2] Univ New South Wales, Sch Comp Sci & Engn, Sydney, NSW, Australia
关键词
Acoustic Event Detection; Weakly-supervised training; Teacher-Student Training; Attention;
D O I
10.1109/icassp.2019.8683048
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper studies Acoustic Event Detection (AED) systems and the problem of their rapid and easy customisation to arbitrary deployment scenarios. Due to inherent challenges related to annotation processes of AED data (time-consuming and error-prone due to often unclear time-stamping), most of the available large-scale datasets for AED are released with weak clip-level labels, which also affects how one should design weakly-supervised training procedures. In this paper, we investigate a teacher-student training approach of learning low-complexity student models, using large teachers. We first show that state-of-the-art performance can be achieved by a Convolutional Neural Network (CNN) model with appropriate attention mechanism. Then we describe a framework that enables learning arbitrary small-footprint, generic or domain-expert, AED systems from generic teachers. We carry experiments on Audioset -a large-scale weakly labelled dataset of acoustic events.
引用
收藏
页码:875 / 879
页数:5
相关论文
共 50 条
  • [1] Voice Activity Detection in the Wild: A Data-Driven Approach Using Teacher-Student Training
    Dinkel, Heinrich
    Wang, Shuai
    Xu, Xuenan
    Wu, Mengyue
    Yu, Kai
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 1542 - 1555
  • [2] SEQUENCE TEACHER-STUDENT TRAINING OF ACOUSTIC MODELS FOR AUTOMATIC FREE SPEAKING LANGUAGE ASSESSMENT
    Wang, Y.
    Wong, J. H. M.
    Gales, M. J. F.
    Knill, K. M.
    Ragni, A.
    [J]. 2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 994 - 1000
  • [3] Semisupervised Cross Domain Teacher-Student Mutual Training for Damaged Building Detection
    Pan, Jie
    Yin, Pengyu
    Sun, Xian
    Tan, Junxiang
    Li, Wei
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 : 8191 - 8203
  • [4] Acoustic scene classification using teacher-student learning with soft-labels
    Heo, Hee-Soo
    Jung, Jee-weon
    Shim, Hye-jin
    Yu, Ha-Jin
    [J]. INTERSPEECH 2019, 2019, : 614 - 618
  • [5] The Influence of Student Personality and Teacher-student Interactions on Teacher-student Relationship Quality
    Tan, Tengteng
    Wang, Naiyi
    [J]. PROCEEDINGS OF THE 2013 INTERNATIONAL CONFERENCE ON APPLIED SOCIAL SCIENCE RESEARCH (ICASSR-2013), 2013, 38 : 174 - 177
  • [6] PROGRESSIVE TEACHER-STUDENT TRAINING FRAMEWORK FOR MUSIC TAGGING
    Lu, Rui
    Zheng, Baigong
    Hai, Jiarui
    Tao, Fei
    Duan, Zhiyao
    Liu, Ji
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 3129 - 3133
  • [7] MULTI-TASK ENSEMBLES WITH TEACHER-STUDENT TRAINING
    Wong, Jeremy H. M.
    Gales, Mark J. F.
    [J]. 2017 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2017, : 84 - 90
  • [8] Teacher-Student Mutual Training for Semi-Supervised Object Detection Based on PPYOLOE
    Zhang, Guoshan
    Wei, Jinman
    [J]. Tianjin Daxue Xuebao (Ziran Kexue yu Gongcheng Jishu Ban)/Journal of Tianjin University Science and Technology, 2024, 57 (04): : 415 - 423
  • [9] Teacher-Student Mutual Training for Semi-Supervised Object Detection Based on PPYOLOE
    Zhang G.
    Wei J.
    [J]. Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology, 2024, 57 (04): : 415 - 423
  • [10] Teacher-Student BLSTM Mask Model for Robust Acoustic Beamforming
    Liu, Zhaoyi
    Zou, Yuexian
    [J]. 2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 638 - 643