RCT: Random Consistency Training for Semi-Supervised Sound Event Detection

被引:1
|
作者
Shao, Nian [1 ,2 ]
Loweimi, Erfan [3 ]
Li, Xiaofei [1 ,2 ]
机构
[1] Westlake Univ, Hangzhou, Peoples R China
[2] Westlake Inst Adv Study, Hangzhou, Peoples R China
[3] Univ Edinburgh, CSTR, Edinburgh, Midlothian, Scotland
来源
关键词
semi-supervised learning; sound event detection; data augmentation; consistency regularization; hard mixup;
D O I
10.21437/Interspeech.2022-10037
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Sound event detection (SED), as a core module of acoustic environmental analysis, suffers from the problem of data deficiency. The integration of semi-supervised learning (SSL) largely mitigates such problem. This paper researches on several core modules of SSL, and introduces a random consistency training (RCT) strategy. First, a hard mixup data augmentation is proposed to account for the additive property of sounds. Second, a random augmentation scheme is applied to stochastically combine different types of data augmentation methods with high flexibility. Third, a self-consistency loss is proposed to be fused with the teacher-student model, aiming at stabilizing the training. Performance-wise, the proposed modules outperform their respective competitors, and as a whole the proposed SED strategies achieve 44.0% and 67.1% in terms of the PSDS1 and PSDS2 metrics proposed by the DCASE challenge, which notably outperforms other widely-used alternatives.
引用
收藏
页码:1541 / 1545
页数:5
相关论文
共 50 条
  • [21] Semi-supervised New Event Type Induction and Event Detection
    Huang, Lifu
    Ji, Heng
    [J]. PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 718 - 724
  • [22] Efficient Semi-supervised Consistency Training for Natural Language Understanding
    Leung, George
    Tan, Joshua
    [J]. 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, NAACL-HLT 2022, 2022, : 86 - 93
  • [23] Semi-supervised Left Atrium Segmentation with Mutual Consistency Training
    Wu, Yicheng
    Xu, Minfeng
    Ge, Zongyuan
    Cai, Jianfei
    Zhang, Lei
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT II, 2021, 12902 : 297 - 306
  • [24] Confidence Learning for Semi-Supervised Acoustic Event Detection
    Liu, Yuzhuo
    Chen, Hangting
    Wang, Jian
    Wang, Pei
    Zhang, Pengyuan
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (18):
  • [25] Semi-supervised adapted HMMs for unusual event detection
    Zhang, D
    Gatica-Perez, D
    Bengio, S
    McCowan, I
    [J]. 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Vol 1, Proceedings, 2005, : 611 - 618
  • [26] Semi-supervised semantic segmentation using cross-consistency training for pavement crack detection
    Liu, Xiaoyu
    Wu, Kuanghuai
    Cai, Xu
    Huang, Wenke
    [J]. ROAD MATERIALS AND PAVEMENT DESIGN, 2024, 25 (06) : 1368 - 1380
  • [27] Weakly Labeled Semi-Supervised Sound Event Detection with Multi-Scale Residual Attention
    Tang, Maolin
    Zhao, Qijun
    Liu, Zhengxi
    [J]. 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [28] Consistency-based Semi-supervised Learning for Object Detection
    Jeong, Jisoo
    Lee, Seungeui
    Kim, Jeesoo
    Kwak, Nojun
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [29] ConsE: Consistency Exploitation for Semi-Supervised Anomaly Detection in Graphs
    Chang, Wenjing
    Yu, Jianjun
    Zhou, Xiaojun
    [J]. 2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [30] Active Semi-supervised Grasp Pose Detection with Geometric Consistency
    Bai, Fan
    Zhu, Delong
    Cheng, Hu
    Xu, Peng
    Meng, Max Q. -H.
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (IEEE-ROBIO 2021), 2021, : 1402 - 1408