COUPLE LEARNING FOR SEMI-SUPERVISED SOUND EVENT DETECTION

被引:0
|
作者
Tao, Rui [1 ]
Yan, Long [1 ]
Ouchi, Kazushige [1 ]
Wang, Xiangdong [2 ]
机构
[1] Toshiba China R&D Ctr, Beijing, Peoples R China
[2] Chinese Acad Sci, Beijing Key Lab Mobile Comp & Pervas Device, Inst Comp Technol, Beijing, Peoples R China
来源
关键词
semi-supervised; pseudo-label; Mean Teacher; sound event detection;
D O I
10.21437/Interspeech.2022-103
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The recently proposed Mean Teacher method, which exploits large-scale unlabeled data in a self-ensembling manner, has achieved state-of-the-art results in several semi-supervised learning benchmarks. Spurred by current achievements, this paper proposes an effective Couple Learning method that combines a well-trained model and a Mean Teacher model. The suggested pseudo-labels generated model (PLG) increases strongly- and weakly-labeled data to improve the Mean Teacher method's performance. Moreover, the Mean Teacher's consistency cost reduces the noise impact in the pseudo-labels introduced by detection errors. The experimental results on Task 4 of the DCASE2020 challenge demonstrate the superiority of the proposed method, achieving about 44.25% F1-score on the validation set without post-processing, significantly outperforming the baseline system's 32.39%. furthermore, this paper also propose a simple and effective experiment called the Variable Order Input (VOI) experiment, which proves the significance of the Couple Learning method. Our developed Couple Learning code is available on GitHub.
引用
收藏
页码:2398 / 2402
页数:5
相关论文
共 50 条
  • [1] Regression-based Sound Event Detection with Semi-supervised Learning
    Liu, Chia-Chuan
    Chen, Chia-Ping
    Lu, Chung-Li
    Chan, Bo-cheng
    Cheng, Yu-Han
    Chuang, Hsiang-Feng
    Chen, Wei-Yu
    [J]. 2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023, : 2336 - 2342
  • [2] SEMI-SUPERVISED LEARNING HELPS IN SOUND EVENT CLASSIFICATION
    Zhang, Zixing
    Schuller, Bjoern
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 333 - 336
  • [3] An Effective Perturbation based Semi-Supervised Learning Method for Sound Event Detection
    Zheng, Xu
    Song, Yan
    Yan, Jie
    Dai, Li-Rong
    McLoughlin, Ian
    Liu, Lin
    [J]. INTERSPEECH 2020, 2020, : 841 - 845
  • [4] GUIDED LEARNING FOR WEAKLY-LABELED SEMI-SUPERVISED SOUND EVENT DETECTION
    Lin, Liwei
    Wang, Xiangdong
    Liu, Hong
    Qian, Yueliang
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 626 - 630
  • [5] Semi-Supervised NMF-CNN for Sound Event Detection
    Chan, Teck Kai
    Chin, Cheng Siong
    Li, Ye
    [J]. IEEE ACCESS, 2021, 9 : 130529 - 130542
  • [6] On Local Temporal Embedding for Semi-Supervised Sound Event Detection
    Gao, Lijian
    Mao, Qirong
    Dong, Ming
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 1687 - 1698
  • [7] Confidence Learning for Semi-Supervised Acoustic Event Detection
    Liu, Yuzhuo
    Chen, Hangting
    Wang, Jian
    Wang, Pei
    Zhang, Pengyuan
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (18):
  • [8] SPARSE SELF-ATTENTION FOR SEMI-SUPERVISED SOUND EVENT DETECTION
    Guan, Yadong
    Xue, Jiabin
    Zheng, Guibin
    Han, Jiqing
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 821 - 825
  • [9] RCT: Random Consistency Training for Semi-Supervised Sound Event Detection
    Shao, Nian
    Loweimi, Erfan
    Li, Xiaofei
    [J]. INTERSPEECH 2022, 2022, : 1541 - 1545
  • [10] Comparative Assessment of Data Augmentation for Semi-Supervised Polyphonic Sound Event Detection
    Delphin-Poulat, Lionel
    Nicol, Rozenn
    Plapous, Cyril
    Peron, Katell
    [J]. PROCEEDINGS OF THE 2020 27TH CONFERENCE OF OPEN INNOVATIONS ASSOCIATION (FRUCT), 2020, : 46 - 53