Improving Distortion Robustness of Self-supervised Speech Processing Tasks with Domain Adaptation

被引:3
|
作者
Huang, Kuan Po [1 ]
Fu, Yu-Kuan [2 ]
Zhang, Yu [3 ]
Lee, Hung-yi [4 ]
机构
[1] Natl Taiwan Univ, Grad Inst Comp Sci & Informat Engn, Taipei, Taiwan
[2] Natl Taiwan Univ, Dept Phys, Taipei, Taiwan
[3] Google Brain, New York, NY USA
[4] Natl Taiwan Univ, Grad Inst Commun Engn, Taipei, Taiwan
来源
关键词
domain adversarial training; self-supervised models; speech processing tasks; continual training; SUPERB;
D O I
10.21437/Interspeech.2022-519
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Speech distortions are a long-standing problem that degrades the performance of supervisely trained speech processing models. It is high time that we enhance the robustness of speech processing models to obtain good performance when encountering speech distortions while not hurting the original performance on clean speech. In this work, we propose to improve the robustness of speech processing models by domain adversarial training (DAT). We conducted experiments based on the SUPERB framework on five different speech processing tasks. In case we do not always have knowledge of the distortion types for speech data, we analyzed the binary-domain and multi-domain settings, where the former treats all distorted speech as one domain, and the latter views different distortions as different domains. In contrast to supervised training methods, we obtained promising results in target domains where speech data is distorted with different distortions including new unseen distortions introduced during testing.
引用
收藏
页码:2193 / 2197
页数:5
相关论文
共 50 条
  • [31] Distribution regularized self-supervised learning for domain adaptation of semantic segmentation
    Iqbal, Javed
    Rawal, Hamza
    Ha, Rehan
    Chi, Yu-Tseh
    Ali, Mohsen
    IMAGE AND VISION COMPUTING, 2022, 124
  • [32] FogAdapt: Self-supervised domain adaptation for semantic segmentation of foggy images
    Iqbal, Javed
    Hafiz, Rehan
    Ali, Mohsen
    NEUROCOMPUTING, 2022, 501 : 844 - 856
  • [33] Self-supervised Domain Adaptation for Forgery Localization of JPEG Compressed Images
    Rao, Yuan
    Ni, Jiangqun
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 15014 - 15023
  • [34] Self-Supervised Domain Adaptation for Visual Navigation with Global Map Consistency
    Lee, Eun Sun
    Kim, Junho
    Kim, Young Min
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 1868 - 1877
  • [35] SELF-SUPERVISED LEARNING BASED DOMAIN ADAPTATION FOR ROBUST SPEAKER VERIFICATION
    Chen, Zhengyang
    Wang, Shuai
    Qian, Yanmin
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 5834 - 5838
  • [36] Self-Supervised Adversarial Learning for Domain Adaptation of Pavement Distress Classification
    Wu, Yanwen
    Hong, Mingjian
    Li, Ao
    Huang, Sheng
    Liu, Huijun
    Ge, Yongxin
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (02) : 1966 - 1977
  • [37] Unsupervised New-set Domain Adaptation with Self-supervised Knowledge
    Wang Y.-Y.
    Sun G.-W.
    Zhao G.-X.
    Xue H.
    Ruan Jian Xue Bao/Journal of Software, 2022, 33 (04): : 1170 - 1182
  • [38] Robust self-supervised learning for source-free domain adaptation
    Liang Tian
    Lihua Zhou
    Hao Zhang
    Zhenbin Wang
    Mao Ye
    Signal, Image and Video Processing, 2023, 17 : 2405 - 2413
  • [39] Self-supervised domain adaptation on point clouds via homomorphic augmentation ☆
    Yang, Jiming
    Da, Feipeng
    Hong, Ru
    COMPUTERS & GRAPHICS-UK, 2024, 121
  • [40] Robust self-supervised learning for source-free domain adaptation
    Tian, Liang
    Zhou, Lihua
    Zhang, Hao
    Wang, Zhenbin
    Ye, Mao
    SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (05) : 2405 - 2413