Improving Distortion Robustness of Self-supervised Speech Processing Tasks with Domain Adaptation

被引:3
|
作者
Huang, Kuan Po [1 ]
Fu, Yu-Kuan [2 ]
Zhang, Yu [3 ]
Lee, Hung-yi [4 ]
机构
[1] Natl Taiwan Univ, Grad Inst Comp Sci & Informat Engn, Taipei, Taiwan
[2] Natl Taiwan Univ, Dept Phys, Taipei, Taiwan
[3] Google Brain, New York, NY USA
[4] Natl Taiwan Univ, Grad Inst Commun Engn, Taipei, Taiwan
来源
关键词
domain adversarial training; self-supervised models; speech processing tasks; continual training; SUPERB;
D O I
10.21437/Interspeech.2022-519
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Speech distortions are a long-standing problem that degrades the performance of supervisely trained speech processing models. It is high time that we enhance the robustness of speech processing models to obtain good performance when encountering speech distortions while not hurting the original performance on clean speech. In this work, we propose to improve the robustness of speech processing models by domain adversarial training (DAT). We conducted experiments based on the SUPERB framework on five different speech processing tasks. In case we do not always have knowledge of the distortion types for speech data, we analyzed the binary-domain and multi-domain settings, where the former treats all distorted speech as one domain, and the latter views different distortions as different domains. In contrast to supervised training methods, we obtained promising results in target domains where speech data is distorted with different distortions including new unseen distortions introduced during testing.
引用
收藏
页码:2193 / 2197
页数:5
相关论文
共 50 条
  • [1] Self-Supervised Domain Adaptation for Computer Vision Tasks
    Xu, Jiaolong
    Xiao, Liang
    Lopez, Antonio M.
    IEEE ACCESS, 2019, 7 : 156694 - 156706
  • [2] Domain Adversarial Self-Supervised Speech Representation Learning for Improving Unknown Domain Downstream Tasks
    Tanaka, Tomohiro
    Masumura, Ryo
    Sato, Hiroshi
    Ihori, Mana
    Matsuura, Kohei
    Ashihara, Takanori
    Moriya, Takafumi
    INTERSPEECH 2022, 2022, : 1066 - 1070
  • [3] Improving Speech Emotion Recognition Using Self-Supervised Learning with Domain-Specific Audiovisual Tasks
    Goncalves, Lucas
    Busso, Carlos
    INTERSPEECH 2022, 2022, : 1168 - 1172
  • [4] PADA: PRUNING ASSISTED DOMAIN ADAPTATION FOR SELF-SUPERVISED SPEECH REPRESENTATIONS
    Lodagala, Vasista Sai
    Ghosh, Sreyan
    Umesh, S.
    2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 136 - 143
  • [5] A framework for self-supervised federated domain adaptation
    Bin Wang
    Gang Li
    Chao Wu
    WeiShan Zhang
    Jiehan Zhou
    Ye Wei
    EURASIP Journal on Wireless Communications and Networking, 2022
  • [6] Self-Supervised Domain Adaptation with Consistency Training
    Xiao, Liang
    Xu, Jiaolong
    Zhao, Dawei
    Wang, Zhiyu
    Wang, Li
    Nie, Yiming
    Dai, Bin
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 6874 - 6880
  • [7] SELF-SUPERVISED DOMAIN ADAPTATION IN CROWD COUNTING
    Nguyen, Pha
    Truong, Thanh-Dat
    Huang, Miaoqing
    Liang, Yi
    Le, Ngan
    Luu, Khoa
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 2786 - 2790
  • [8] Enhancing unsupervised domain adaptation by exploiting the conceptual consistency of multiple self-supervised tasks
    Hui SUN
    Ming LI
    Science China(Information Sciences), 2023, 66 (04) : 126 - 139
  • [9] A framework for self-supervised federated domain adaptation
    Wang, Bin
    Li, Gang
    Wu, Chao
    Zhang, WeiShan
    Zhou, Jiehan
    Wei, Ye
    EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING, 2022, 2022 (01)
  • [10] Enhancing unsupervised domain adaptation by exploiting the conceptual consistency of multiple self-supervised tasks
    Sun, Hui
    Li, Ming
    SCIENCE CHINA-INFORMATION SCIENCES, 2023, 66 (04)