Improving Distortion Robustness of Self-supervised Speech Processing Tasks with Domain Adaptation

被引:3
|
作者
Huang, Kuan Po [1 ]
Fu, Yu-Kuan [2 ]
Zhang, Yu [3 ]
Lee, Hung-yi [4 ]
机构
[1] Natl Taiwan Univ, Grad Inst Comp Sci & Informat Engn, Taipei, Taiwan
[2] Natl Taiwan Univ, Dept Phys, Taipei, Taiwan
[3] Google Brain, New York, NY USA
[4] Natl Taiwan Univ, Grad Inst Commun Engn, Taipei, Taiwan
来源
关键词
domain adversarial training; self-supervised models; speech processing tasks; continual training; SUPERB;
D O I
10.21437/Interspeech.2022-519
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Speech distortions are a long-standing problem that degrades the performance of supervisely trained speech processing models. It is high time that we enhance the robustness of speech processing models to obtain good performance when encountering speech distortions while not hurting the original performance on clean speech. In this work, we propose to improve the robustness of speech processing models by domain adversarial training (DAT). We conducted experiments based on the SUPERB framework on five different speech processing tasks. In case we do not always have knowledge of the distortion types for speech data, we analyzed the binary-domain and multi-domain settings, where the former treats all distorted speech as one domain, and the latter views different distortions as different domains. In contrast to supervised training methods, we obtained promising results in target domains where speech data is distorted with different distortions including new unseen distortions introduced during testing.
引用
收藏
页码:2193 / 2197
页数:5
相关论文
共 50 条
  • [41] Self-Supervised Domain Adaptation for 6DoF Pose Estimation
    Jin, Juseong
    Jeong, Eunju
    Cho, Joonmyun
    Kim, Young-Gon
    IEEE ACCESS, 2024, 12 : 101528 - 101535
  • [42] Self-supervised domain adaptation for machinery remaining useful life prediction
    Le Xuan, Quy
    Munderloh, Marco
    Ostermann, Joern
    RELIABILITY ENGINEERING & SYSTEM SAFETY, 2024, 250
  • [43] Distribution regularized self-supervised learning for domain adaptation of semantic segmentation
    Iqbal, Javed
    Rawal, Hamza
    Hafiz, Rehan
    Chi, Yu-Tseh
    Ali, Mohsen
    Image and Vision Computing, 2022, 124
  • [44] SSLChange: A Self-Supervised Change Detection Framework Based on Domain Adaptation
    Zhao, Yitao
    Celik, Turgay
    Liu, Nanqing
    Gao, Feng
    Li, Heng-Chao
    IEEE Transactions on Geoscience and Remote Sensing, 2024, 62
  • [45] CONFORMER-BASED SELF-SUPERVISED LEARNING FOR NON-SPEECH AUDIO TASKS
    Srivastava, Sangeeta
    Wang, Yun
    Tjandra, Andros
    Kumar, Anurag
    Liu, Chunxi
    Singh, Kritika
    Saraf, Yatharth
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 8862 - 8866
  • [46] Learning Problem-agnostic Speech Representations from Multiple Self-supervised Tasks
    Pascual, Santiago
    Ravanelli, Mirco
    Serra, Joan
    Bonafonte, Antonio
    Bengio, Yoshua
    INTERSPEECH 2019, 2019, : 161 - 165
  • [47] Improving BERT With Self-Supervised Attention
    Chen, Yiren
    Kou, Xiaoyu
    Bai, Jiangang
    Tong, Yunhai
    IEEE ACCESS, 2021, 9 : 144129 - 144139
  • [48] Plugging Self-Supervised Monocular Depth into Unsupervised Domain Adaptation for Semantic Segmentation
    Cardace, Adriano
    De Luigi, Luca
    Ramirez, Pierluigi Zama
    Salti, Samuele
    Di Stefano, Luigi
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 1999 - 2009
  • [49] Cuepervision: self-supervised learning for continuous domain adaptation without catastrophic forgetting
    Schutera, Mark
    Hafner, Frank M.
    Abhau, Jochen
    Hagenmeyer, Veit
    Mikut, Ralf
    Reischl, Markus
    IMAGE AND VISION COMPUTING, 2021, 106
  • [50] Domain Adaptation With Self-Supervised Learning and Feature Clustering for Intelligent Fault Diagnosis
    Lu, Nannan
    Xiao, Hanhan
    Ma, Zhanguo
    Yan, Tong
    Han, Min
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (06) : 7657 - 7670