New Datasets and Controllable Iterative Data Augmentation Method for Code-switching ASR Error Correction

被引:0
|
作者
Wan, Zhaohong [1 ,2 ]
Wan, Xiaojun [1 ,2 ]
Peng, Wei [3 ]
Li, Rongjun [3 ]
机构
[1] Peking Univ, Wangxuan Inst Comp Technol, Beijing, Peoples R China
[2] Peking Univ, MOE Key Lab Computat Linguist, Beijing, Peoples R China
[3] Huawei Technol, Artificial Intelligence Applicat Res Ctr, Shenzhen, Peoples R China
基金
国家重点研发计划; 美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the wide use of automatic speech recognition(ASR) systems, researchers pay more attention to the ASR error correction task to improve the quality of recognition results. In particular, ASR in bilingual or multilingual settings, namely code-switching ASR, has greater challenges and research value. In this paper, we first present code-switching ASR correction datasets obtained from solid ASR systems and automatic annotators. The datasets contain Chinese-English code-switching dialogues of bilingual speakers in Singapore, Malaysia, and Hong Kong. Based on this task, we propose a controllable iterative (CI) data augmentation method for improving the performance of mainstream ASR error correction systems. With a small amount of training data, our proposed method has the ability to iteratively produce abundant pseudo parallel data from the monolingual corpus for Chinese-English code-switching ASR correction. Results of experiments show that our method achieves the best performance compared with the rulebased, back-translation-based data augmentation methods and large language model ChatGPT.
引用
收藏
页码:8075 / 8087
页数:13
相关论文
共 25 条
  • [1] Acoustic and Textual Data Augmentation for Improved ASR of Code-Switching Speech
    Yilmaz, Emre
    van den Heuvel, Henk
    van Leeuwen, David A.
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1933 - 1937
  • [2] Improving Code-Switching and Named Entity Recognition in ASR with Speech Editing based Data Augmentation
    Liang, Zheng
    Song, Zheshu
    Ma, Ziyang
    Du, Chenpeng
    Yu, Kai
    Chen, Xie
    INTERSPEECH 2023, 2023, : 919 - 923
  • [3] Iterative decoding and error code correction method in holographic data storage
    Süto, A
    Lorincz, E
    OPTICAL COMMUNICATION THEORY AND TECHNIQUES, 2005, : 87 - 94
  • [4] Improving code-switching speech recognition with data augmentation and system combination
    Ma, Duo
    Xu, Haihua
    Li, Guanyu
    Chng, Eng Siong
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 1308 - 1312
  • [5] DATA AUGMENTATION FOR END-TO-END CODE-SWITCHING SPEECH RECOGNITION
    Du, Chenpeng
    Li, Hao
    Lu, Yizhou
    Wang, Lan
    Qian, Yanmin
    2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT), 2021, : 194 - 200
  • [6] Lattice-based Data Augmentation for Code-switching Speech Recognition
    Hartanto, Roland
    Uto, Kuniaki
    Shinoda, Koichi
    PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 1667 - 1672
  • [7] MINIMUM WORD ERROR TRAINING FOR NON-AUTOREGRESSIVE TRANSFORMER-BASED CODE-SWITCHING ASR
    Peng, Yizhou
    Zhang, Jicheng
    Xu, Haihua
    Huang, Hao
    Chng, Eng Siong
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7807 - 7811
  • [8] Acoustic data augmentation for Mandarin-English code-switching speech recognition
    Long, Yanhua
    Li, Yijie
    Zhang, Qiaozheng
    Wei, Shuang
    Ye, Hong
    Yang, Jichen
    APPLIED ACOUSTICS, 2020, 161
  • [9] TEXTUAL DATA AUGMENTATION FOR ARABIC-ENGLISH CODE-SWITCHING SPEECH RECOGNITION
    Hussein, Amir
    Chowdhury, Shammur Absar
    Abdelali, Ahmed
    Dehak, Najim
    Ali, Ahmed
    Khudanpur, Sanjeev
    2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 777 - 784
  • [10] Code-switching Sentence Generation by Generative Adversarial Networks and its Application to Data Augmentation
    Chang, Ching-Ting
    Chuang, Shun-Po
    Lee, Hung-Yi
    INTERSPEECH 2019, 2019, : 554 - 558