LEARNING ASR-ROBUST CONTEXTUALIZED EMBEDDINGS FOR SPOKEN LANGUAGE UNDERSTANDING

Cited by: 0
Authors
Huang, Chao-Wei [1]
Chen, Yun-Nung [1]
Affiliations
[1] Natl Taiwan Univ, Taipei, Taiwan
Keywords
spoken language understanding; contextualized embedding; ASR robustness; RECURRENT NEURAL-NETWORKS;
DOI
10.1109/icassp40776.2020.9054689
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
Employing pre-trained language models (LMs) to extract contextualized word representations has achieved state-of-the-art performance on various NLP tasks. However, applying this technique to noisy transcripts generated by an automatic speech recognizer (ASR) remains a concern, because recognition errors can degrade the extracted representations. This paper therefore focuses on making contextualized representations more ASR-robust. We propose a novel confusion-aware fine-tuning method to mitigate the impact of ASR errors on pre-trained LMs. Specifically, we fine-tune LMs to produce similar representations for acoustically confusable words, which are obtained from word confusion networks (WCNs) produced by the ASR. Experiments on multiple benchmark datasets show that the proposed method significantly improves the performance of spoken language understanding on ASR transcripts.
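The idea of pulling together the representations of acoustically confusable words can be illustrated with a toy objective. The sketch below is an assumption-laden illustration, not the authors' implementation: the abstract does not state the distance function, so cosine distance is assumed here, and the names `cosine_distance`, `confusion_loss`, and `confusable_pairs` are hypothetical. In practice the vectors would come from a pre-trained LM and this term would be minimized during fine-tuning.

```python
import math

def cosine_distance(u, v, eps=1e-12):
    """Return 1 - cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / (norm_u * norm_v + eps)

def confusion_loss(ref_vecs, hyp_vecs, confusable_pairs):
    """Mean distance over confusable word pairs (hypothetical objective).

    ref_vecs: contextualized vectors for the reference transcript words.
    hyp_vecs: contextualized vectors for the ASR hypothesis words.
    confusable_pairs: (i, j) index pairs, assumed to be extracted from a
        word confusion network, linking reference word i to hypothesis word j.
    """
    if not confusable_pairs:
        return 0.0
    total = sum(cosine_distance(ref_vecs[i], hyp_vecs[j])
                for i, j in confusable_pairs)
    return total / len(confusable_pairs)

# Toy example: "fair" (reference) vs. "fare" (ASR output) at position 0.
ref = [[1.0, 0.0], [0.5, 0.5]]
hyp = [[0.0, 1.0], [0.5, 0.5]]
loss = confusion_loss(ref, hyp, [(0, 0)])
```

Minimizing such a term would push the vectors of the confusable pair toward each other, which is the behaviour the abstract describes; how it is combined with other training objectives is not specified in this record.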
Pages: 8009-8013
Page count: 5
Related papers
50 in total
  • [1] ASR-Robust Spoken Language Understanding on ASR-GLUE dataset
    Feng, Lingyun
    Yu, Jianwei
    Cai, Deng
    Liu, Songxiang
    Zheng, Hai-Tao
    Wang, Yan
    INTERSPEECH 2022, 2022, : 1101 - 1105
  • [2] MCLF: A Multi-grained Contrastive Learning Framework for ASR-robust Spoken Language Understanding
    Huang, Zhiqi
    Chen, Dongsheng
    Zhu, Zhihong
    Cheng, Xuxin
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, : 7936 - 7949
  • [3] MoE-SLU: Towards ASR-Robust Spoken Language Understanding via Mixture-of-Experts
    Cheng, Xuxin
    Zhu, Zhihong
    Zhuang, Xianwei
    Chen, Zhanpeng
    Huang, Zhiqi
    Zou, Yuexian
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 14868 - 14879
  • [4] Towards an ASR error robust Spoken Language Understanding System
    Ruan, Weitong
    Nechaev, Yaroslav
    Chen, Luoxin
    Su, Chengwei
    Kiss, Imre
    INTERSPEECH 2020, 2020, : 901 - 905
  • [5] Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding
    Chang, Ya-Hsin
    Chen, Yun-Nung
    INTERSPEECH 2022, 2022, : 3458 - 3462
  • [6] ROBUST SPOKEN LANGUAGE UNDERSTANDING WITH UNSUPERVISED ASR-ERROR ADAPTATION
    Zhu, Su
    Lan, Ouyu
    Yu, Kai
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 6179 - 6183
  • [7] ARoBERT: An ASR Robust Pre-Trained Language Model for Spoken Language Understanding
    Wang, Chengyu
    Dai, Suyang
    Wang, Yipeng
    Yang, Fei
    Qiu, Minghui
    Chen, Kehan
    Zhou, Wei
    Huang, Jun
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 1207 - 1218
  • [8] ASRLM: ASR-Robust Language Model Pre-training via Generative and Discriminative Learning
    Hu, Qian
    Han, Xue
    Wang, Yiting
    Wang, Yitong
    Deng, Chao
    Feng, Junlan
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, PT III, NLPCC 2024, 2025, 15361 : 407 - 419
  • [9] ASR error management for improving spoken language understanding
    Simonnet, Edwin
    Ghannay, Sahar
    Camelin, Nathalie
    Esteve, Yannick
    De Mori, Renato
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 3329 - 3333
  • [10] JOINT LEARNING OF WORD AND LABEL EMBEDDINGS FOR SEQUENCE LABELLING IN SPOKEN LANGUAGE UNDERSTANDING
    Wu, Jiewen
    D'Haro, Luis Fernando
    Chen, Nancy F.
    Krishnaswamy, Pavitra
    Banchs, Rafael E.
    2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), 2019, : 800 - 806