REMEMBER THE CONTEXT! ASR SLOT ERROR CORRECTION THROUGH MEMORIZATION

Cited by: 3
Authors
Bekal, Dhanush [1]
Shenoy, Ashish [1]
Sunkara, Monica [1]
Bodapati, Sravan [1]
Kirchhoff, Katrin [1]
Affiliation
[1] Amazon AWS AI, Seattle, WA 98109 USA
Keywords
speech recognition; slot error correction; transformer; k-nearest-neighbor search; long tail recognition; domain adaptation
DOI
10.1109/ASRU51503.2021.9688109
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Accurate recognition of slot values, such as domain-specific words or named entities, by automatic speech recognition (ASR) systems forms the core of goal-oriented dialogue systems. Although it is a critical step with direct impact on downstream tasks such as language understanding, many domain-agnostic ASR systems tend to perform poorly on domain-specific or long-tail words. They are often supplemented with slot error correction systems, but it is hard for any neural model to directly output such rare entity words. To address this problem, we propose k-nearest neighbor (k-NN) search that outputs domain-specific entities from an explicit datastore. We improve the error correction rate by augmenting a pretrained joint phoneme- and text-based transformer sequence-to-sequence model with k-NN search during inference. We evaluate our proposed approach on five different domains containing long-tail slot entities such as full names, airports, street names, cities, and states. Our best performing error correction model shows a relative improvement of 7.4% in word error rate (WER) on rare word entities over the baseline and also achieves a relative WER improvement of 9.8% on an out-of-vocabulary (OOV) test set.
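The abstract does not say how the retrieved neighbors are combined with the sequence-to-sequence model's output. As a rough illustration only, the sketch below assumes a kNN-LM-style scheme: decoder states are stored as keys with their target tokens as values, the k nearest keys to the current state are turned into a token distribution, and that distribution is interpolated with the model's own. All names, vectors, probabilities, and the weight `lam` are hypothetical and not taken from the paper.

```python
import math
from collections import defaultdict


def knn_distribution(query, datastore, k, temperature=1.0):
    """Build a token distribution from the k datastore keys nearest to query."""
    # Squared L2 distance from the query to every stored key (brute force).
    scored = [
        (sum((q - x) ** 2 for q, x in zip(query, key)), token)
        for key, token in datastore
    ]
    nearest = sorted(scored, key=lambda pair: pair[0])[:k]
    # Softmax over negative distances; shift by the minimum for stability.
    d_min = nearest[0][0]
    weights = [(math.exp(-(d - d_min) / temperature), tok) for d, tok in nearest]
    total = sum(w for w, _ in weights)
    probs = defaultdict(float)
    for w, tok in weights:
        probs[tok] += w / total  # neighbors sharing a token pool their mass
    return dict(probs)


def interpolate(p_model, p_knn, lam):
    """Mix the two distributions: lam * p_knn + (1 - lam) * p_model."""
    tokens = set(p_model) | set(p_knn)
    return {
        t: lam * p_knn.get(t, 0.0) + (1.0 - lam) * p_model.get(t, 0.0)
        for t in tokens
    }


# Hypothetical datastore: (decoder-state vector, target token) pairs.
datastore = [
    ((0.9, 0.1), "heathrow"),
    ((0.8, 0.2), "heathrow"),
    ((0.1, 0.9), "hydro"),
]
query = (0.85, 0.15)  # assumed decoder state at the current step
p_model = {"heath": 0.6, "heathrow": 0.1, "hydro": 0.3}  # assumed model output

p_knn = knn_distribution(query, datastore, k=2)
p_final = interpolate(p_model, p_knn, lam=0.5)
best = max(p_final, key=p_final.get)
print(best)
```

In this toy setting the model alone prefers the common word, while the retrieved neighbors shift probability mass toward the rare entity stored in the datastore. A real system would likely use an approximate nearest-neighbor index rather than the brute-force scan shown here.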
Pages: 236-243
Number of pages: 8