REMEMBER THE CONTEXT! ASR SLOT ERROR CORRECTION THROUGH MEMORIZATION

Cited by: 3
Authors
Bekal, Dhanush [1]
Shenoy, Ashish [1]
Sunkara, Monica [1]
Bodapati, Sravan [1]
Kirchhoff, Katrin [1]
Affiliations
[1] Amazon AWS AI, Seattle, WA 98109 USA
Keywords
speech recognition; slot error correction; transformer; k-nearest-neighbor search; long tail recognition; domain adaptation
DOI
10.1109/ASRU51503.2021.9688109
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Accurate recognition of slot values, such as domain-specific words or named entities, by automatic speech recognition (ASR) systems is central to goal-oriented dialogue systems. Although it is a critical step with direct impact on downstream tasks such as language understanding, many domain-agnostic ASR systems tend to perform poorly on domain-specific or long-tail words. These systems are often supplemented with slot error correction systems, but it is hard for any neural model to directly output such rare entity words. To address this problem, we propose k-nearest neighbor (k-NN) search that outputs domain-specific entities from an explicit datastore. We improve the error correction rate by augmenting a pretrained joint phoneme- and text-based transformer sequence-to-sequence model with k-NN search during inference. We evaluate the proposed approach on five domains containing long-tail slot entities: full names, airports, street names, cities, and states. Our best-performing error correction model shows a relative improvement of 7.4% in word error rate (WER) on rare word entities over the baseline and also achieves a relative WER improvement of 9.8% on an out-of-vocabulary (OOV) test set.
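The abstract does not spell out how the k-NN search is combined with the transformer at inference time. As a rough illustration only, the sketch below assumes a kNN-LM-style scheme: a datastore maps decoder hidden states (keys) to the token that followed them (values), and the retrieved neighbors form a distribution that is linearly interpolated with the model's own output distribution. The function names, the squared-L2 distance, the temperature, and the weight `lam` are all assumptions made for this example and are not taken from the paper.

```python
import numpy as np


def knn_distribution(query, keys, values, vocab_size, k=4, temperature=1.0):
    """Turn the k nearest datastore entries into a distribution over tokens.

    Hypothetical helper: keys are stored context vectors, values are the
    token ids that followed each context.
    """
    # Squared L2 distance from the query to every stored key.
    dists = np.sum((keys - query) ** 2, axis=1)
    k = min(k, len(keys))
    nearest = np.argsort(dists)[:k]
    # Softmax over negative distances: closer neighbors get more weight.
    logits = -dists[nearest] / temperature
    weights = np.exp(logits - logits.max())
    weights /= weights.sum()
    p_knn = np.zeros(vocab_size)
    # Neighbors that store the same token pool their weight.
    np.add.at(p_knn, values[nearest], weights)
    return p_knn


def interpolate(p_model, p_knn, lam=0.5):
    """Mix the model distribution with the retrieval distribution (assumed form)."""
    return lam * p_knn + (1.0 - lam) * p_model


# Toy datastore: four 2-d keys, each storing a token id from a 4-token vocabulary.
keys = np.array([[0.0, 0.0], [0.0, 1.0], [5.0, 5.0], [5.0, 6.0]])
values = np.array([1, 1, 3, 3])

# The model alone prefers token 2; the query sits close to entries storing token 1.
p_model = np.array([0.1, 0.2, 0.6, 0.1])
query = np.array([0.1, 0.2])

p_knn = knn_distribution(query, keys, values, vocab_size=4, k=2)
p_mixed = interpolate(p_model, p_knn, lam=0.5)
print(p_mixed, int(np.argmax(p_mixed)))
```

In this toy case the model on its own would emit token 2, while the interpolated distribution favors token 1, the token stored with the nearest datastore entries. This is the sense in which an explicit datastore can surface a rare entity that the model assigns low probability to.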
Pages: 236-243
Page count: 8