REMEMBER THE CONTEXT! ASR SLOT ERROR CORRECTION THROUGH MEMORIZATION

被引:3
|
作者
Bekal, Dhanush [1 ]
Shenoy, Ashish [1 ]
Sunkara, Monica [1 ]
Bodapati, Sravan [1 ]
Kirchhoff, Katrin [1 ]
机构
[1] Amazon AWS AI, Seattle, WA 98109 USA
关键词
speech recognition; slot error correction; transformer; k-nearest-neighbor search; long tail recognition; DOMAIN ADAPTATION;
D O I
10.1109/ASRU51503.2021.9688109
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Accurate recognition of slot values such as domain specific words or named entities by automatic speech recognition (ASR) systems forms the core of the Goal-oriented Dialogue Systems. Although it is a critical step with direct impact on downstream tasks such as language understanding, many domain agnostic ASR systems tend to perform poorly on domain specific or long tail words. They are often supplemented with slot error correcting systems but it is often hard for any neural model to directly output such rare entity words. To address this problem, we propose k-nearest neighbor (k-NN) search that outputs domain-specific entities from an explicit datastore. We improve error correction rate by conveniently augmenting a pretrained joint phoneme and text based transformer sequence to sequence model with k-NN search during inference. We evaluate our proposed approach on five different domains containing long tail slot entities such as full names, airports, street names, cities, states. Our best performing error correction model shows a relative improvement of 7.4% in word error rate (WER) on rare word entities over the baseline and also achieves a relative WER improvement of 9.8% on an out of vocabulary (OOV) test set.
引用
收藏
页码:236 / 243
页数:8
相关论文
共 50 条
  • [21] Soft error resilient system design through error correction
    Mitra, Subhasish
    Zhang, Ming
    Seifert, Norbert
    Mak, T. M.
    Kim, Kee Sup
    IFIP VLSI-SOC 2006: IFIP WG 10.5 INTERNATIONAL CONFERENCE ON VERY LARGE SCALE INTEGRATION & SYSTEM-ON-CHIP, 2006, : 332 - +
  • [22] Soft error resilient system design through error correction
    Mitra, Subhasish
    Zhang, Ming
    Seifert, Norbert
    Mak, T. M.
    Kim, Kee Sup
    VLSI-SOC: RESEARCH TRENDS IN VLSI AND SYSTEMS ON CHIP, 2008, : 143 - +
  • [23] Lattice re-scoring during manual editing for automatic error correction of ASR transcripts
    Runarsdottir, Anna, V
    Helgadottir, Inga R.
    Gudnason, Jon
    INTERSPEECH 2019, 2019, : 3810 - 3814
  • [24] ALTERNATIVE HYPOTHESIS GENERATION USING A WEIGHTED KERNEL FEATURE MATRIX FOR ASR SUBSTITUTION ERROR CORRECTION
    Liu, Chao-Hong
    Wu, Chung-Hsien
    Sarwono, David
    2012 8TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, 2012, : 1 - 5
  • [25] Effective ASR Error Correction Leveraging Phonetic, Semantic Information and N-best hypotheses
    Wang, Hsin-Wei
    Yan, Bi-Cheng
    Wang, Yi-Cheng
    Chen, Berlin
    PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 117 - 122
  • [26] ASR Independent Hybrid Recurrent Neural Network Based Error Correction for Dialog System Applications
    Choi, Junhwi
    Ryu, Seonghan
    Lee, Kyusong
    Kim, Yonghee
    Koo, Sangjun
    Bang, Jeesoo
    Park, Seonyeong
    Lee, Gary Geunbae
    MULTIMODAL ANALYSES ENABLING ARTIFICIAL AGENTS IN HUMAN-MACHINE INTERACTION, 2015, 8757 : 69 - 77
  • [27] IMPROVING MEMORY RELIABILITY THROUGH ERROR CORRECTION
    FERRISPRABHU, AV
    COMPUTER DESIGN, 1979, 18 (07): : 137 - &
  • [28] EVALUATING ASR-9 AZIMUTH ERROR MODELS THROUGH ANALYSIS OF TARGETS OF OPPORTUNITY
    Mayer, Colin
    Tzanos, Panos
    2011 IEEE/AIAA 30TH DIGITAL AVIONICS SYSTEMS CONFERENCE (DASC), 2011,
  • [29] Context-Dependent Error Correction of Spoken Referring Expressions
    Zukerman, Ingrid
    Partovi, Andisheh
    Kim, Su Nam
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2032 - 2036
  • [30] CARE: context-aware sequencing read error correction
    Kallenborn, Felix
    Hildebrandt, Andreas
    Schmidt, Bertil
    BIOINFORMATICS, 2021, 37 (07) : 889 - 895