REMEMBER THE CONTEXT! ASR SLOT ERROR CORRECTION THROUGH MEMORIZATION

被引:3
|
作者
Bekal, Dhanush [1 ]
Shenoy, Ashish [1 ]
Sunkara, Monica [1 ]
Bodapati, Sravan [1 ]
Kirchhoff, Katrin [1 ]
机构
[1] Amazon AWS AI, Seattle, WA 98109 USA
关键词
speech recognition; slot error correction; transformer; k-nearest-neighbor search; long tail recognition; DOMAIN ADAPTATION;
D O I
10.1109/ASRU51503.2021.9688109
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Accurate recognition of slot values such as domain specific words or named entities by automatic speech recognition (ASR) systems forms the core of the Goal-oriented Dialogue Systems. Although it is a critical step with direct impact on downstream tasks such as language understanding, many domain agnostic ASR systems tend to perform poorly on domain specific or long tail words. They are often supplemented with slot error correcting systems but it is often hard for any neural model to directly output such rare entity words. To address this problem, we propose k-nearest neighbor (k-NN) search that outputs domain-specific entities from an explicit datastore. We improve error correction rate by conveniently augmenting a pretrained joint phoneme and text based transformer sequence to sequence model with k-NN search during inference. We evaluate our proposed approach on five different domains containing long tail slot entities such as full names, airports, street names, cities, states. Our best performing error correction model shows a relative improvement of 7.4% in word error rate (WER) on rare word entities over the baseline and also achieves a relative WER improvement of 9.8% on an out of vocabulary (OOV) test set.
引用
收藏
页码:236 / 243
页数:8
相关论文
共 50 条
  • [42] RobustGEC: Robust Grammatical Error Correction Against Subtle Context Perturbation
    Zhang, Yue
    Cui, Leyang
    Zhao, Enbo
    Bi, Wei
    Shi, Shunting
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023, : 16780 - 16793
  • [43] Error Correction Using Long Context Match for Smartphone Speech Recognition
    Liang, Yuan
    Iwano, Koji
    Shinoda, Koichi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2015, E98D (11): : 1932 - 1942
  • [44] Non-autoregressive Error Correction for CTC-based ASR with Phone-conditioned Masked LM
    Futami, Hayato
    Inaguma, Hirofumi
    Ueno, Sei
    Mimura, Masato
    Sakai, Shinsuke
    Kawahara, Tatsuya
    INTERSPEECH 2022, 2022, : 3889 - 3893
  • [45] A Study on Error Feature Analysis and Error Correction in English Translation Through Machine Translation
    Tao G.
    Informatica (Slovenia), 2023, 47 (08): : 13 - 18
  • [46] Modeling the US demand for imports through cointegration and error correction
    Carone, G
    JOURNAL OF POLICY MODELING, 1996, 18 (01) : 1 - 48
  • [47] Overcoming Collaborative Inhibition through Error Correction: A Classroom Experiment
    Gadgil, Soniya
    Nokes-Malach, Timothy J.
    APPLIED COGNITIVE PSYCHOLOGY, 2012, 26 (03) : 410 - 420
  • [48] Spatial noise filtering through error correction for quantum sensing
    Layden, David
    Cappellaro, Paola
    NPJ QUANTUM INFORMATION, 2018, 4
  • [49] Portfolio Insurance through Error-Correction Neural Networks
    Kovalnogov, Vladislav N.
    Fedorov, Ruslan, V
    Generalov, Dmitry A.
    Chukalin, Andrey, V
    Katsikis, Vasilios N.
    Mourtas, Spyridon D.
    Simos, Theodore E.
    MATHEMATICS, 2022, 10 (18)
  • [50] Delay Variation Compensation through Error Correction using Razor
    Chua, Adelson N.
    Maestro, Rico Jossel M.
    Alba, Mark Earvin V.
    Lofamia, Wes Vernon V.
    Pelayo, Bernard Raymond D.
    Fabay, Ken Bryan F.
    Jardin, John Cris F.
    Jocson, Kervin John C.
    Madamba, Joy Alinda R.
    Hizon, John Richard E.
    Alarcon, Louis P.
    PROCEEDINGS 2015 6TH INTERNATIONAL WORKSHOP ON CMOS VARIABILITY (VARI), 2015, : 5 - 8