ASR ERROR DETECTION USING RECURRENT NEURAL NETWORK LANGUAGE MODEL AND COMPLEMENTARY ASR

被引:0
|
作者
Tam, Yik-Cheung [1 ]
Lei, Yun [1 ]
Zheng, Jing [1 ]
Wang, Wen [1 ]
机构
[1] SRI Int, Speech Technol & Res Lab, Menlo Pk, CA 94025 USA
关键词
ASR error detection; recurrent neural network language model; deep neural network acoustic model; complementary ASR; CONFIDENCE MEASURES; SPEECH RECOGNITION;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Detecting automatic speech recognition (ASR) errors can play an important role for effective human-computer spoken dialogue system, as recognition errors can hinder accurate system understanding of user intents. Our goal is to locate errors in an utterance so that the dialogue manager can pose appropriate clarification questions to the users. We propose two approaches to improve ASR error detection: (1) using recurrent neural network language models to capture long-distance word context within and across previous utterances; (2) using a complementary ASR system. The intuition is that when two complementary ASR systems disagree on a region in an utterance, this region is most likely an error. We train a neural network predictor of errors using a variety of features. We performed experiments on both English and Iraqi Arabic ASR and observed significant improvement in error detection using the proposed methods.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] System-independent ASR error detection and classification using Recurrent Neural Network
    Errattahia, Rahhal
    Hannani, Asmaa E. L.
    Hain, Thomas
    Ouahmane, Hassan
    COMPUTER SPEECH AND LANGUAGE, 2019, 55 : 187 - 199
  • [2] ASR ERROR DETECTION AND RECOGNITION RATE ESTIMATION USING DEEP BIDIRECTIONAL RECURRENT NEURAL NETWORKS
    Ogawa, Atsunori
    Hori, Takaaki
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4370 - 4374
  • [3] Unsupervised and Efficient Vocabulary Expansion for Recurrent Neural Network Language Models in ASR
    Khassanov, Yerbolat
    Chng, Eng Siong
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3343 - 3347
  • [4] ASR ERROR DETECTION IN A CONVERSATIONAL SPOKEN LANGUAGE TRANSLATION SYSTEM
    Chen, Wei
    Ananthakrishnan, Sankaranarayanan
    Kumar, Rohit
    Prasad, Rohit
    Natarajan, Prem
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7418 - 7422
  • [5] ASR Independent Hybrid Recurrent Neural Network Based Error Correction for Dialog System Applications
    Choi, Junhwi
    Ryu, Seonghan
    Lee, Kyusong
    Kim, Yonghee
    Koo, Sangjun
    Bang, Jeesoo
    Park, Seonyeong
    Lee, Gary Geunbae
    MULTIMODAL ANALYSES ENABLING ARTIFICIAL AGENTS IN HUMAN-MACHINE INTERACTION, 2015, 8757 : 69 - 77
  • [6] WORD EMBEDDINGS COMBINATION AND NEURAL NETWORKS FOR ROBUSTNESS IN ASR ERROR DETECTION
    Ghannay, Sahar
    Esteve, Yannick
    Camelin, Nathalie
    2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 1671 - 1675
  • [7] Improving Acoustic Model for English ASR System using Deep Neural Network
    Quoc Bao Nguyen
    Tat Thang Vu
    Chi Mai Luong
    2015 IEEE RIVF INTERNATIONAL CONFERENCE ON COMPUTING & COMMUNICATION TECHNOLOGIES - RESEARCH, INNOVATION, AND VISION FOR THE FUTURE (RIVF), 2015, : 25 - 29
  • [8] REVISITING RECURRENT NEURAL NETWORKS FOR ROBUST ASR
    Vinyals, Oriol
    Ravuri, Suman V.
    Povey, Daniel
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4085 - 4088
  • [9] Class-Based Neural Network Language Model for Second-Pass Rescoring in ASR
    Dai, Lingfeng
    Liu, Qi
    Yu, Kai
    INTERSPEECH 2021, 2021, : 2022 - 2026
  • [10] Toward the Ultimate ASR Language Model
    Jelinek, Frederick
    Parada, Carolina
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2008, 5246 : 15 - 15