ASR ERROR DETECTION USING RECURRENT NEURAL NETWORK LANGUAGE MODEL AND COMPLEMENTARY ASR

Cited by: 0

Authors:

Tam, Yik-Cheung [1]

Lei, Yun [1]

Zheng, Jing [1]

Wang, Wen [1]

Affiliations:

[1] SRI Int, Speech Technol & Res Lab, Menlo Pk, CA 94025 USA

Source:

2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2014

Keywords:

ASR error detection; recurrent neural network language model; deep neural network acoustic model; complementary ASR; CONFIDENCE MEASURES; SPEECH RECOGNITION

DOI:

Not available

Chinese Library Classification number:

O42 [Acoustics]

Discipline classification codes:

070206; 082403

Abstract:

Detecting automatic speech recognition (ASR) errors can play an important role in effective human-computer spoken dialogue systems, as recognition errors can hinder accurate system understanding of user intents. Our goal is to locate errors in an utterance so that the dialogue manager can pose appropriate clarification questions to the users. We propose two approaches to improve ASR error detection: (1) using recurrent neural network language models to capture long-distance word context within the current utterance and across previous utterances; (2) using a complementary ASR system. The intuition is that when two complementary ASR systems disagree on a region in an utterance, this region is most likely an error. We train a neural network predictor of errors using a variety of features. We performed experiments on both English and Iraqi Arabic ASR and observed significant improvement in error detection using the proposed methods.
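
The disagreement intuition in the abstract can be illustrated with a minimal sketch. This is not the authors' method: the abstract describes a neural network predictor trained on a variety of features, whereas the snippet below only aligns two word sequences and flags the words where they differ. The function name `disagreement_flags` and the example hypotheses are hypothetical, and the alignment uses Python's standard `difflib` rather than any ASR toolkit.

```python
from difflib import SequenceMatcher


def disagreement_flags(primary, secondary):
    """Return one boolean per word of `primary`.

    A flag is True where the (hypothetical) complementary hypothesis
    `secondary` replaces or omits that word. Words that appear only in
    `secondary` have no counterpart in `primary` and are not flagged.
    """
    flags = [False] * len(primary)
    matcher = SequenceMatcher(a=primary, b=secondary, autojunk=False)
    for tag, i1, i2, _j1, _j2 in matcher.get_opcodes():
        if tag in ("replace", "delete"):
            for i in range(i1, i2):
                flags[i] = True
    return flags


# Made-up hypotheses from two recognizers for the same utterance.
hyp_a = "i want to fly to boston".split()
hyp_b = "i want to fly to austin".split()
for word, flagged in zip(hyp_a, disagreement_flags(hyp_a, hyp_b)):
    print(word, "DISAGREE" if flagged else "ok")
```

In a detector of the kind the abstract describes, a per-word flag like this would plausibly be one input feature among several rather than a decision on its own.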


Pages: 5
