Data mining for detecting errors in dictation speech recognition

Cited by: 13

Authors:

Zhou, L [1]

Shi, Y

Feng, JJ

Sears, A

Affiliations:

[1] UMBC, Dept Informat Syst, Baltimore, MD 21250 USA

[2] UMBC, Dept Comp Sci & Elect Engn, Baltimore, MD 21250 USA

Source:

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 2005 / Vol. 13 / Issue 05

Keywords:

document retrieval; speech data mining; speech recognition

DOI:

10.1109/TSA.2005.851874

Chinese Library Classification (CLC) number:

O42 [Acoustics]

Discipline classification codes:

070206; 082403

Abstract:

The efficiency promised by a dictation speech recognition (DSR) system is lessened by the need for correcting recognition errors. Error detection is the precursor of error correction. Developing effective techniques for error detection can thus lead to improved error correction. Current research on error detection has focused mainly on transcription and/or domain-specific speech. Error detection in DSR has been studied less. We propose data mining models for detecting errors in DSR. Instead of relying on internal parameters from DSR systems, we propose a loosely coupled approach to error detection based on features extracted from the DSR output. The features mainly came from two sources: confidence scores and linguistic parsing. Link grammar was innovatively applied to error detection. Three data mining techniques, including Naive Bayes, neural networks, and Support Vector Machines (SVMs), were evaluated on 5M DSR corpora. The experimental results showed that significant performance was achieved in that F-measures for error detection ranged from 55.3% to 62.5%. This study provided insights into the merit of different data-mining techniques and different types of features in error detection.
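The abstract reports results as F-measures for error detection. As a minimal sketch of how such a score can be computed, the example below treats each recognized word as either an error or not and scores the "error" class. It assumes the balanced F-measure (F1, the harmonic mean of precision and recall); the abstract does not state which weighting the authors used. The function name `f_measure` and the toy labels are hypothetical and are not taken from the paper.

```python
def f_measure(gold, predicted):
    """Return (precision, recall, F1) for the 'error' class.

    gold and predicted are equal-length lists of booleans, where
    True means the recognized word is a recognition error.
    Assumption: balanced F-measure (F1); the source does not specify.
    """
    if len(gold) != len(predicted):
        raise ValueError("gold and predicted must have the same length")

    # Count true positives, false positives, and false negatives.
    tp = sum(1 for g, p in zip(gold, predicted) if g and p)
    fp = sum(1 for g, p in zip(gold, predicted) if not g and p)
    fn = sum(1 for g, p in zip(gold, predicted) if g and not p)

    # Guard against division by zero when a class is never predicted or present.
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0

    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Hypothetical toy data: 8 recognized words, 3 of which are truly errors.
gold = [False, True, False, False, True, False, True, False]
predicted = [False, True, True, False, True, False, False, False]

precision, recall, f1 = f_measure(gold, predicted)
print(f"precision={precision:.3f} recall={recall:.3f} F1={f1:.3f}")
```

In the setting the abstract describes, `predicted` would come from a classifier such as Naive Bayes, a neural network, or an SVM trained on features extracted from the DSR output, and `gold` from a manually aligned reference transcript.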


Pages: 681-688

Number of pages: 8

