Data mining for detecting errors in dictation speech recognition

被引:13
|
作者
Zhou, L [1 ]
Shi, Y
Feng, JJ
Sears, A
机构
[1] UMBC, Dept Informat Syst, Baltimore, MD 21250 USA
[2] UMBC, Dept Comp Sci & Elect Engn, Baltimore, MD 21250 USA
来源
关键词
document retrieval; speech data mining; speech recognition;
D O I
10.1109/TSA.2005.851874
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The efficiency promised by a dictation speech recognition (DSR) system is lessened by the need for correcting recognition errors. Error detection is the precursor of error correction. Developing effective techniques for error detection can thus lead to improved error correction. Current research on error detection has focused mainly on transcription and/or domain-specific speech. Error detection in DSR has been studied less. We propose data mining models for detecting errors in DSR. Instead of relying on internal parameters from DSR systems, we propose a loosely coupled approach to error detection based on features extracted from the DSR output. The features mainly came from two sources: confidence scores and linguistics parsing. Link grammar was innovatively applied to error detection. Three data mining techniques, including Naive Bayes, neural networks, and Support Vector Machines (SVMs), were evaluated on 5M DSR corpora. The experimental results showed that significant performance was achieved in that F-measures for error detection ranged from 55.3 % to 62.5 %. This study provided insights into the merit of different data-mining techniques and different types of features in error detection.
引用
收藏
页码:681 / 688
页数:8
相关论文
共 50 条
  • [1] Confidence Measures for Detecting Speech Recognition Errors
    Gada, Jigar
    Rao, Preeti
    Samudravijaya, K.
    [J]. 2013 NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2013,
  • [2] Speech and Speech Recognition during Dictation Corrections
    Vertanen, Keith
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1890 - 1893
  • [3] EFFECTIVE DATA-DRIVEN FEATURE LEARNING FOR DETECTING NAME ERRORS IN AUTOMATIC SPEECH RECOGNITION
    He, Ji
    Marin, Alex
    Ostendorf, Mari
    [J]. 2014 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY SLT 2014, 2014, : 230 - 235
  • [4] Detecting and correcting automatic speech recognition errors with a new model
    Arslan, Recep Sinan
    BariSci, Necaattin
    Arici, Nursal
    Kocer, Sabri
    [J]. TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2021, 29 (05) : 2298 - 2311
  • [5] Dictation and speech recognition technology as test accommodations
    MacArthur, CA
    Cavalier, AR
    [J]. EXCEPTIONAL CHILDREN, 2004, 71 (01) : 43 - 58
  • [6] Hypothesis Combination for Slovak Dictation Speech Recognition
    Lojka, Martin
    Juhar, Jozef
    [J]. 2014 56TH INTERNATIONAL SYMPOSIUM ELMAR (ELMAR), 2014, : 43 - 46
  • [7] Research on speech recognition models in the Chinese dictation machine
    Zheng, Fang
    Wu, Wenhu
    Fang, Ditang
    [J]. Qinghua Daxue Xuebao/Journal of Tsinghua University, 1997, 37 (09): : 37 - 40
  • [8] SPEECH RECOGNITION IN THE OFFICE - HOW THE TECHNOLOGY SUPPORTS DICTATION
    SHARMAN, RA
    [J]. COMPUTER JOURNAL, 1994, 37 (09): : 735 - 744
  • [9] SPEECH RECOGNITION - KURZWEIL BRINGS VOICE DICTATION TO WINDOWS
    ANDREWS, D
    [J]. BYTE, 1994, 19 (08): : 48 - 48
  • [10] Speech recognition in the office: how the technology supports dictation
    [J]. Sharman, R.A., 1600, Oxford Univ Press, Oxford, United Kingdom (37):