DISCRIMINATIVE RECOGNITION RATE ESTIMATION FOR N-BEST LIST AND ITS APPLICATION TO N-BEST RESCORING

被引:0
|
作者
Ogawa, Atsunori
Hori, Takaaki
Nakamura, Atsushi
机构
关键词
Speech recognition; discriminative recognition rate estimation; N-best list; N-best rescoring; SPEECH RECOGNITION;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Techniques for estimating recognition rates without using reference transcriptions are essential if we are to judge whether or not speech recognition technology is applicable to a new task. We have proposed a discriminative recognition rate estimation (DRRE) method for 1-best recognition hypotheses and shown its good estimation performance experimentally. In this paper, we extend our DRRE to N-best lists of recognition hypotheses by modifying its feature extraction procedures and efficiently selecting N-best hypotheses for its discriminative model training. In addition, we apply our extended DRRE to N-best rescoring. In the experiments, the extended DRRE also showed good estimation performance for the N-best lists. And using the estimated recognition rates, the 1-best word accuracy was significantly improved by N-best rescoring from the baseline.
引用
收藏
页码:6832 / 6836
页数:5
相关论文
共 50 条
  • [21] Results of the N-Best 2008 Dutch Speech Recognition Evaluation
    van Leeuwen, David A.
    Kessens, Judith
    Sanders, Eric
    van den Heuvel, Henk
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2531 - +
  • [22] Parsing N-best lists of handwritten sentences
    Zimmermann, M
    Chappelier, JC
    Bunke, H
    SEVENTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2003, : 572 - 576
  • [23] RESCORING N-BEST SPEECH RECOGNITION LIST BASED ON ONE-ON-ONE HYPOTHESIS COMPARISON USING ENCODER-CLASSIFIER MODEL
    Ogawa, Atsunori
    Delcroix, Marc
    Karita, Shigeki
    Nakatani, Tomohiro
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 6099 - 6103
  • [24] Improving pronunciation inference using n-best list, acoustics and orthography
    Anumanchipalli, Gopala Krishna
    Ravishankar, Mosur
    Reddy, Raj
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 925 - +
  • [25] Empirically combining unnormalized NNLM and back-off N-gram for fast N-best rescoring in speech recognition
    Shi, Yongzhe
    Zhang, Wei-Qiang
    Cai, Meng
    Liu, Jia
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2014,
  • [26] Utterance verification using search confusion rate and its N-best approach
    Kim, K
    Kim, H
    Hahn, M
    ETRI JOURNAL, 2005, 27 (04) : 461 - 464
  • [27] DISCRIMINATIVE LEARNING USING LINGUISTIC FEATURES TO RESCORE N-BEST SPEECH HYPOTHESES
    Georgescul, Maria
    Rayner, Manny
    Bouillon, Pierrette
    Tsourakis, Nikos
    2008 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY: SLT 2008, PROCEEDINGS, 2008, : 97 - 100
  • [28] COMPUTING THE N-BEST LOOPLESS PATHS IN A NETWORK
    CLARKE, S
    KRIKORIAN, A
    RAUSEN, J
    JOURNAL OF THE SOCIETY FOR INDUSTRIAL AND APPLIED MATHEMATICS, 1963, 11 (04): : 1096 - 1102
  • [29] Empirically combining unnormalized NNLM and back-off N-gram for fast N-best rescoring in speech recognition
    Yongzhe Shi
    Wei-Qiang Zhang
    Meng Cai
    Jia Liu
    EURASIP Journal on Audio, Speech, and Music Processing, 2014
  • [30] LOOSE PHRASE EXTRACTION WITH n-BEST ALIGNMENTS
    Xue Yongzeng Li Sheng (Microsoft Key Laboratory of Natural Language Processing and Speech
    Journal of Electronics(China), 2007, (04) : 567 - 571