DISCRIMINATIVE RECOGNITION RATE ESTIMATION FOR N-BEST LIST AND ITS APPLICATION TO N-BEST RESCORING

被引:0
|
作者
Ogawa, Atsunori
Hori, Takaaki
Nakamura, Atsushi
机构
关键词
Speech recognition; discriminative recognition rate estimation; N-best list; N-best rescoring; SPEECH RECOGNITION;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Techniques for estimating recognition rates without using reference transcriptions are essential if we are to judge whether or not speech recognition technology is applicable to a new task. We have proposed a discriminative recognition rate estimation (DRRE) method for 1-best recognition hypotheses and shown its good estimation performance experimentally. In this paper, we extend our DRRE to N-best lists of recognition hypotheses by modifying its feature extraction procedures and efficiently selecting N-best hypotheses for its discriminative model training. In addition, we apply our extended DRRE to N-best rescoring. In the experiments, the extended DRRE also showed good estimation performance for the N-best lists. And using the estimated recognition rates, the 1-best word accuracy was significantly improved by N-best rescoring from the baseline.
引用
收藏
页码:6832 / 6836
页数:5
相关论文
共 50 条
  • [31] N-Best Evaluation for Academic Hiring and Promotion
    Frank, Michael C.
    TRENDS IN COGNITIVE SCIENCES, 2019, 23 (12) : 983 - 985
  • [32] A discriminative training framework using N-best speech recognition transcriptions and scores for spoken utterance classification
    Yaman, Sibel
    Deng, Li
    Yu, Dong
    Wang, Ye-Yi
    Acero, Alex
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 5 - +
  • [33] N-best maximal decoders for part models
    Park, Dennis
    Ramanan, Deva
    2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2011, : 2627 - 2634
  • [34] NEURAL ORACLE SEARCH ON N-BEST HYPOTHESES
    Variani, Ehsan
    Chen, Tongzhou
    Apfel, James
    Ramabhadran, Bhuvana
    Lee, Seungji
    Moreno, Pedro
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7824 - 7828
  • [35] Improved N-Best Extraction with an Evaluation on Language Data
    Bjoerklund, Johanna
    Drewes, Frank
    Jonsson, Anna
    COMPUTATIONAL LINGUISTICS, 2022, 48 (01) : 119 - 153
  • [36] The ESAT 2008 System for N-Best Dutch Speech Recognition Benchmark
    Demuynck, Kris
    Puurula, Antti
    Van Compernolle, Dirk
    Wambacq, Patrick
    2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 339 - 344
  • [37] A word graph based N-Best search in continuous speech recognition
    Tran, BH
    Seide, F
    Steinbiss, V
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 2127 - 2130
  • [38] Automated generation of N-best pronunciations of proper nouns
    Deshmukh, N
    Weber, M
    Picone, J
    1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 283 - 286
  • [39] Explicit, N-best formant features for vowel classification
    Schmid, P
    Barnard, E
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 991 - 994
  • [40] Leveraging ASR N-best in Deep Entity Retrieval
    Wang, Haoyu
    Chen, John
    Laali, Majid
    Durda, Kevin
    King, Jeff
    Campbell, William
    Liu, Yang
    INTERSPEECH 2021, 2021, : 261 - 265