DISCRIMINATIVE RECOGNITION RATE ESTIMATION FOR N-BEST LIST AND ITS APPLICATION TO N-BEST RESCORING

被引:0
|
作者
Ogawa, Atsunori
Hori, Takaaki
Nakamura, Atsushi
机构
关键词
Speech recognition; discriminative recognition rate estimation; N-best list; N-best rescoring; SPEECH RECOGNITION;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Techniques for estimating recognition rates without using reference transcriptions are essential if we are to judge whether or not speech recognition technology is applicable to a new task. We have proposed a discriminative recognition rate estimation (DRRE) method for 1-best recognition hypotheses and shown its good estimation performance experimentally. In this paper, we extend our DRRE to N-best lists of recognition hypotheses by modifying its feature extraction procedures and efficiently selecting N-best hypotheses for its discriminative model training. In addition, we apply our extended DRRE to N-best rescoring. In the experiments, the extended DRRE also showed good estimation performance for the N-best lists. And using the estimated recognition rates, the 1-best word accuracy was significantly improved by N-best rescoring from the baseline.
引用
收藏
页码:6832 / 6836
页数:5
相关论文
共 50 条
  • [1] N-best list rescoring using syntactic trigrams
    Salgado-Garza, LR
    Stern, RM
    Nolazco, JA
    MICAI 2004: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2004, 2972 : 79 - 88
  • [2] Correcting, Rescoring and Matching: An N-best List Selection Framework for Speech Recognition
    Kuo, Chin-Hung
    Chen, Kuan-Yu
    PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 729 - 734
  • [3] Automatic acoustic segmentation in N-best list rescoring for lecture speech recognition
    Shen, Peng
    Lu, Xugang
    Kawai, Hisashi
    2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,
  • [4] BERT-based Semantic Model for Rescoring N-best Speech Recognition List
    Fohr, Dominique
    Illina, Irina
    INTERSPEECH 2021, 2021, : 1867 - 1871
  • [5] Multimodal N-best List Rescoring with Weakly Supervised Pre-training in Hybrid Speech Recognition
    Song, Yuanfeng
    Huang, Xiaoling
    Zhao, Xuefang
    Jiang, Di
    Wong, Raymond Chi-Wing
    2021 21ST IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2021), 2021, : 1336 - 1341
  • [6] Semantic Features Based N-Best Rescoring Methods for Automatic Speech Recognition
    Liu, Chang
    Zhang, Pengyuan
    Li, Ta
    Yan, Yonghong
    APPLIED SCIENCES-BASEL, 2019, 9 (23):
  • [7] Multirate ASR models for phone-class dependent N-best list rescoring
    Gadde, VR
    Sönmez, K
    Franco, H
    2005 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2005, : 157 - 161
  • [8] An N-Best Candidates-Based Discriminative Training for Speech Recognition Applications
    Chen, Jung-Kuei
    Soong, Frank K.
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (01): : 206 - 216
  • [9] HALLUCINATED N-BEST LISTS FOR DISCRIMINATIVE LANGUAGE MODELING
    Sagae, K.
    Lehr, M.
    Prud'hommeaux, E.
    Xu, P.
    Glenn, N.
    Karakos, D.
    Khudanpur, S.
    Roark, B.
    Saraclar, M.
    Shafran, I.
    Bikel, D.
    Callison-Burch, C.
    Cao, Y.
    Hall, K.
    Hasler, E.
    Koehn, P.
    Lopez, A.
    Post, M.
    Rileyh, D.
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 5001 - 5004
  • [10] N-Best Rescoring by Phoneme Classifiers using Subclass AdaBoost Algorithm
    Fujimura, Hiroshi
    Shinohara, Yusuke
    Masuko, Takashi
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3326 - 3330