N-best rescoring for speech recognition using penalized logistic regression machines with garbage class

Cited by: 0
Authors
Birkenes, Oystein [1, 2]
Matsui, Tomoko [1]
Tanabe, Kunio [3]
Myrvoll, Tor Andre [1, 2]
Affiliations
[1] Inst Stat Math, Minato Ku, Tokyo 106, Japan
[2] Norwegian Univ Sci & Technol, Dept Elect & Telecommun, Trondheim, Norway
[3] Waseda Univ, Tokyo, Japan
Keywords
speech recognition; N-best rescoring; PLRM; garbage class; Aurora2;
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification codes
070206; 082403;
Abstract
State-of-the-art pattern recognition approaches like neural networks or kernel methods have had only limited success in speech recognition. The difficulties often encountered include the varying lengths of speech signals, the need to handle sequences of labels (e.g., digit strings), and unknown segmentation. In this paper we present a combined hidden Markov model (HMM) and penalized logistic regression machine (PLRM) approach to continuous speech recognition that can cope with these difficulties. The key ingredients of our approach are N-best rescoring and PLRM with a garbage class. Experiments on the Aurora2 connected digits database show a significant increase in recognition accuracy relative to a purely HMM-based system.
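As a rough illustration of the N-best rescoring idea named in the abstract, the following is a minimal generic sketch, not the authors' implementation. It assumes a first-pass decoder has already produced a list of hypotheses, each with an HMM log-likelihood and per-segment log-probabilities from a second classifier (standing in for the PLRM). The names `Hypothesis`, `rescore`, and `weight`, as well as the additive score combination, are hypothetical choices made for this example.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class Hypothesis:
    labels: Sequence[str]                    # hypothesized label sequence, e.g. a digit string
    hmm_log_likelihood: float                # first-pass score (assumed given by an HMM decoder)
    segment_log_posteriors: Sequence[float]  # assumed per-segment log-probabilities from a second classifier


def rescore(nbest: List[Hypothesis], weight: float = 1.0) -> Hypothesis:
    """Return the hypothesis with the highest combined score.

    The combined score is the HMM log-likelihood plus a weighted sum of
    the per-segment classifier log-probabilities (an illustrative choice).
    """
    def combined(h: Hypothesis) -> float:
        return h.hmm_log_likelihood + weight * sum(h.segment_log_posteriors)

    return max(nbest, key=combined)


# Toy N-best list with made-up scores, for illustration only.
nbest = [
    Hypothesis(("1", "2"), -100.0, (-2.0, -3.0)),
    Hypothesis(("1", "7"), -101.0, (-0.1, -0.2)),
]
best = rescore(nbest)             # second-pass scores change the ranking
first_pass = rescore(nbest, 0.0)  # weight 0 falls back to the HMM ranking
```

In this toy list the HMM alone prefers the first hypothesis, while the combined score prefers the second because its segments are classified with higher confidence. How a garbage class enters the segment scores is not specified in the abstract and is not modeled here.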
Pages: 449 / +
Page count: 2
Related papers
50 in total
  • [21] N-best vector quantization for isolated word speech recognition
    Nose, Masaya
    Maki, Shuichi
    Yamane, Nobumoto
    Morikawa, Yoshitaka
    PROCEEDINGS OF SICE ANNUAL CONFERENCE, VOLS 1-8, 2007, : 2053 - +
  • [22] Results of the N-Best 2008 Dutch Speech Recognition Evaluation
    van Leeuwen, David A.
    Kessens, Judith
    Sanders, Eric
    van den Heuvel, Henk
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2531 - +
  • [23] Determination of the number of candidates using recognition scores for N-best based speech interface
    Cho, K
    Yamashita, Y
    Proceedings of the Sixth IASTED International Conference on Signal and Image Processing, 2004, : 268 - 272
  • [24] USING N-BEST RECOGNITION OUTPUT FOR EXTRACTIVE SUMMARIZATION AND KEYWORD EXTRACTION IN MEETING SPEECH
    Liu, Yang
    Xie, Shasha
    Liu, Fei
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5310 - 5313
  • [25] Penalized Logistic Regression With HMM Log-Likelihood Regressors for Speech Recognition
    Birkenes, Oystein
    Matsui, Tomoko
    Tanabe, Kunio
    Siniscalchi, Sabato Marco
    Myrvoll, Tor Andre
    Johnsen, Magne Hallstein
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (06): : 1440 - 1454
  • [26] The ESAT 2008 System for N-Best Dutch Speech Recognition Benchmark
    Demuynck, Kris
    Puurula, Antti
    Van Compernolle, Dirk
    Wambacq, Patrick
    2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 339 - 344
  • [27] A word graph based N-Best search in continuous speech recognition
    Tran, BH
    Seide, F
    Steinbiss, V
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 2127 - 2130
  • [28] Automatic Speech Recognition of Code Switching Speech using 1-Best Rescoring
    Ahmed, Basem H. A.
    Tan, Tien-Ping
    2012 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2012), 2012, : 137 - 140
  • [29] An N-Best Candidates-Based Discriminative Training for Speech Recognition Applications
    Chen, Jung-Kuei
    Soong, Frank K.
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (01): : 206 - 216
  • [30] A discriminative training framework using N-best speech recognition transcriptions and scores for spoken utterance classification
    Yaman, Sibel
    Deng, Li
    Yu, Dong
    Wang, Ye-Yi
    Acero, Alex
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 5 - +