Boosting systems for large vocabulary continuous speech recognition

被引:14
|
作者
Saon, George [1 ]
Soltau, Hagen [1 ]
机构
[1] IBM TJ Watson Res Ctr, Yorktown Hts, NY 10598 USA
关键词
Speech recognition; Boosting; Acoustic modeling;
D O I
10.1016/j.specom.2011.07.011
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We employ a variant of the popular Adaboost algorithm to train multiple acoustic models such that the aggregate system exhibits improved performance over the individual recognizers. Each model is trained sequentially on re-weighted versions of the training data. At each iteration, the weights are decreased for the frames that are correctly decoded by the current system. These weights are then multiplied with the frame-level statistics for the decision trees and Gaussian mixture components of the next iteration system. The composite system uses a log-linear combination of HMM state observation likelihoods. We report experimental results on several broadcast news transcription setups which differ in the language being spoken (English and Arabic) and amounts of training data. Additionally, we study the impact of boosting on maximum likelihood (ML) and discriminatively trained acoustic models. Our findings suggest that significant gains can be obtained for small amounts of training data even after feature and model-space discriminative training. (C) 2011 Elsevier B.V. All rights reserved.
引用
收藏
页码:212 / 218
页数:7
相关论文
共 50 条
  • [1] Large-Vocabulary Continuous Speech Recognition Systems
    Saon, George
    Chien, Jen-Tzung
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2012, 29 (06) : 18 - 33
  • [2] Boosting the performance of connectionist large vocabulary speech recognition
    Cook, G
    Robinson, T
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1305 - 1308
  • [3] Boosting acoustic models in large vocabulary speech recognition
    Meyer, C
    Schramm, H
    [J]. PROCEEDINGS OF THE SIXTH IASTED INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING, 2004, : 255 - 260
  • [4] Vietnamese Large Vocabulary Continuous Speech Recognition
    Ngoc Thang Vu
    Schultz, Tanja
    [J]. 2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 333 - 338
  • [5] Advances in large vocabulary continuous speech recognition
    Zweig, G
    Picheny, M
    [J]. ADVANCES IN COMPUTERS, VOL. 60: INFORMATION SECURITY, 2004, 60 : 249 - 291
  • [6] Boosting HMM acoustic models in large vocabulary speech recognition
    Meyer, C
    Schramm, H
    [J]. SPEECH COMMUNICATION, 2006, 48 (05) : 532 - 548
  • [7] Developments in large vocabulary, continuous speech recognition of German
    AddaDecker, M
    Adda, G
    Lamel, L
    Gauvain, JL
    [J]. 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 153 - 156
  • [8] Combating Reverberation in Large Vocabulary Continuous Speech Recognition
    Mitra, Vikramjit
    Van Hout, Julien
    McLaren, Mitchell
    Wang, Wen
    Graciarena, Martin
    Vergyri, Dimitra
    Franco, Horacio
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2449 - 2453
  • [9] The RWTH large vocabulary continuous speech recognition system
    Ney, H
    Welling, L
    Ortmanns, S
    Beulen, K
    Wessel, F
    [J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 853 - 856
  • [10] Utilizing Lipreading in Large Vocabulary Continuous Speech Recognition
    Palecek, Karel
    [J]. SPEECH AND COMPUTER, SPECOM 2017, 2017, 10458 : 767 - 776