Improving the performance of an LVCSR system through ensembles of acoustic models

Cited by: 0
Authors
Zhang, R [1]
Rudnicky, AI [1]
Affiliations
[1] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA
Keywords
DOI
Not available
Chinese Library Classification number
O42 [Acoustics]
Discipline classification code
070206; 082403
Abstract
This paper describes our work on applying ensembles of acoustic models to the problem of large vocabulary continuous speech recognition (LVCSR). We propose three algorithms for constructing ensembles. The first two have their roots in bagging algorithms; however, instead of randomly sampling examples, our algorithms construct training sets based on the word error rate. The third is a boosting-style algorithm. Unlike other boosting methods, which demand large computation and storage resources, our method presents a more efficient solution suitable for acoustic model training. We also investigate a method that seeks an optimal combination of models. We report experimental results on a large real-world corpus collected from the Carnegie Mellon Communicator dialog system. We observe significant improvements in system performance, with up to a 15.56% relative reduction in word error rate.
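The abstract reports its results as a relative reduction in word error rate (WER). As a minimal sketch of how that metric is conventionally computed, the Python below derives WER from a word-level edit distance and then the relative reduction between two systems. The function names and the example utterance are hypothetical and chosen for illustration; they are not taken from the paper, and the paper's own scoring tools may differ.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref, hyp = reference.split(), hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] is the edit distance between the reference prefix seen so far
    # and the first j words of the hypothesis.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion: reference word is missing
                curr[j - 1] + 1,     # insertion: extra hypothesis word
                prev[j - 1] + cost,  # substitution, or match when cost is 0
            ))
        prev = curr
    return prev[-1] / len(ref)


def relative_reduction(baseline_wer: float, new_wer: float) -> float:
    """Fraction of the baseline WER removed by the new system."""
    if baseline_wer <= 0:
        raise ValueError("baseline WER must be positive")
    return (baseline_wer - new_wer) / baseline_wer


# Hypothetical utterance: one deleted word and one substituted word
# against a six-word reference.
example_wer = word_error_rate(
    "i want a flight to boston",
    "i want flight to austin",
)
```

Under this definition, a relative reduction of 15.56% means the ensemble's WER is 84.44% of the baseline system's WER; the absolute WER values are not given in the abstract.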
Pages: 876-879
Page count: 4