Improving the performance of an LVCSR system through ensembles of acoustic models

被引:0
|
作者
Zhang, R [1 ]
Rudnicky, AI [1 ]
机构
[1] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper describes our work on applying ensembles of acoustic models to the problem of large vocabulary continuous speech recognition (LVCSR). We propose three algorithms for constructing ensembles. The first two have their roots in bagging algorithms; however, instead of randomly sampling examples our algorithms construct training sets based on the word error rate. The third one is a boosting style algorithm. Different from other boosting methods which demand large resources for computation and storage, our method present a more efficient solution suitable for acoustic model training. We also investigate a method that seeks optimal combination for models. We report experimental results on a large real world corpus collected from the Carnegie Mellon Communicator dialog system. Significant improvements on system performance are observed in that up to 15.56% relative reduction on word error rate is achieved.
引用
收藏
页码:876 / 879
页数:4
相关论文
共 50 条
  • [41] CASE STUDY OF REAL MANUFACTURING SYSTEM IMPROVING THROUGH SIMULATION MODELS
    Sujov, Erika
    Bambura, Roman
    Cierna, Helena
    MM SCIENCE JOURNAL, 2020, 2020 : 3779 - 3783
  • [42] Improving system performance through operating system optimization on embedded devices platform
    Daud, Shuhaizar
    Khalib, Zahereel Ishwar Abdul
    Ahmad, R. Badlishah
    2008 INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION ENGINEERING, VOLS 1-3, 2008, : 1010 - 1015
  • [43] Improving Classification Performance in Dendritic Neuron Models through Practical Initialization Strategies
    Wen, Xiaohao
    Zhou, Mengchu
    Albeshri, Aiiad
    Huang, Lukui
    Luo, Xudong
    Ning, Dan
    SENSORS, 2024, 24 (06)
  • [44] CONSTRUCTING ENSEMBLES OF DISSIMILAR ACOUSTIC MODELS USING HIDDEN ATTRIBUTES OF TRAINING DATA
    Fukuda, Takashi
    Tachibana, Ryuki
    Chaudhari, Upendra
    Ramabhadran, Bhuvana
    Zhan, Puming
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4141 - 4144
  • [45] Inductive Classification Through Evidence-Based Models and Their Ensembles
    Rizzo, Giuseppe
    d'Amato, Claudia
    Fanizzi, Nicola
    Esposito, Floriana
    SEMANTIC WEB: LATEST ADVANCES AND NEW DOMAINS, ESWC 2015, 2015, 9088 : 418 - 433
  • [46] IMPROVING UNIVERSITY PERFORMANCE THROUGH ICT BASED KNOWLEDGE MANAGEMENT SYSTEM
    Numprasertchai, Somchai
    Poovarawan, Yuen
    INTERNATIONAL JOURNAL OF INNOVATION AND TECHNOLOGY MANAGEMENT, 2008, 5 (02) : 167 - 178
  • [47] Improving performance through Lean
    Bhasin, Sanjay
    INTERNATIONAL JOURNAL OF MANAGEMENT SCIENCE AND ENGINEERING MANAGEMENT, 2011, 6 (01) : 23 - 36
  • [48] Improving Acoustic Models in TORGO Dysarthric Speech Database
    Joy, Neethu Mariam
    Umesh, S.
    IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2018, 26 (03) : 637 - 645
  • [49] Improving Acoustic Models for Russian Spontaneous Speech Recognition
    Prudnikov, Alexey
    Medennikov, Ivan
    Mendelev, Valentin
    Korenevsky, Maxim
    Khokhlov, Yuri
    SPEECH AND COMPUTER (SPECOM 2015), 2015, 9319 : 234 - 242
  • [50] On Improving Acoustic Models For TORGO Dysarthric Speech Database
    Joy, Neethu Mariam
    Umesh, S.
    Abraham, Basil
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2695 - 2699