Improving the performance of an LVCSR system through ensembles of acoustic models

Cited by: 0
Authors
Zhang, R [1]
Rudnicky, AI [1]
Affiliation
[1] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA
Keywords
DOI
Not available
Chinese Library Classification
O42 [Acoustics]
Discipline classification code
070206; 082403
Abstract
This paper describes our work on applying ensembles of acoustic models to the problem of large vocabulary continuous speech recognition (LVCSR). We propose three algorithms for constructing ensembles. The first two have their roots in bagging algorithms; however, instead of randomly sampling examples, our algorithms construct training sets based on the word error rate. The third is a boosting-style algorithm. Unlike other boosting methods, which demand large computation and storage resources, our method presents a more efficient solution suitable for acoustic model training. We also investigate a method that seeks an optimal combination of models. We report experimental results on a large real-world corpus collected from the Carnegie Mellon Communicator dialog system. We observe significant improvements in system performance, with up to a 15.56% relative reduction in word error rate.
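The headline result in the abstract is a relative reduction in word error rate (WER). As a minimal sketch of how such a figure is conventionally computed, the Python below counts word-level edit errors and derives the relative reduction between a baseline and an improved system. It is not the authors' scoring tool; the function names `word_errors`, `wer`, and `relative_reduction` are illustrative assumptions, and the record itself gives no absolute WER values.

```python
def word_errors(reference: str, hypothesis: str) -> int:
    """Minimum number of word substitutions, deletions and insertions
    needed to turn the hypothesis into the reference (Levenshtein distance)."""
    ref, hyp = reference.split(), hypothesis.split()
    # prev[j] is the edit distance between the reference words processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            curr.append(min(
                prev[j] + 1,             # reference word missing (deletion)
                curr[j - 1] + 1,         # extra hypothesis word (insertion)
                prev[j - 1] + (r != h),  # match or substitution
            ))
        prev = curr
    return prev[-1]


def wer(references, hypotheses) -> float:
    """Corpus-level WER: total word errors over total reference words.
    Assumes the references contain at least one word in total."""
    errors = sum(word_errors(r, h) for r, h in zip(references, hypotheses))
    words = sum(len(r.split()) for r in references)
    return errors / words


def relative_reduction(baseline_wer: float, new_wer: float) -> float:
    """Fraction of the baseline WER removed by the new system."""
    return (baseline_wer - new_wer) / baseline_wer
```

For example, with the made-up sentence pair `"i want a flight to boston"` and `"i want flight to austin"`, `word_errors` counts one deletion and one substitution, giving a WER of 2/6. A relative reduction is always measured against the baseline, so a hypothetical drop from 20% to 15% WER is a 25% relative reduction, not 5%.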
Pages: 876-879
Page count: 4