Combined optimisation of baseforms and subword models for an HMM based speech recogniser

被引:0
|
作者
Holter, T
Svendsen, T
机构
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper a framework for combined optimisation of baseforms and subword models for a speech recogniser is proposed. Given a set of subword Hidden Markov Models (HMMs) and a set of utterances of a specific word, the modified tree-trellis algorithm and the Baum-Welch re-estimation procedure is used iteratively to achieve a combined optimisation of baseforms and subword models. The DARPA Resource Management (RM) database was used to evaluate the combined optimisation scheme. The proposed method resulted in a monotonic increase in the likelihood score of both test- and training data. When compared to the initial lexicon derived from the DARPA RM-distribution and a set of initial HMMs, a 13% reduction in word error rate is achieved at best.
引用
收藏
页码:321 / 324
页数:4
相关论文
共 50 条
  • [1] Combined optimisation of baseforms and model parameters in speech recognition based on acoustic subword units
    Holter, T
    Svendsen, T
    [J]. 1997 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, PROCEEDINGS, 1997, : 199 - 206
  • [2] Advances in subword-based HMM-DNN speech recognition across languages
    Smit, Peter
    Virpioja, Sami
    Kurimo, Mikko
    [J]. COMPUTER SPEECH AND LANGUAGE, 2021, 66
  • [3] Transforming Features to Compensate Speech Recogniser Models for Noise
    van Dalen, R. C.
    Flego, F.
    Gales, M. J. F.
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2459 - 2462
  • [4] Designing Syllable Models for an HMM Based Speech Recognition System
    Proenca, Kseniya
    Demuynck, Kris
    Van Compernolle, Dirk
    [J]. Speech and Computer, 2016, 9811 : 216 - 223
  • [5] FACTOR ANALYZED VOICE MODELS FOR HMM-BASED SPEECH SYNTHESIS
    Kazumi, Kyosuke
    Nankaku, Yoshihiko
    Tokuda, Keiichi
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4234 - 4237
  • [6] Synthesis of stressed speech from isolated neutral speech using HMM-based models
    BouGhazale, SE
    Hansen, JHL
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1860 - 1863
  • [7] Speech recognition with HMM models for cochlear prostheses
    Sakka, Z
    Kachouri, A
    Samet, M
    [J]. 2004 IEEE International Conference on Industrial Technology (ICIT), Vols. 1- 3, 2004, : 1478 - 1481
  • [8] Subword unit based speech recognition in car environments
    Fischer, A
    Stahl, V
    [J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 257 - 260
  • [9] AN HMM-BASED FORMALISM FOR AUTOMATIC SUBWORD UNIT DERIVATION AND PRONUNCIATION GENERATION
    Razavi, Marzieh
    Magimai-Doss, Mathew
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4639 - 4643
  • [10] On HMM speech recognition based on complex speech analysis
    Kinjo, Tatsuhiko
    Funaki, Keiichi
    [J]. IECON 2006 - 32ND ANNUAL CONFERENCE ON IEEE INDUSTRIAL ELECTRONICS, VOLS 1-11, 2006, : 2605 - +