Analysis of HMM Temporal Evolution for Automatic Speech Recognition and Utterance Verification

被引:0
|
作者
Casar, Marta [1 ]
Fonollosa, Jose A. R. [1 ]
机构
[1] Univ Politecn Cataluna, TALP Res Ctr, Dept Signal Theory & Commun, Barcelona, Spain
关键词
speech recognition; HMM acoustic modeling; state scores; utterance verification;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a double layer speech recognition and utterance verification system based on the analysis of the temporal evolution of HMM's state scores. For the lower layer, it uses standard HMM-based acoustic modeling, followed by a Viterbi grammar-free decoding step which provides us with the state scores of the acoustic models. In the second layer, these state scores are added to the regular set of acoustic parameters, building a new set of expanded HMMs. Using this expanded set of HMMs for speech recognition a significant improvement in performance is achieved. Next, we will use this new architecture for utterance verification in a "second opinion" framework. We will consign to the second layer evaluating the reliability of decoding using the acoustic models from the first layer. An outstanding improvement in performance versus a baseline verification algorithm has been achieved.
引用
收藏
页码:613 / 616
页数:4
相关论文
共 50 条
  • [1] Analysis of HMM temporal evolution for automatic speech recognition and verification
    Casar, Marta
    Fonollosa, Jose A. R.
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2006, 4188 : 359 - 366
  • [2] UTTERANCE CLASSIFICATION CONFIDENCE IN AUTOMATIC SPEECH RECOGNITION
    KIMBALL, R
    ROTHKOPF, MH
    IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1976, 24 (02): : 188 - 189
  • [3] A new hybrid decoding algorithm for speech recognition and utterance verification
    Koo, MW
    Lee, CH
    Juang, BH
    1997 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, PROCEEDINGS, 1997, : 303 - 310
  • [4] Utterance verification in continuous speech recognition: Decoding and training procedures
    Lleida, E
    Rose, RC
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2000, 8 (02): : 126 - 139
  • [5] Speech recognition and utterance verification based on a generalized confidence score
    Koo, MW
    Lee, CH
    Juang, BH
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (08): : 821 - 832
  • [6] AUTOMATIC PRONUNCIATION VERIFICATION FOR SPEECH RECOGNITION
    Rao, Kanishka
    Peng, Fuchun
    Beaufays, Francoise
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5162 - 5166
  • [7] Generative factor analyzed HMM for automatic speech recognition
    Yao, KS
    Paliwal, KK
    Lee, TW
    SPEECH COMMUNICATION, 2005, 45 (04) : 435 - 454
  • [8] HMM AUTOMATIC SPEECH RECOGNITION SYSTEM OF ARABIC ALPHADIGITS
    Alghamdi, Mansour M.
    Alotaibi, Yousef Ajami
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2010, 35 (2C) : 137 - 155
  • [9] Using teager energy cepstrum and HMM distances in automatic speech recognition and analysis of unvoiced speech
    Heracleous, Panikos
    World Academy of Science, Engineering and Technology, 2009, 35 : 633 - 639
  • [10] Using word temporal structure in HMM speech recognition
    Fissore, L
    Laface, P
    Ravera, F
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 975 - 978