Analysis of HMM Temporal Evolution for Automatic Speech Recognition and Utterance Verification

被引:0
|
作者
Casar, Marta [1 ]
Fonollosa, Jose A. R. [1 ]
机构
[1] Univ Politecn Cataluna, TALP Res Ctr, Dept Signal Theory & Commun, Barcelona, Spain
关键词
speech recognition; HMM acoustic modeling; state scores; utterance verification;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a double layer speech recognition and utterance verification system based on the analysis of the temporal evolution of HMM's state scores. For the lower layer, it uses standard HMM-based acoustic modeling, followed by a Viterbi grammar-free decoding step which provides us with the state scores of the acoustic models. In the second layer, these state scores are added to the regular set of acoustic parameters, building a new set of expanded HMMs. Using this expanded set of HMMs for speech recognition a significant improvement in performance is achieved. Next, we will use this new architecture for utterance verification in a "second opinion" framework. We will consign to the second layer evaluating the reliability of decoding using the acoustic models from the first layer. An outstanding improvement in performance versus a baseline verification algorithm has been achieved.
引用
收藏
页码:613 / 616
页数:4
相关论文
共 50 条
  • [41] Automatic speech segmentation based on HMM
    Kroul, Martin
    RADIOENGINEERING, 2007, 16 (02) : 56 - 61
  • [42] Discriminative utterance verification for connected digits recognition
    Rahim, MG
    Lee, CH
    Juang, BH
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1997, 5 (03): : 266 - 277
  • [43] Acoustic Analysis for Automatic Speech Recognition
    O'Shaughnessy, Douglas
    PROCEEDINGS OF THE IEEE, 2013, 101 (05) : 1038 - 1053
  • [44] Utterance verification for spontaneous mandarin speech keyword spotting
    Xin, L
    Wang, BX
    2001 INTERNATIONAL CONFERENCES ON INFO-TECH AND INFO-NET PROCEEDINGS, CONFERENCE A-G: INFO-TECH & INFO-NET: A KEY TO BETTER LIFE, 2001, : C397 - C401
  • [45] LEARNING UTTERANCE-LEVEL NORMALISATION USING VARIATIONAL AUTOENCODERS FOR ROBUST AUTOMATIC SPEECH RECOGNITION
    Tan, Shawn
    Sim, Khe Chai
    2016 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2016), 2016, : 43 - 49
  • [46] Prediction of Speech Recognition Accuracy for Utterance Classification
    Korenevsky, Maxim L.
    Smirnov, Andrey B.
    Mendelev, Valentin S.
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1275 - 1279
  • [47] An improved HMM speech recognition model
    Yuan, Lichi
    2008 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING, VOLS 1 AND 2, PROCEEDINGS, 2008, : 1311 - 1315
  • [48] HMM speech recognition with reduced training
    Foo, SW
    Yap, T
    ICICS - PROCEEDINGS OF 1997 INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS AND SIGNAL PROCESSING, VOLS 1-3: THEME: TRENDS IN INFORMATION SYSTEMS ENGINEERING AND WIRELESS MULTIMEDIA COMMUNICATIONS, 1997, : 1016 - 1019
  • [49] Non-parametric probability estimation for HMM-based automatic speech recognition
    Lefèvre, F
    COMPUTER SPEECH AND LANGUAGE, 2003, 17 (2-3): : 113 - 136
  • [50] HMM-based automatic speech commands and instructions recognition system for Polish language
    Wydra, S
    PHOTONICS APPLICATIONS IN ASTRONOMY, COMMUNICATIONS, INDUSTRY, AND HIGH-ENERGY PHYSICS EXPERIMENTS IV, 2006, 6159