Analysis of HMM Temporal Evolution for Automatic Speech Recognition and Utterance Verification

被引:0
|
作者
Casar, Marta [1 ]
Fonollosa, Jose A. R. [1 ]
机构
[1] Univ Politecn Cataluna, TALP Res Ctr, Dept Signal Theory & Commun, Barcelona, Spain
关键词
speech recognition; HMM acoustic modeling; state scores; utterance verification;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a double layer speech recognition and utterance verification system based on the analysis of the temporal evolution of HMM's state scores. For the lower layer, it uses standard HMM-based acoustic modeling, followed by a Viterbi grammar-free decoding step which provides us with the state scores of the acoustic models. In the second layer, these state scores are added to the regular set of acoustic parameters, building a new set of expanded HMMs. Using this expanded set of HMMs for speech recognition a significant improvement in performance is achieved. Next, we will use this new architecture for utterance verification in a "second opinion" framework. We will consign to the second layer evaluating the reliability of decoding using the acoustic models from the first layer. An outstanding improvement in performance versus a baseline verification algorithm has been achieved.
引用
收藏
页码:613 / 616
页数:4
相关论文
共 50 条
  • [31] Estimation of Phoneme-Specific HMM Topologies for the Automatic Recognition of Dysarthric Speech
    Caballero-Morales, Santiago-Omar
    COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2013, 2013
  • [32] HMM Adaptation using Statistical Linear Approximation for Robust Automatic Speech Recognition
    Berkovitch, Michael
    Shallom, Ilan D.
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1301 - 1304
  • [33] Refinement of HMM Model Parameters for Punjabi Automatic Speech Recognition (PASR) System
    Kadyan, Virender
    Mantri, Archana
    Aggarwal, R. K.
    IETE JOURNAL OF RESEARCH, 2018, 64 (05) : 673 - 688
  • [34] An HMM-Like Dynamic Time Warping Scheme for Automatic Speech Recognition
    Ding, Ing-Jr
    Hsu, Yen-Ming
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2014, 2014
  • [35] A temporal auditory model with adaptation for automatic speech recognition
    Haque, Serajul
    Togneri, Roberto
    Zaknich, Anthony
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 1141 - +
  • [36] Automatic Speech Recognition with Primarily Temporal Envelope Information
    Lin, Payton
    Chen, Fei
    Wang, Syu Siang
    Lai, Ying Hui
    Tsao, Yu
    15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 476 - 480
  • [37] AUTOMATIC SPEECH SEMANTIC RECOGNITION AND VERIFICATION IN AIR TRAFFIC CONTROL
    Johnson, Daniel R.
    Nenov, Val I.
    Espinoza, Gustavo
    2013 IEEE/AIAA 32ND DIGITAL AVIONICS SYSTEMS CONFERENCE (DASC), 2013,
  • [38] Automatic Speech Semantic Recognition and Verification in Air Traffic Control
    Johnson, Daniel R.
    Nenov, Val
    2013 IEEE/AIAA 32ND DIGITAL AVIONICS SYSTEMS CONFERENCE (DASC), 2013,
  • [39] Automatic utterance segmentation tool for speech corpus
    Ozawa, Mitsuhiro
    Tsuge, Satoru
    Shishibori, Masami
    Kita, Kenji
    Fukumi, Minoru
    Ren, Fuji
    Kuroiwa, Shingo
    PROCEEDINGS OF THE 2007 IEEE INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING (NLP-KE'07), 2007, : 401 - +
  • [40] I-vector Based Utterance Verification for Large-Vocabulary Speech Recognition System
    Choi, Woo Yong
    Song, Hwa Jeon
    Chung, Hoon
    Kang, Jeomja
    Park, Jeon Gue
    2016 FIRST IEEE INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND THE INTERNET (ICCCI 2016), 2016, : 316 - 319