Analysis of HMM Temporal Evolution for Automatic Speech Recognition and Utterance Verification

Cited by: 0

Authors:

Casar, Marta [1]

Fonollosa, Jose A. R. [1]

Affiliations:

[1] Univ Politecn Cataluna, TALP Res Ctr, Dept Signal Theory & Commun, Barcelona, Spain

Source:

INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5 | 2006

Keywords:

speech recognition; HMM acoustic modeling; state scores; utterance verification

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper proposes a two-layer speech recognition and utterance verification system based on analysis of the temporal evolution of HMM state scores. The lower layer uses standard HMM-based acoustic modeling, followed by a grammar-free Viterbi decoding step that yields the state scores of the acoustic models. In the second layer, these state scores are added to the regular set of acoustic parameters to build a new set of expanded HMMs. Using this expanded set of HMMs for speech recognition yields a significant improvement in performance. The new architecture is then applied to utterance verification in a "second opinion" framework, in which the second layer evaluates the reliability of the decoding obtained with the first-layer acoustic models. This achieves a substantial improvement in performance over a baseline verification algorithm.
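The feature expansion described in the abstract (state scores added to the regular acoustic parameters) can be pictured as a per-frame concatenation. The following is only a minimal sketch under stated assumptions, not the authors' implementation: the function name `expand_features`, the list-of-lists layout (one row per frame), and the toy values are all hypothetical, and the abstract does not specify how the scores are normalized or aligned before being appended.

```python
from typing import List, Sequence


def expand_features(
    acoustic: Sequence[Sequence[float]],
    state_scores: Sequence[Sequence[float]],
) -> List[List[float]]:
    """Append per-frame state scores to per-frame acoustic parameters.

    Assumption (not from the source): both inputs hold one row per frame,
    and frame i of `state_scores` belongs to frame i of `acoustic`.
    """
    if len(acoustic) != len(state_scores):
        raise ValueError("acoustic and state_scores must have the same number of frames")
    # Each expanded frame is the original feature vector followed by its scores.
    return [list(a) + list(s) for a, s in zip(acoustic, state_scores)]


# Toy data (hypothetical): 2 frames, 2 acoustic parameters, 1 state score each.
acoustic = [[0.1, 0.2], [0.3, 0.4]]
state_scores = [[-1.5], [-2.0]]
expanded = expand_features(acoustic, state_scores)
```

In this sketch, the expanded vectors would serve as the observations for training the second-layer models; that training step is outside the scope of the example.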

Pages: 613-616

Page count: 4

50 records in total

[31] Estimation of Phoneme-Specific HMM Topologies for the Automatic Recognition of Dysarthric Speech
Caballero-Morales, Santiago-Omar
COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2013, 2013
[32] HMM Adaptation using Statistical Linear Approximation for Robust Automatic Speech Recognition
Berkovitch, Michael
Shallom, Ilan D.
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1301 - 1304
[33] Refinement of HMM Model Parameters for Punjabi Automatic Speech Recognition (PASR) System
Kadyan, Virender
Mantri, Archana
Aggarwal, R. K.
IETE JOURNAL OF RESEARCH, 2018, 64 (05) : 673 - 688
[34] An HMM-Like Dynamic Time Warping Scheme for Automatic Speech Recognition
Ding, Ing-Jr
Hsu, Yen-Ming
MATHEMATICAL PROBLEMS IN ENGINEERING, 2014, 2014
[35] A temporal auditory model with adaptation for automatic speech recognition
Haque, Serajul
Togneri, Roberto
Zaknich, Anthony
2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 1141 - +
[36] Automatic Speech Recognition with Primarily Temporal Envelope Information
Lin, Payton
Chen, Fei
Wang, Syu Siang
Lai, Ying Hui
Tsao, Yu
15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 476 - 480
[37] AUTOMATIC SPEECH SEMANTIC RECOGNITION AND VERIFICATION IN AIR TRAFFIC CONTROL
Johnson, Daniel R.
Nenov, Val I.
Espinoza, Gustavo
2013 IEEE/AIAA 32ND DIGITAL AVIONICS SYSTEMS CONFERENCE (DASC), 2013,
[38] Automatic Speech Semantic Recognition and Verification in Air Traffic Control
Johnson, Daniel R.
Nenov, Val
2013 IEEE/AIAA 32ND DIGITAL AVIONICS SYSTEMS CONFERENCE (DASC), 2013,
[39] Automatic utterance segmentation tool for speech corpus
Ozawa, Mitsuhiro
Tsuge, Satoru
Shishibori, Masami
Kita, Kenji
Fukumi, Minoru
Ren, Fuji
Kuroiwa, Shingo
PROCEEDINGS OF THE 2007 IEEE INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING (NLP-KE'07), 2007, : 401 - +
[40] I-vector Based Utterance Verification for Large-Vocabulary Speech Recognition System
Choi, Woo Yong
Song, Hwa Jeon
Chung, Hoon
Kang, Jeomja
Park, Jeon Gue
2016 FIRST IEEE INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND THE INTERNET (ICCCI 2016), 2016, : 316 - 319
