Speech recognition using hybrid hidden Markov model and NN classifier

被引：4

作者：

Kundu A. ^{[1
]}

Bayya A. ^{[2
]}

机构：

[1] U.S. West Advanced Technologies, Boulder

[2] Rockwell International Corporation, 4311 Jamboree Rd., Newport Beach

来源：

International Journal of Speech Technology | 1998年 / 2卷 / 3期

关键词：

Baum-welch (BW) algorithm; Hidden markov model; Hybrid classifier; Modified viterbi algorithm (MVA); Multilayer perceptrons; Neural nets; Segmental K-means algorithm;

D O I：

10.1007/BF02111210

中图分类号：

学科分类号：

摘要：

This paper discusses the use of an integrated HMM/NN classifier for speech recognition. The proposed classifier combines the time normalization property of the HMM classifier with the superior discriminative ability of the neural net (NN) classifier. Speech signals display a strong time varying characteristic. Although the neural net has been successful in many classification problems, its success (compared to HMM) is secondary to HMM in the field of speech recognition. The main reason is the lack of time normalization characteristics of most neural net structures (time-delay neural net is one notable exception but its structure is very complex). In the proposed integrated hybrid HMM/NN classifier, a left-to-right HMM module is used first to segment the observation sequence of every exemplar into a fixed number of states. Subsequently, all the frames belonging to the same state are replaced by one average frame. Thus, every exemplar, irrespective of its time scale variation, is transformed into a fixed number of frames, i.e., a static pattern. The multilayer perceptron (MLP) neural net is then used as the classifier for these time normalized exemplars. Some experimental results using telephone speech databases are presented to demonstrate the potential of this hybrid integrated classifier. © 1998 Kluwer Academic Publishers.

引用

页码：227 / 240

页数：13

共 50 条

[1] Murmured Speech Recognition Using Hidden Markov Model
Kumar, Rajesh T.
Videla, Lakshmi Sarvani
SivaKumar, Soubraylu
Asalg, Gopala Gupta
Haritha, D.
[J]. 2020 7TH IEEE INTERNATIONAL CONFERENCE ON SMART STRUCTURES AND SYSTEMS (ICSSS 2020), 2020, : 53 - 57
[2] Automatic Urdu Speech Recognition Using Hidden Markov Model
Asadullah
Shaukat, Arslan
Ali, Hazrat
Akram, Usman
[J]. 2016 INTERNATIONAL CONFERENCE ON IMAGE, VISION AND COMPUTING (ICIVC 2016), 2016, : 135 - 139
[3] Speech recognition of monosyllables using hidden Markov model in VHDL
Vaidhyanathan, A
Lakshmiprabha, V
[J]. TENCON 2004 - 2004 IEEE REGION 10 CONFERENCE, VOLS A-D, PROCEEDINGS: ANALOG AND DIGITAL TECHNIQUES IN ELECTRICAL ENGINEERING, 2004, : A76 - A79
[4] Belief Hidden Markov Model for Speech Recognition
Jendoubi, Siwar
Ben Yaghlane, Boutheina
Martin, Arnaud
[J]. 2013 5TH INTERNATIONAL CONFERENCE ON MODELING, SIMULATION AND APPLIED OPTIMIZATION (ICMSAO), 2013,
[5] Hybrid Hidden Markov Model and Artificial Neural Network for Automatic Speech Recognition
Tang, Xian
[J]. PROCEEDINGS OF THE 2009 PACIFIC-ASIA CONFERENCE ON CIRCUITS, COMMUNICATIONS AND SYSTEM, 2009, : 682 - 685
[6] HYBRID APPROACH TO SPEECH RECOGNITION USING HIDDEN MARKOV-MODELS AND MARKOV-CHAINS
DAI, J
[J]. IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 1994, 141 (05): : 273 - 279
[7] Speech Recognition for English to Indonesian Translator Using Hidden Markov Model
Muhammad, Hariz Zakka
Nasrun, Muhammad
Setianingsih, Casi
Murti, Muhammad Ary
[J]. 2018 INTERNATIONAL CONFERENCE ON SIGNALS AND SYSTEMS (ICSIGSYS), 2018, : 255 - 260
[8] Tone recognition of Vietnamese continuous speech using hidden Markov model
Quang, Nguyen Hong
Pascal, Nocera
Eric, Castelli
Van Loan, Trinh
[J]. 2008 SECOND INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND ELECTRONICS, 2008, : 233 - +
[9] Visual Speech Recognition Using Optical Flow and Hidden Markov Model
Usha Sharma
Sushila Maheshkar
A. N. Mishra
Rahul Kaushik
[J]. Wireless Personal Communications, 2019, 106 : 2129 - 2147
[10] Visual Speech Recognition Using Optical Flow and Hidden Markov Model
Sharma, Usha
Maheshkar, Sushila
Mishra, A. N.
Kaushik, Rahul
[J]. WIRELESS PERSONAL COMMUNICATIONS, 2019, 106 (04) : 2129 - 2147

← 1 2 3 4 5 →