Speech recognition using hybrid hidden Markov model and NN classifier

被引:4
|
作者
Kundu A. [1 ]
Bayya A. [2 ]
机构
[1] U.S. West Advanced Technologies, Boulder
[2] Rockwell International Corporation, 4311 Jamboree Rd., Newport Beach
关键词
Baum-welch (BW) algorithm; Hidden markov model; Hybrid classifier; Modified viterbi algorithm (MVA); Multilayer perceptrons; Neural nets; Segmental K-means algorithm;
D O I
10.1007/BF02111210
中图分类号
学科分类号
摘要
This paper discusses the use of an integrated HMM/NN classifier for speech recognition. The proposed classifier combines the time normalization property of the HMM classifier with the superior discriminative ability of the neural net (NN) classifier. Speech signals display a strong time varying characteristic. Although the neural net has been successful in many classification problems, its success (compared to HMM) is secondary to HMM in the field of speech recognition. The main reason is the lack of time normalization characteristics of most neural net structures (time-delay neural net is one notable exception but its structure is very complex). In the proposed integrated hybrid HMM/NN classifier, a left-to-right HMM module is used first to segment the observation sequence of every exemplar into a fixed number of states. Subsequently, all the frames belonging to the same state are replaced by one average frame. Thus, every exemplar, irrespective of its time scale variation, is transformed into a fixed number of frames, i.e., a static pattern. The multilayer perceptron (MLP) neural net is then used as the classifier for these time normalized exemplars. Some experimental results using telephone speech databases are presented to demonstrate the potential of this hybrid integrated classifier. © 1998 Kluwer Academic Publishers.
引用
收藏
页码:227 / 240
页数:13
相关论文
共 50 条
  • [1] Murmured Speech Recognition Using Hidden Markov Model
    Kumar, Rajesh T.
    Videla, Lakshmi Sarvani
    SivaKumar, Soubraylu
    Asalg, Gopala Gupta
    Haritha, D.
    [J]. 2020 7TH IEEE INTERNATIONAL CONFERENCE ON SMART STRUCTURES AND SYSTEMS (ICSSS 2020), 2020, : 53 - 57
  • [2] Automatic Urdu Speech Recognition Using Hidden Markov Model
    Asadullah
    Shaukat, Arslan
    Ali, Hazrat
    Akram, Usman
    [J]. 2016 INTERNATIONAL CONFERENCE ON IMAGE, VISION AND COMPUTING (ICIVC 2016), 2016, : 135 - 139
  • [3] Speech recognition of monosyllables using hidden Markov model in VHDL
    Vaidhyanathan, A
    Lakshmiprabha, V
    [J]. TENCON 2004 - 2004 IEEE REGION 10 CONFERENCE, VOLS A-D, PROCEEDINGS: ANALOG AND DIGITAL TECHNIQUES IN ELECTRICAL ENGINEERING, 2004, : A76 - A79
  • [4] Belief Hidden Markov Model for Speech Recognition
    Jendoubi, Siwar
    Ben Yaghlane, Boutheina
    Martin, Arnaud
    [J]. 2013 5TH INTERNATIONAL CONFERENCE ON MODELING, SIMULATION AND APPLIED OPTIMIZATION (ICMSAO), 2013,
  • [5] Hybrid Hidden Markov Model and Artificial Neural Network for Automatic Speech Recognition
    Tang, Xian
    [J]. PROCEEDINGS OF THE 2009 PACIFIC-ASIA CONFERENCE ON CIRCUITS, COMMUNICATIONS AND SYSTEM, 2009, : 682 - 685
  • [6] HYBRID APPROACH TO SPEECH RECOGNITION USING HIDDEN MARKOV-MODELS AND MARKOV-CHAINS
    DAI, J
    [J]. IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 1994, 141 (05): : 273 - 279
  • [7] Speech Recognition for English to Indonesian Translator Using Hidden Markov Model
    Muhammad, Hariz Zakka
    Nasrun, Muhammad
    Setianingsih, Casi
    Murti, Muhammad Ary
    [J]. 2018 INTERNATIONAL CONFERENCE ON SIGNALS AND SYSTEMS (ICSIGSYS), 2018, : 255 - 260
  • [8] Tone recognition of Vietnamese continuous speech using hidden Markov model
    Quang, Nguyen Hong
    Pascal, Nocera
    Eric, Castelli
    Van Loan, Trinh
    [J]. 2008 SECOND INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND ELECTRONICS, 2008, : 233 - +
  • [9] Visual Speech Recognition Using Optical Flow and Hidden Markov Model
    Usha Sharma
    Sushila Maheshkar
    A. N. Mishra
    Rahul Kaushik
    [J]. Wireless Personal Communications, 2019, 106 : 2129 - 2147
  • [10] Visual Speech Recognition Using Optical Flow and Hidden Markov Model
    Sharma, Usha
    Maheshkar, Sushila
    Mishra, A. N.
    Kaushik, Rahul
    [J]. WIRELESS PERSONAL COMMUNICATIONS, 2019, 106 (04) : 2129 - 2147