Hybrid continuous speech recognition systems by HMM, MLP and SVM: a comparative study

Cited by: 21
Authors
Zarrouk, Elyes [1]
Ben Ayed, Yassine [1]
Gargouri, Faiez [2]
Affiliations
[1] Natl Sch Engn Sfax, Sfax, Tunisia
[2] Sfax Univ, Higher Inst Comp Sci & Multimedia, Sfax, Tunisia
Keywords
Automatic speech recognition; Hybrid system; Hidden Markov models; Multi-layer perceptron; Support vector machines
DOI
10.1007/s10772-013-9221-5
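Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic and communication technology]
Discipline classification code
0808; 0809
Abstract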
This paper presents a new hybrid method for continuous Arabic speech recognition based on triphone modelling. To this end, we apply a Support Vector Machine (SVM) as an estimator of posterior probabilities within standard Hidden Markov Models (HMM). We also describe a new approach to categorising Arabic vowels into long and short vowels, which is applied in the labelling phase of the speech signals. Using this new labelling method, we find that the SVM/HMM hybrid model is more efficient than standard HMMs and than the hybrid Multi-Layer Perceptron (MLP)/HMM system. The results obtained for the triphone-based Arabic speech recognition system are 64.68 % with HMMs, 72.39 % with MLP/HMM and 74.01 % with the SVM/HMM hybrid model. The word error rate (WER) obtained for continuous speech recognition by the three systems confirms the performance of SVM/HMM, which achieves the lowest average over the 4 tested speakers, 11.42 %.
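The abstract does not say how the SVM posteriors enter the HMM, so the following is only a minimal sketch of one common hybrid convention, not the authors' implementation: each per-frame posterior P(q|x) is divided by the state prior P(q) to give a scaled likelihood, which then replaces the HMM emission score in Viterbi decoding. The function names, the two-state toy model, and all numbers are hypothetical, and the step that turns raw SVM outputs into posteriors is assumed to have happened already.

```python
import math

def scaled_log_likelihoods(posteriors, priors):
    """Turn per-frame state posteriors P(q|x) into log(P(q|x) / P(q)).

    Assumption: this prior division is the usual hybrid convention;
    the abstract does not confirm that the paper uses it.
    """
    eps = 1e-12  # guards against log(0)
    return [[math.log(max(p, eps)) - math.log(max(pr, eps))
             for p, pr in zip(frame, priors)]
            for frame in posteriors]

def viterbi(log_emis, log_trans, log_init):
    """Return the most likely state sequence for the given log scores."""
    n_states = len(log_init)
    score = [log_init[s] + log_emis[0][s] for s in range(n_states)]
    back = []  # back[t-1][s] = best predecessor of state s at frame t
    for t in range(1, len(log_emis)):
        prev = score
        pointers, score = [], []
        for s in range(n_states):
            best = max(range(n_states),
                       key=lambda r: prev[r] + log_trans[r][s])
            pointers.append(best)
            score.append(prev[best] + log_trans[best][s] + log_emis[t][s])
        back.append(pointers)
    # Trace back from the best final state.
    state = max(range(n_states), key=lambda s: score[s])
    path = [state]
    for pointers in reversed(back):
        state = pointers[state]
        path.append(state)
    return path[::-1]

# Hypothetical toy data: 4 frames, 2 states, posteriors from some classifier.
posteriors = [[0.9, 0.1], [0.8, 0.2], [0.2, 0.8], [0.1, 0.9]]
priors = [0.5, 0.5]
log_trans = [[math.log(0.7), math.log(0.3)],
             [math.log(0.3), math.log(0.7)]]
log_init = [math.log(0.5), math.log(0.5)]

log_emis = scaled_log_likelihoods(posteriors, priors)
path = viterbi(log_emis, log_trans, log_init)
print(path)
```

In this sketch the classifier is interchangeable: an MLP or an SVM with calibrated outputs could supply `posteriors` without any change to the decoder, which is the sense in which such systems are called hybrid.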
Pages: 223-233
Number of pages: 11
Related papers
50 in total
  • [21] MLP and SVM networks - a comparative study
    Osowski, S
    Siwek, K
    Markiewicz, T
    [J]. NORSIG 2004: PROCEEDINGS OF THE 6TH NORDIC SIGNAL PROCESSING SYMPOSIUM, 2004, 46 : 37 - 40
  • [22] A NN/HMM hybrid for continuous speech recognition with a discriminant nonlinear feature extraction
    Rigoll, G
    Willett, D
    [J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 9 - 12
  • [23] Combining TDNN and HMM in a Hybrid System for Improved Continuous-Speech Recognition
    Dugast, Christian
    Devillers, Laurence
    Aubert, Xavier
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (01): : 217 - 223
  • [24] New feedback method of hybrid HMM/ANN methods for continuous speech recognition
    Lee, TZ
    Chen, DW
    [J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 509 - 512
  • [25] HIERARCHICAL HYBRID MLP/HMM OR RATHER MLP FEATURES FOR A DISCRIMINATIVELY TRAINED GAUSSIAN HMM: A COMPARISON FOR OFFLINE HANDWRITING RECOGNITION
    Dreuw, Philippe
    Doetsch, Patrick
    Plahl, Christian
    Ney, Hermann
    [J]. 2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011,
  • [26] HMM/MLP hybrid speech recognizer for the Portuguese telephone SpeechDat corpus
    Hagen, A
    Neto, JP
    [J]. COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROCEEDINGS, 2003, 2721 : 126 - 134
  • [27] HMM/NN hybrids for continuous speech recognition
    Alim, OAA
    Elboghdadly, N
    El Shaar, NM
    [J]. PROCEEDINGS OF THE EIGHTEENTH NATIONAL RADIO SCIENCE CONFERENCE, VOLS 1 AND 2, 2001, : 509 - 516
  • [28] Hybrid SVM/HMM Model for the Arab Phonemes Recognition
    Zarrouk, Elyes
    Benayed, Yassine
    [J]. INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2016, 13 (05) : 574 - 582
  • [29] Hybrid modeling of PHMM and HMM for speech recognition
    Ogawa, T
    Kobayashi, T
    [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 140 - 143
  • [30] Musical beat recognition using a MLP-HMM hybrid classifier
    Castro, PAC
    Dexter, I
    Garcia, S
    Cajote, RD
    [J]. TENCON 2004 - 2004 IEEE REGION 10 CONFERENCE, VOLS A-D, PROCEEDINGS: ANALOG AND DIGITAL TECHNIQUES IN ELECTRICAL ENGINEERING, 2004, : A104 - A107