Arabic phonemes recognition using hybrid LVQ/HMM model for continuous speech recognition

被引:10
|
作者
Nahar, Khalid M. O. [1 ]
Abu Shquier, Mohammed [2 ]
Al-Khatib, Wasfi G. [3 ]
Al-Muhtaseb, Husni [3 ]
Elshafei, Moustafa [4 ]
机构
[1] Yarmouk Univ, Fac Comp Sci & Informat Technol, Dept Comp Sci, Irbid 21163, Jordan
[2] Jarash Univ, Fac Comp Sci & Informat Technol, Dept Comp Sci, Jarash, Jordan
[3] King Fahd Univ Petr & Minerals, Informat & Comp Sci Dept, Dhahran 31261, Saudi Arabia
[4] King Fahd Univ Petr & Minerals, Dept Syst Engn, Dhahran 31261, Saudi Arabia
关键词
Learning vector quantization (LVQ); Codebooks; K-means algorithm; Phonemes transcription; Hidden Markov model (HMM); Hybrid LVQ/HMM model;
D O I
10.1007/s10772-016-9337-5
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In attempt to increase the rate of Arabic phonemes recognition, we introduce a novel hybrid recognition algorithm. The algorithm is composed of the learning vector quantization (LVQ) and hidden Markov model (HMM). The hybrid algorithm used to recognizing Arabic phonemes in continuous open-vocabulary speech. A recorded Arabic corpus of different TV news for modern standard Arabic was used for training and testing purposes. We employ a data driven approach to generate the training feature vectors that embed the frame neighboring correlation information. Next, we generate the phonemes codebooks using the K-means splitting algorithm. Then, we trained the generated codebooks using the LVQ algorithm. We achieved a performance of 98.49 % during independent classification training and 90 % during dependent classification training. When using the trained LVQ codebooks in Arabic utterance transcription, the phoneme recognition rate was 72 % using LVQ only. We combined the LVQ codebooks with the single state HMM model using enhanced Viterbi algorithm which includes the phonemes bigrams. We achieved 89 % of Arabic phonemes recognition rate based on the hybrid LVQ/HMM algorithm.
引用
收藏
页码:495 / 508
页数:14
相关论文
共 50 条
  • [1] Hybrid SVM/HMM Model for the Recognition of Arabic Triphones-based Continuous Speech
    Zarrouk, Elyes
    Benayed, Yassine
    Gargouri, Faiez
    [J]. 2013 10TH INTERNATIONAL MULTI-CONFERENCE ON SYSTEMS, SIGNALS & DEVICES (SSD), 2013,
  • [2] Nonspecific Speech Recognition based on HMM/LVQ Hybrid Network
    Liang Shuling
    Wang Chaoli
    Du Jiaming
    [J]. ICICTA: 2009 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTATION TECHNOLOGY AND AUTOMATION, VOL I, PROCEEDINGS, 2009, : 645 - 648
  • [3] Hybrid SVM/HMM Model for the Arab Phonemes Recognition
    Zarrouk, Elyes
    Benayed, Yassine
    [J]. INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2016, 13 (05) : 574 - 582
  • [4] HMM/ANN hybrid model for continuous Malayalam speech recognition
    Mohamed, Anuj
    Nair, K. N. Ramachandran
    [J]. INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY AND SYSTEM DESIGN 2011, 2012, 30 : 616 - 622
  • [5] A hybrid ANN/HMM models for arabic speech recognition using optimal codebook
    Ettaouil, Mohamed
    Lazaar, Mohamed
    En-Naimani, Zakariae
    [J]. 2013 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS: THEORIES AND APPLICATIONS (SITA), 2013,
  • [6] Speech/speaker recognition using a HMM/GMM hybrid model
    Rodriguez, E
    Ruiz, B
    Garcia-Crespo, A
    Garcia, F
    [J]. AUDIO- AND VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION, 1997, 1206 : 227 - 234
  • [7] Statistical Analysis of Arabic Phonemes Used in Arabic Speech Recognition
    Nahar, Khalid M. O.
    Elshafei, Mustafa
    Al-Khatib, Wasfi G.
    Al-Muhtaseb, Husni
    Alghamdi, Mansour M.
    [J]. NEURAL INFORMATION PROCESSING, ICONIP 2012, PT I, 2012, 7663 : 533 - 542
  • [8] The hybrid model of speech recognition based on HMM and HMMNN
    Wang Sheguo
    Tong Jianing
    Yuan Yujin
    [J]. 2009 INTERNATIONAL CONFERENCE ON E-BUSINESS AND INFORMATION SYSTEM SECURITY, VOLS 1 AND 2, 2009, : 926 - +
  • [9] A new hybrid HMM/ANN model for speech recognition
    Xi, XJ
    Lin, KH
    Zhou, CL
    Cai, J
    [J]. Artificial Intelligence Applications and Innovations II, 2005, 187 : 223 - 230
  • [10] Tree-Based HMM State Tying for Arabic Continuous Speech Recognition
    Azim, Mona A.
    Hamid, A. Aziz A.
    Badr, Nagwa L.
    Tolba, M. F.
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT SYSTEMS AND INFORMATICS 2016, 2017, 533 : 96 - 103