Arabic phonemes recognition using hybrid LVQ/HMM model for continuous speech recognition

Cited by: 10

Authors:

Nahar, Khalid M. O. [1]

Abu Shquier, Mohammed [2]

Al-Khatib, Wasfi G. [3]

Al-Muhtaseb, Husni [3]

Elshafei, Moustafa [4]

Affiliations:

[1] Yarmouk Univ, Fac Comp Sci & Informat Technol, Dept Comp Sci, Irbid 21163, Jordan

[2] Jarash Univ, Fac Comp Sci & Informat Technol, Dept Comp Sci, Jarash, Jordan

[3] King Fahd Univ Petr & Minerals, Informat & Comp Sci Dept, Dhahran 31261, Saudi Arabia

[4] King Fahd Univ Petr & Minerals, Dept Syst Engn, Dhahran 31261, Saudi Arabia

Source:

INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY | 2016 / Vol. 19 / Issue 03

Keywords:

Learning vector quantization (LVQ); Codebooks; K-means algorithm; Phonemes transcription; Hidden Markov model (HMM); Hybrid LVQ/HMM model

DOI:

10.1007/s10772-016-9337-5

Chinese Library Classification (CLC) codes:

TM [Electrical engineering]; TN [Electronic technology, communication technology]

Discipline classification codes:

0808; 0809

Abstract:

In an attempt to increase the Arabic phoneme recognition rate, we introduce a novel hybrid recognition algorithm that combines learning vector quantization (LVQ) with a hidden Markov model (HMM). The hybrid algorithm is used to recognize Arabic phonemes in continuous open-vocabulary speech. A recorded corpus of Modern Standard Arabic, drawn from different TV news broadcasts, was used for training and testing. We employ a data-driven approach to generate training feature vectors that embed the correlation information of neighboring frames. Next, we generate the phoneme codebooks using the K-means splitting algorithm. Then, we train the generated codebooks using the LVQ algorithm. We achieved a performance of 98.49% during independent classification training and 90% during dependent classification training. When the trained LVQ codebooks were used for Arabic utterance transcription, the phoneme recognition rate was 72% using LVQ only. We then combined the LVQ codebooks with a single-state HMM using an enhanced Viterbi algorithm that includes phoneme bigrams. The hybrid LVQ/HMM algorithm achieved an Arabic phoneme recognition rate of 89%.
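The pipeline in the abstract (LVQ-trained codebooks scored per frame, then a single-state-per-phoneme Viterbi pass weighted by phoneme bigrams) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the "enhanced" Viterbi variant, the feature vectors, or the learning schedule. The sketch assumes plain LVQ1 updates, initial codebooks that are already given (for example from K-means splitting), negative squared distance to the nearest codeword as a stand-in for a log-likelihood, and a hypothetical transition table `log_trans` that holds both self-loop and bigram log-probabilities. All function names and the toy data are illustrative.

```python
import math
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def lvq1_train(codebooks, samples, lr=0.1, epochs=20, seed=0):
    """LVQ1: pull the nearest codeword toward a sample of its own class,
    push it away otherwise. `codebooks` maps phoneme -> list of codewords
    and is updated in place; `samples` is a list of (vector, phoneme)."""
    rng = random.Random(seed)
    samples = list(samples)
    for epoch in range(epochs):
        rng.shuffle(samples)
        alpha = lr * (1 - epoch / epochs)  # linearly decaying step (assumption)
        for x, label in samples:
            near_label, near_cw = min(
                ((lab, cw) for lab, cws in codebooks.items() for cw in cws),
                key=lambda pair: sq_dist(x, pair[1]),
            )
            sign = 1.0 if near_label == label else -1.0
            for i in range(len(near_cw)):
                near_cw[i] += sign * alpha * (x[i] - near_cw[i])
    return codebooks


def frame_scores(codebooks, x):
    """Per-phoneme score: negative squared distance to the closest codeword."""
    return {lab: -min(sq_dist(x, cw) for cw in cws)
            for lab, cws in codebooks.items()}


def viterbi_decode(frames, codebooks, log_trans):
    """One HMM state per phoneme; log_trans[p][q] is the log-probability of
    moving from phoneme p to q (p == q is the self-loop)."""
    labels = sorted(codebooks)
    best = frame_scores(codebooks, frames[0])
    backptrs = []
    for x in frames[1:]:
        score = frame_scores(codebooks, x)
        new_best, ptr = {}, {}
        for q in labels:
            prev = max(labels, key=lambda p: best[p] + log_trans[p][q])
            new_best[q] = best[prev] + log_trans[prev][q] + score[q]
            ptr[q] = prev
        best = new_best
        backptrs.append(ptr)
    state = max(labels, key=lambda q: best[q])
    path = [state]
    for ptr in reversed(backptrs):
        state = ptr[state]
        path.append(state)
    path.reverse()
    return path


def collapse(path):
    """Merge runs of identical frame labels into a phoneme sequence."""
    return [p for i, p in enumerate(path) if i == 0 or p != path[i - 1]]


# Toy 1-D example: two phonemes, one outlier frame (1.7) inside the "a" run.
toy_codebooks = {"a": [[0.0]], "b": [[3.0]]}
toy_frames = [[0.2], [0.0], [1.7], [0.1], [2.9], [3.0], [3.1]]
stay, switch = math.log(0.9), math.log(0.1)
toy_trans = {p: {q: (stay if p == q else switch) for q in toy_codebooks}
             for p in toy_codebooks}

# LVQ-only labelling: pick the best-scoring phoneme for each frame on its own.
framewise = []
for frame in toy_frames:
    scores = frame_scores(toy_codebooks, frame)
    framewise.append(max(scores, key=scores.get))

# Hybrid labelling: the same scores, decoded jointly with transition weights.
decoded = collapse(viterbi_decode(toy_frames, toy_codebooks, toy_trans))
```

In this toy setup the frame-by-frame labelling follows every local score, while the Viterbi pass trades a slightly worse local score against the cost of switching phonemes, which is the kind of smoothing that could explain a gap between LVQ-only and hybrid recognition rates such as the one reported above.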

Pages: 495-508

Page count: 14
