Tree-Based HMM State Tying for Arabic Continuous Speech Recognition

Cited by: 0

Authors:

Azim, Mona A. [1]

Hamid, A. Aziz A. [1]

Badr, Nagwa L. [1]

Tolba, M. F. [1]

Affiliations:

[1] Ain Shams Univ, Fac Comp & Informat Sci, Cairo, Egypt

Source:

PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT SYSTEMS AND INFORMATICS 2016 | 2017 / Vol. 533

Keywords:

Arabic phonemes; Tri-phone HMMs; Speech recognition; Similarity

DOI:

10.1007/978-3-319-48308-5_10

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

One of the major challenges in building Hidden Markov Models (HMMs) for continuous speech recognition systems is balancing the available training set against recognition performance. Large-vocabulary recognition systems usually require context-dependent models to obtain higher recognition accuracy. This balance is crucial because most of the language contexts may not occur in the training set. This paper proposes an Arabic phonetic decision tree for building tied-state tri-phone HMMs. Experimental results based on the proposed decision tree show promising recognition accuracy compared with traditional context-independent models using the same training and testing sets. The maximum recognition accuracy achieved by the proposed approach was 92.8%, whereas context-independent HMMs reached 61.5%.


Pages: 96-103

Page count: 8
