Tree-Based HMM State Tying for Arabic Continuous Speech Recognition

被引:0
|
作者
Azim, Mona A. [1 ]
Hamid, A. Aziz A. [1 ]
Badr, Nagwa L. [1 ]
Tolba, M. F. [1 ]
机构
[1] Ain Shams Univ, Fac Comp & Informat Sci, Cairo, Egypt
关键词
Arabic phonemes; Tri-phones hmms; Speech recognition; SIMILARITY;
D O I
10.1007/978-3-319-48308-5_10
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One of the major challenges in building Hidden Markov Models (HMMs) for continuous speech recognition systems is the balance between the available training set and the recognition performance. For large vocabulary recognition systems, context dependent models are usually required to obtain higher recognition accuracy. This is crucial as most of the language contexts may not occur in the training set. This paper proposes an Arabic phonetic decision tree necessary to build tied state tri-phone HMMs. Experimental results based on the proposed decision tree show a promising recognition accuracy when compared with the traditional context independent models using the same training and testing sets. The maximum recognition accuracy achieved by the proposed approach was 92.8 % whereas it reached 61.5 % when tested using context independent HMMs.
引用
收藏
页码:96 / 103
页数:8
相关论文
共 50 条
  • [1] Robust decision tree state tying for continuous speech recognition
    Reichl, W
    Chou, W
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2000, 8 (05): : 555 - 566
  • [2] Hybrid SVM/HMM Model for the Recognition of Arabic Triphones-based Continuous Speech
    Zarrouk, Elyes
    Benayed, Yassine
    Gargouri, Faiez
    2013 10TH INTERNATIONAL MULTI-CONFERENCE ON SYSTEMS, SIGNALS & DEVICES (SSD), 2013,
  • [3] Bayesian Decision Tree State Tying for Conversational Speech Recognition
    Hu, Rusheng
    Zhao, Yunxin
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1738 - 1741
  • [4] Arabic phonemes recognition using hybrid LVQ/HMM model for continuous speech recognition
    Nahar, Khalid M. O.
    Abu Shquier, Mohammed
    Al-Khatib, Wasfi G.
    Al-Muhtaseb, Husni
    Elshafei, Moustafa
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2016, 19 (03) : 495 - 508
  • [5] Decision tree based state tying for speech recognition using DNN derived embeddings
    Li, Xiangang
    Wu, Xihong
    2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 123 - 127
  • [6] Knowledge-based adaptive decision tree state tying for conversational speech recognition
    Hu, Rusheng
    Zhao, Yunxin
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (07): : 2160 - 2168
  • [7] Decision tree-based context dependent sublexical units for Continuous Speech Recognition of Basque
    de Ipiña, KL
    Graña, M
    Ezeiza, N
    Hernández, M
    Zulueta, E
    Ezeiza, A
    PROGRESS IN PATTERN RECOGNITION, SPEECH AND IMAGE ANALYSIS, 2003, 2905 : 259 - 265
  • [8] HMM BASED RECOGNITION OF CHINESE TONES IN CONTINUOUS SPEECH
    Zhao Li (Department of Radio Engineering
    Journal of Electronics(China), 2000, (01) : 9 - 14
  • [9] HMM based recognition of Chinese tones in continuous speech
    Cheng, ML
    Cheng, XM
    Zhao, L
    PROCEEDINGS OF 2003 INTERNATIONAL CONFERENCE ON NEURAL NETWORKS & SIGNAL PROCESSING, PROCEEDINGS, VOLS 1 AND 2, 2003, : 916 - 919
  • [10] Decision tree-based acoustic models for speech recognition
    Masami Akamine
    Jitendra Ajmera
    EURASIP Journal on Audio, Speech, and Music Processing, 2012