A Canonicalization of Distinctive Phonetic Features to Improve Arabic Speech Recognition

被引:7
|
作者
Alotaibi, Yousef A. [1 ]
Selouani, Sidh-Amed [2 ]
Yakoub, Mohammed Sidi [2 ]
Seddiq, Yasser Mohammed [3 ]
Meftah, Ali [1 ]
机构
[1] King Saud Univ, Coll Comp & Informat Sci, Riyadh, Saudi Arabia
[2] Univ Moncton, LARIHS Lab, Campus Shippagan, Shippegan, NB, Canada
[3] KACST, Riyadh, Saudi Arabia
关键词
Object recognition - Trellis codes - Gaussian distribution - Linguistics - Communication channels (information theory) - Speech recognition;
D O I
10.3813/AAA.919404
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The robustness of speech classification and recognition systems can be improved by the adoption of language distinctive phonetic feature (DPF) elements that can increase the effective characterization of a speech signal. This paper presents the results of applying Hidden Markov Models (HMMs) that perform Arabic phoneme recognition in conjunction with the inclusion and classification of their DPF element classes. The research focuses on classifying Modern Standard Arabic (MSA) phonemes within isolated words without a language context. HMM-based phoneme recognition is tested using 8, 16, and 32 HMM Gaussian mixture models. The monophone configuration is designed with consideration of 2-gram language model to evaluate the inherent performance of the system. The overall correct rates for classifying DPF element classes for the three versions of HMM systems are 83.29% 88.96%, and 92.70% for 8, 16, and 32 HMM Gaussian mixture model systems, respectively.
引用
收藏
页码:1269 / 1277
页数:9
相关论文
共 50 条
  • [21] A New Look at the Automatic Mapping Between Arabic Distinctive Phonetic Features and Acoustic Cues
    Alotaibi, Yousef k
    Seddiq, Yasser
    Meftah, Ali
    Selouani, Sid-Ahmed
    Yakoub, Mohammed Sidi
    [J]. 2017 40TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2017, : 368 - 371
  • [22] Phonetic Inventory for an Arabic Speech Corpus
    Halabi, Nawar
    Wald, Mike
    [J]. LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 734 - 738
  • [23] Investigating the Impact of Phonetic Cross Language Modeling on Arabic and English Speech Recognition
    Alotaibi, Yousef A.
    Meftah, Ali H.
    Selouani, Sid-Ahmed
    [J]. 2014 9TH INTERNATIONAL SYMPOSIUM ON COMMUNICATION SYSTEMS, NETWORKS & DIGITAL SIGNAL PROCESSING (CSNDSP), 2014, : 585 - 590
  • [24] Syllable-level desynchronisation of phonetic features for speech recognition
    Kirchhoff, K
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 2274 - 2276
  • [25] TRAINING AND ADAPTING MLP FEATURES FOR ARABIC SPEECH RECOGNITION
    Park, J.
    Diehl, F.
    Gales, M. J. F.
    Tomalin, M.
    Woodland, P. C.
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4461 - 4464
  • [26] DISTINCTIVE FEATURES - PROBLEMS OF PHONETIC ADEQUACY
    UGUZZONI, A
    [J]. LINGUA E STILE, 1980, 15 (02) : 233 - 280
  • [27] "Phonetic bases of distinctive features": Introduction
    Clements, G. N.
    Halle, P. A.
    [J]. JOURNAL OF PHONETICS, 2010, 38 (01) : 3 - 9
  • [28] Orthogonalized distinctive phonetic feature extraction for noise-robust automatic speech recognition
    Fukuda, T
    Nitta, T
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2004, E87D (05): : 1110 - 1118
  • [29] Improving Minority Language Speech Recognition Based on Distinctive Features
    Fu, Tong
    Gao, Shaojun
    Wu, Xihong
    [J]. INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING, 2018, 11266 : 411 - 420
  • [30] SUBMODULAR DATA SELECTION WITH ACOUSTIC AND PHONETIC FEATURES FOR AUTOMATIC SPEECH RECOGNITION
    Ni, Chongjia
    Wang, Lei
    Liu, Haibo
    Leung, Cheung-Chi
    Lu, Li
    Ma, Bin
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4629 - 4633