Building Automatic Speech Recognition Systems for Moroccan Dialect: A Phoneme-Based Approach

被引:0
|
作者
Abderrahim Ezzine
Naouar Laaidi
Ouissam Zealouk
Hassan Satori
机构
[1] Sidi Mohamed Ben Abbdallah University,Department of Computer Science and Mathematics, Faculty of Sciences Dhar Mahraz
[2] Laboratory of Computer Science,undefined
[3] Signals,undefined
[4] Automation and Cognition (LISAC),undefined
关键词
Speech recognition; In-house corpus; Moroccan dialect; HMM-GMM; Phoneme modeling; Machine learning;
D O I
10.1007/s42979-024-03108-5
中图分类号
学科分类号
摘要
Building efficient acoustic models for dialects is a major challenge in Automatic Speech Recognition (ASR) systems. In this paper, we investigate the Moroccan Fessi dialect speech recognition system based on phoneme modeling. We employed a combined approach, including the Hidden Markov Model (HMM) and the Gaussian Mixture Model (GMM). Also, the ASR dialect specificity was analysed, including phonemes nature and phonetic inventory. Our results show the best performance was found by using 3 HMM and 4 GMM configurations, achieving an accuracy of 97.33%. Additionally, we observed that the digits containing voiced pharyngeal phonemes, particularly the phoneme /ʕ/, achieved the highest recognition rate, while words containing the phoneme /s/ exhibited multiple substitutions.
引用
收藏
相关论文
共 50 条
  • [1] Improved Phoneme-Based Myoelectric Speech Recognition
    Zhou, Quan
    Jiang, Ning
    Englehart, Kevin
    Hudgins, Bernard
    [J]. IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2009, 56 (08) : 2016 - 2023
  • [2] Myoclectric signal classification for phoneme-based speech recognition
    Scheme, Erik J.
    Hudgins, Bernard
    Parker, Phillip A.
    [J]. IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2007, 54 (04) : 694 - 699
  • [3] A STOCHASTIC SEGMENT MODEL FOR PHONEME-BASED CONTINUOUS SPEECH RECOGNITION
    OSTENDORF, M
    ROUKOS, S
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1989, 37 (12): : 1857 - 1869
  • [4] ADAPTABLE PHONEME-BASED MODELS FOR LARGE-VOCABULARY SPEECH RECOGNITION
    BAMBERG, PG
    MANDEL, MA
    [J]. SPEECH COMMUNICATION, 1991, 10 (5-6) : 437 - 451
  • [5] CONTINUOUS SPEECH RECOGNITION USING A DEPENDENCY GRAMMAR AND PHONEME-BASED HMMS
    MATSUNAGA, S
    HOMMA, S
    SAGAYAMA, S
    FURUI, S
    [J]. IEICE TRANSACTIONS ON COMMUNICATIONS ELECTRONICS INFORMATION AND SYSTEMS, 1991, 74 (07): : 1826 - 1833
  • [7] A Comprehensive Examination of Phoneme Recognition in Automatic Speech Recognition Systems
    Bhatt, Shobha
    Bansal, Shweta
    Kumar, Ankit
    Pandey, Saroj Kumar
    Ojha, Manoj Kumar
    Singh, Kamred Udham
    Chakraborty, Sanjay
    Singh, Teekam
    Swarup, Chetan
    [J]. TRAITEMENT DU SIGNAL, 2023, 40 (05) : 1997 - 2008
  • [8] Phoneme-based speech recognition via fuzzy neural networks modeling and learning
    Kasabov, NK
    Kozma, R
    Watts, MJ
    [J]. INFORMATION SCIENCES, 1998, 110 (1-2) : 61 - 79
  • [9] PHONEME-BASED DISTRIBUTION REGULARIZATION FOR SPEECH ENHANCEMENT
    Liu, Yajing
    Peng, Xiulian
    Xiong, Zhiwei
    Lu, Yan
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 726 - 730
  • [10] Phoneme-based Thai speech recognition using fuzzy system and neural network
    Cheirsilp, R
    Santiprabhob, P
    [J]. IC-AI'2000: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 1-III, 2000, : 393 - 397