Building Automatic Speech Recognition Systems for Moroccan Dialect: A Phoneme-Based Approach

被引：0

作者：

Abderrahim Ezzine

Naouar Laaidi

Ouissam Zealouk

Hassan Satori

机构：

[1] Sidi Mohamed Ben Abbdallah University,Department of Computer Science and Mathematics, Faculty of Sciences Dhar Mahraz

[2] Laboratory of Computer Science,undefined

[3] Signals,undefined

[4] Automation and Cognition (LISAC),undefined

来源：

SN Computer Science | / 5卷 / 6期

关键词：

Speech recognition; In-house corpus; Moroccan dialect; HMM-GMM; Phoneme modeling; Machine learning;

D O I：

10.1007/s42979-024-03108-5

中图分类号：

学科分类号：

摘要：

Building efficient acoustic models for dialects is a major challenge in Automatic Speech Recognition (ASR) systems. In this paper, we investigate the Moroccan Fessi dialect speech recognition system based on phoneme modeling. We employed a combined approach, including the Hidden Markov Model (HMM) and the Gaussian Mixture Model (GMM). Also, the ASR dialect specificity was analysed, including phonemes nature and phonetic inventory. Our results show the best performance was found by using 3 HMM and 4 GMM configurations, achieving an accuracy of 97.33%. Additionally, we observed that the digits containing voiced pharyngeal phonemes, particularly the phoneme /ʕ/, achieved the highest recognition rate, while words containing the phoneme /s/ exhibited multiple substitutions.

引用

共 50 条

[1] Improved Phoneme-Based Myoelectric Speech Recognition
Zhou, Quan
Jiang, Ning
Englehart, Kevin
Hudgins, Bernard
[J]. IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2009, 56 (08) : 2016 - 2023
[2] Myoclectric signal classification for phoneme-based speech recognition
Scheme, Erik J.
Hudgins, Bernard
Parker, Phillip A.
[J]. IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2007, 54 (04) : 694 - 699
[3] A STOCHASTIC SEGMENT MODEL FOR PHONEME-BASED CONTINUOUS SPEECH RECOGNITION
OSTENDORF, M
ROUKOS, S
[J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1989, 37 (12): : 1857 - 1869
[4] ADAPTABLE PHONEME-BASED MODELS FOR LARGE-VOCABULARY SPEECH RECOGNITION
BAMBERG, PG
MANDEL, MA
[J]. SPEECH COMMUNICATION, 1991, 10 (5-6) : 437 - 451
[5] CONTINUOUS SPEECH RECOGNITION USING A DEPENDENCY GRAMMAR AND PHONEME-BASED HMMS
MATSUNAGA, S
HOMMA, S
SAGAYAMA, S
FURUI, S
[J]. IEICE TRANSACTIONS ON COMMUNICATIONS ELECTRONICS INFORMATION AND SYSTEMS, 1991, 74 (07): : 1826 - 1833
[6] Learning strategies for modular neuro-fuzzy systems: A case study on phoneme-based speech recognition
Kasabov, N
[J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 1997, 5 (04) : 345 - 354
[7] A Comprehensive Examination of Phoneme Recognition in Automatic Speech Recognition Systems
Bhatt, Shobha
Bansal, Shweta
Kumar, Ankit
Pandey, Saroj Kumar
Ojha, Manoj Kumar
Singh, Kamred Udham
Chakraborty, Sanjay
Singh, Teekam
Swarup, Chetan
[J]. TRAITEMENT DU SIGNAL, 2023, 40 (05) : 1997 - 2008
[8] Phoneme-based speech recognition via fuzzy neural networks modeling and learning
Kasabov, NK
Kozma, R
Watts, MJ
[J]. INFORMATION SCIENCES, 1998, 110 (1-2) : 61 - 79
[9] PHONEME-BASED DISTRIBUTION REGULARIZATION FOR SPEECH ENHANCEMENT
Liu, Yajing
Peng, Xiulian
Xiong, Zhiwei
Lu, Yan
[J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 726 - 730
[10] Phoneme-based Thai speech recognition using fuzzy system and neural network
Cheirsilp, R
Santiprabhob, P
[J]. IC-AI'2000: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 1-III, 2000, : 393 - 397

← 1 2 3 4 5 →