A comparative study for Arabic speech recognition system in noisy environments

Cited by: 5

Authors:

Ouisaadane, Abdelkbir [1]

Safi, Said [1]

Affiliations:

[1] Sultan Moulay Slimane Univ, Polydisciplinary Fac, Dept Math & Comp Sci, Benimellal, Morocco

Source:

INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY | 2021 / Vol. 24 / Issue 03

Keywords:

GMM-HMM; DNN-HMM; Noise; Arabic speech; HIDDEN MARKOV-MODELS; NEURAL-NETWORKS

DOI:

10.1007/s10772-021-09847-7

Chinese Library Classification (CLC) number:

TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]

Discipline classification code:

0808; 0809

Abstract:

Speech recognition in noisy environments is a long-standing research theme and remains an important challenge. Consequently, much research addresses techniques and approaches for improving the performance of speech recognition systems, even in poor conditions. This paper presents a comparative study, under various conditions, of two architectures: hybrid GMM-HMM models built with the CMU Sphinx tools and hybrid DNN-HMM models built with the KALDI toolkit, both in noisy environments. We compare the hybrid GMM-HMM and hybrid DNN-HMM models to evaluate the performance of the proposed system. The novelty of this paper is to test whether these tools can recognize Arabic speech with good accuracy, principally in noisy environments. In addition, we adopt noisy training for both the GMM-HMM and the DNN-HMM models. We use the public Arabic Speech Corpus for Isolated Words (20 words), three noise levels, and three noise types. The implementation of our system consists of two phases: feature extraction using Mel-frequency cepstral coefficients (MFCC), and classification, which uses each of the two models separately. To test the performance of these methods, simulations are presented for different signal-to-noise ratios (SNR) and for different types of noise.
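The abstract mentions evaluating at several noise levels and noise types but does not say how the noisy data was produced. As a minimal sketch of one common way to corrupt a clean utterance at a chosen SNR (not necessarily the authors' procedure), the Python below scales a noise recording so that the speech-to-noise power ratio matches a target in dB. The function names `mix_at_snr` and `measured_snr_db`, the synthetic signals, and the SNR values 0, 5, and 10 dB are all assumptions made for illustration; the record does not state which levels were used.

```python
import math
import random


def mean_power(signal):
    """Average power (mean of squared samples) of a sequence of floats."""
    return sum(x * x for x in signal) / len(signal)


def mix_at_snr(speech, noise, snr_db):
    """Add `noise` to `speech`, scaled so the mixture has the requested SNR in dB.

    Hypothetical helper for illustration; the noise is looped if it is
    shorter than the speech and truncated if it is longer.
    """
    if not speech or not noise:
        raise ValueError("speech and noise must be non-empty")
    # Loop the noise so it covers the whole utterance.
    noise = [noise[i % len(noise)] for i in range(len(speech))]
    noise_power = mean_power(noise)
    if noise_power == 0:
        raise ValueError("noise has zero power")
    # SNR_dB = 10 * log10(P_speech / P_noise), solved for the noise power.
    target_noise_power = mean_power(speech) / (10 ** (snr_db / 10))
    gain = math.sqrt(target_noise_power / noise_power)
    return [s + gain * n for s, n in zip(speech, noise)]


def measured_snr_db(clean, noisy):
    """SNR in dB of `noisy`, treating `noisy - clean` as the noise."""
    residual = [y - x for x, y in zip(clean, noisy)]
    return 10 * math.log10(mean_power(clean) / mean_power(residual))


# Synthetic stand-ins for real recordings (assumed 16 kHz sampling rate).
rng = random.Random(0)
speech = [math.sin(2 * math.pi * 440 * t / 16000) for t in range(16000)]
noise = [rng.gauss(0.0, 1.0) for _ in range(4000)]

for snr_db in (0, 5, 10):  # hypothetical levels, not taken from the paper
    noisy = mix_at_snr(speech, noise, snr_db)
    print(snr_db, round(measured_snr_db(speech, noisy), 2))
```

In a noisy-training setup, mixtures like these would be added to the training set as well as the test set, so that the acoustic model sees corrupted speech during training.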

Citation:

Pages: 761-770

Page count: 10

50 items in total

[1] A comparative study for Arabic speech recognition system in noisy environments
Abdelkbir Ouisaadane
Said Safi
[J]. International Journal of Speech Technology, 2021, 24 : 761 - 770
[2] An experimental framework for Arabic digits speech recognition in noisy environments
Touazi A.
Debyeche M.
[J]. International Journal of Speech Technology, 2017, 20 (2) : 205 - 224
[3] Prosodic Features and Formant Contribution for Arabic Speech Recognition in Noisy Environments
Amrous, Anissa Imen
Debyeche, Mohamed
Amrouche, Abderrahman
[J]. SOFT COMPUTING MODELS IN INDUSTRIAL AND ENVIRONMENTAL APPLICATIONS, 6TH INTERNATIONAL CONFERENCE SOCO 2011, 2011, 87 : 465 - 474
[4] A Comparative Study of Arabic Speech Recognition
Ali, Onsy Abdel Alim
Moselhy, Mohamed M.
Bzeih, Aya
[J]. 2012 16TH IEEE MEDITERRANEAN ELECTROTECHNICAL CONFERENCE (MELECON), 2012, : 884 - 887
[5] Robust Arabic speech recognition in noisy environments using prosodic features and formant
Amrous, Anissa
Debyeche, Mohamed
Amrouche, Abderrahman
[J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2011, 14 (04) : 351 - 359
[6] A robust speech recognition system for communication robots in noisy environments
Ishi, Carlos Toshinori
Matsuda, Shigeki
Kanda, Takayuki
Jitsuhiro, Takatoshi
Ishiguro, Hiroshi
Nakamura, Satoshi
Hagita, Norihiro
[J]. IEEE TRANSACTIONS ON ROBOTICS, 2008, 24 (03) : 759 - 763
[7] Optimal Automatic Speech Recognition System Selection for Noisy Environments
Tachioka, Yuuki
Narita, Tomohiro
[J]. 2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2016,
[8] SPEECH RECOGNITION IN NOISY ENVIRONMENTS - A SURVEY
GONG, YF
[J]. SPEECH COMMUNICATION, 1995, 16 (03) : 261 - 291
[9] Speech enhancement applied to speech recognition in noisy environments
Xu, Y.F.
[J]. Press of Tsinghua University, 2001, (41) :
[10] Multisensory benefits for speech recognition in noisy environments
Oh, Yonghee
Schwalm, Meg
Kalpin, Nicole
[J]. FRONTIERS IN NEUROSCIENCE, 2022, 16
