A comparative study for Arabic speech recognition system in noisy environments

被引:5
|
作者
Ouisaadane, Abdelkbir [1 ]
Safi, Said [1 ]
机构
[1] Sultan Moulay Slimane Univ, Polydisciplinary Fac, Dept Math & Comp Sci, Benimellal, Morocco
关键词
GMM-HMM; DNN-HMM; Noise; Arabic speech; HIDDEN MARKOV-MODELS; NEURAL-NETWORKS; DNN-HMM;
D O I
10.1007/s10772-021-09847-7
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Speech recognition in noisy environments is one of the long-standing research themes but remains a very important challenge nowadays. Therefore, there is much research into all techniques and approaches to improve the performance of speech recognition systems, even in poor conditions. This paper presents a comparative study under various conditions based on two architectures (GMM-HMM and DNN-HMM), the Hybrid GMM-HMM models using the CMU Sphinx tools and the Hybrid DNN-HMM using the KALDI toolkit in noise environment. In this study, we compare the Hybrid GMM-HMM models and the Hybrid DNN-HMM models to evaluate the performance of the proposed system. The novelty of this paper is to test if the presented tools could be, with good accuracy, recognize the Arabic speech principally in noisy environment. In addition, we adopted the noisy training theory in this paper based on GMM-HMM and DNN-HMM model. We use the public Arabic Speech Corpus for Isolated Words (20 words), three noise levels, and three noise types. The implementation of our system consists of two phases: Features extraction using Mel-frequency Cepstral Coefficient (MFCC) and the classification phase will use separately the previous two models. In order to test the performance of these methods a simulation will presented for different SNR and for different district type of noise.
引用
收藏
页码:761 / 770
页数:10
相关论文
共 50 条
  • [1] A comparative study for Arabic speech recognition system in noisy environments
    Abdelkbir Ouisaadane
    Said Safi
    [J]. International Journal of Speech Technology, 2021, 24 : 761 - 770
  • [2] An experimental framework for Arabic digits speech recognition in noisy environments
    Touazi A.
    Debyeche M.
    [J]. International Journal of Speech Technology, 2017, 20 (2) : 205 - 224
  • [3] Prosodic Features and Formant Contribution for Arabic Speech Recognition in Noisy Environments
    Amrous, Anissa Imen
    Debyeche, Mohamed
    Amrouche, Abderrahman
    [J]. SOFT COMPUTING MODELS IN INDUSTRIAL AND ENVIRONMENTAL APPLICATIONS, 6TH INTERNATIONAL CONFERENCE SOCO 2011, 2011, 87 : 465 - 474
  • [4] A Comparative Study of Arabic Speech Recognition
    Ali, Onsy Abdel Alim
    Moselhy, Mohamed M.
    Bzeih, Aya
    [J]. 2012 16TH IEEE MEDITERRANEAN ELECTROTECHNICAL CONFERENCE (MELECON), 2012, : 884 - 887
  • [5] Robust Arabic speech recognition in noisy environments using prosodic features and formant
    Amrous, Anissa
    Debyeche, Mohamed
    Amrouche, Abderrahman
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2011, 14 (04) : 351 - 359
  • [6] A robust speech recognition system for communication robots in noisy environments
    Ishi, Carlos Toshinori
    Matsuda, Shigeki
    Kanda, Takayuki
    Jitsuhiro, Takatoshi
    Ishiguro, Hiroshi
    Nakamura, Satoshi
    Hagita, Norihiro
    [J]. IEEE TRANSACTIONS ON ROBOTICS, 2008, 24 (03) : 759 - 763
  • [7] Optimal Automatic Speech Recognition System Selection for Noisy Environments
    Tachioka, Yuuki
    Narita, Tomohiro
    [J]. 2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2016,
  • [8] SPEECH RECOGNITION IN NOISY ENVIRONMENTS - A SURVEY
    GONG, YF
    [J]. SPEECH COMMUNICATION, 1995, 16 (03) : 261 - 291
  • [9] Speech enhancement applied to speech recognition in noisy environments
    [J]. Xu, Y.F., 2001, Press of Tsinghua University (41):
  • [10] Multisensory benefits for speech recognition in noisy environments
    Oh, Yonghee
    Schwalm, Meg
    Kalpin, Nicole
    [J]. FRONTIERS IN NEUROSCIENCE, 2022, 16