An experiment of Moroccan dialect speech recognition in noisy environments using PocketSphinx

被引:0
|
作者
Ouisaadane A. [1 ]
Safi S. [1 ]
Frikel M. [2 ]
机构
[1] Department of Mathematics and Computer Science Polydisciplinary Faculty, Sultan Moulay Slimane University, Benimellal
[2] GREYC Laboratory ENSICAEN School, LAC Laboratory Caen-Normandie University, Caen
关键词
ASR; HMM; Moroccan dialect; Noisy speech; Pocket sphinx;
D O I
10.1007/s10772-024-10103-x
中图分类号
学科分类号
摘要
In this study, we introduce an experimental framework for Moroccan dialect speech recognition under various additive noise conditions using the open-source tool PocketSphinx. We curated a corpus comprising the ten most commonly used greetings in the Moroccan dialect, extracted from telephone conversations. This corpus was recorded with 60 speakers (30 males and 30 females). Each speaker articulated each expression three times in natural and noisy conditions. Feature extraction utilized Mel Scale Cepstral Coefficients (MFCC), and acoustic modeling, based on monophony, was implemented using Hidden Markov Models (HMM). While automatic speech recognition systems demonstrate commendable performance in noise-free conditions, their efficacy noticeably diminishes in the presence of noise. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024.
引用
收藏
页码:329 / 339
页数:10
相关论文
共 50 条
  • [1] Automatic speech recognition for Moroccan dialect in noisy traffic environments
    Ezzine, Abderrahim
    Laaidi, Naouar
    Satori, Hassan
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 141
  • [2] Speech Recognition of Moroccan Dialect Using Hidden Markov Models
    Mouaz, Bezoui
    Abderrahim, Beni Hssane
    Abdelmajid, Elmoutaouakkil
    10TH INTERNATIONAL CONFERENCE ON AMBIENT SYSTEMS, NETWORKS AND TECHNOLOGIES (ANT 2019) / THE 2ND INTERNATIONAL CONFERENCE ON EMERGING DATA AND INDUSTRY 4.0 (EDI40 2019) / AFFILIATED WORKSHOPS, 2019, 151 : 985 - 991
  • [3] SPEECH RECOGNITION IN NOISY ENVIRONMENTS - A SURVEY
    GONG, YF
    SPEECH COMMUNICATION, 1995, 16 (03) : 261 - 291
  • [4] Improvement of the speech recognition in noisy environments using a nonparametric regression
    Amrouche, A.
    Taleb-Ahmed, A.
    Rouvaen, J. M.
    Yagoub, M. C. E.
    INTERNATIONAL JOURNAL OF PARALLEL EMERGENT AND DISTRIBUTED SYSTEMS, 2009, 24 (01) : 49 - 67
  • [5] Speech enhancement applied to speech recognition in noisy environments
    Xu, Y.F., 2001, Press of Tsinghua University (41):
  • [6] Speech Emotion Recognition in Noisy and Reverberant Environments
    Heracleous, Panikos
    Yasuda, Keiji
    Sugaya, Fumiaki
    Yoneyama, Akio
    Hashimoto, Masayuki
    2017 SEVENTH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), 2017, : 262 - 266
  • [7] Multisensory benefits for speech recognition in noisy environments
    Oh, Yonghee
    Schwalm, Meg
    Kalpin, Nicole
    FRONTIERS IN NEUROSCIENCE, 2022, 16
  • [8] Speech Recognition On Mobile Devices In Noisy Environments
    Yurtcan, Yaser
    Kilic, Banu Gunel
    2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [9] Emotion recognition from Moroccan dialect speech and energy band distribution
    Agrima, Abdellah
    Farchi, Abdelmajid
    Elmazouzi, Laila
    Mounir, Ilham
    Mounir, Badia
    2019 INTERNATIONAL CONFERENCE ON WIRELESS TECHNOLOGIES, EMBEDDED AND INTELLIGENT SYSTEMS (WITS), 2019,
  • [10] Speech Recognition using Deep Canonical Correlation Analysis in Noisy Environments
    Isobe, Shinnosuke
    Tamura, Satoshi
    Hayamizu, Satoru
    PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS (ICPRAM), 2021, : 63 - 70