Experiments on Automatic Recognition of Nonnative Arabic Speech

被引:10
|
作者
Alotaibi, Yousef Ajami [2 ]
Selouani, Sid-Ahmed [1 ]
O'Shaughnessy, Douglas [3 ]
机构
[1] Univ Moncton, Lab Rech & Interactiv Homme Syst LARIHS, Moncton, NB E8S 1P6, Canada
[2] King Saud Univ, Dept Comp Engn, Riyadh 11451, Saudi Arabia
[3] Univ Quebec, INRS Energie Mat Telecommun, Montreal, PQ H5A 1K6, Canada
关键词
Language Model; Speech Recognition System; Arabic Language; Female Speaker; Modern Standard Arabic;
D O I
10.1155/2008/679831
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The automatic recognition of foreign-accented Arabic speech is a challenging task since it involves a large number of nonnative accents. As well, the nonnative speech data available for training are generally insufficient. Moreover, as compared to other languages, the Arabic language has sparked a relatively small number of research efforts. In this paper, we are concerned with the problem of nonnative speech in a speaker independent, large-vocabulary speech recognition system for modern standard Arabic (MSA). We analyze some major differences at the phonetic level in order to determine which phonemes have a significant part in the recognition performance for both native and nonnative speakers. Special attention is given to specific Arabic phonemes. The performance of an HMM-based Arabic speech recognition system is analyzed with respect to speaker gender and its native origin. The West Point modern standard Arabic database from the language data consortium(LDC) and the hidden Markov Model Toolkit (HTK) are used throughout all experiments. Our study shows that the best performance in the overall phoneme recognition is obtained when nonnative speakers are involved in both training and testing phases. This is not the case when a language model and phonetic lattice networks are incorporated in the system. At the phonetic level, the results show that female nonnative speakers perform better than nonnative male speakers, and that emphatic phonemes yield a significant decrease in performance when they are uttered by both male and female nonnative speakers. Copyright (C) 2008 Yousef Ajami Alotaibi et al.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] Syllable-based automatic Arabic speech recognition in noisy enviroment
    Azmi, Mohamed M.
    Tolba, Hesham
    [J]. 2008 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING, VOLS 1 AND 2, PROCEEDINGS, 2008, : 1436 - 1441
  • [22] AUTOMATIC SPEECH RECOGNITION OF ARABIC MULTI-GENRE BROADCAST MEDIA
    Najafian, Maryam
    Hsu, Wei-Ning
    Ali, Ahmed
    Glass, James
    [J]. 2017 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2017, : 353 - 359
  • [23] Impact of Emotional Speech to Automatic Speaker Recognition - Experiments on GEES Speech Database
    Jokic, Ivan
    Jokic, Stevan
    Delic, Vlado
    Peric, Zoran
    [J]. SPEECH AND COMPUTER, 2014, 8773 : 268 - 275
  • [24] Modern standard Arabic speech corpus for implementing and evaluating automatic continuous speech recognition systems
    Abushariah, Mohammad Abd-Alrahman Mahmoud
    Ainon, Raja Noor
    Zainuddin, Roziati
    Alqudah, Assal Ali Mustafa
    Ahmed, Moustafa Elshafei
    Khalifa, Othman Omran
    [J]. JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2012, 349 (07): : 2215 - 2242
  • [25] Syllable-Based Automatic Arabic Speech Recognition in Different Conditions of Noise
    Azmi, Mohamed M.
    Tolba, Hesham
    [J]. ICSP: 2008 9TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-5, PROCEEDINGS, 2008, : 601 - +
  • [26] Emotion Recognition in Arabic Speech
    Klaylat, Samira
    Hamandi, Lama
    Osman, Ziad
    Zantout, Rached
    [J]. 2017 SENSORS NETWORKS SMART AND EMERGING TECHNOLOGIES (SENSET), 2017,
  • [27] Emotion recognition in Arabic speech
    Hadjadji, Imene
    Falek, Leila
    Demri, Lyes
    Teffahi, Hocine
    [J]. 2019 INTERNATIONAL CONFERENCE ON ADVANCED ELECTRICAL ENGINEERING (ICAEE), 2019,
  • [28] Emotion recognition in Arabic speech
    Klaylat, Samira
    Osman, Ziad
    Hamandi, Lama
    Zantout, Rached
    [J]. ANALOG INTEGRATED CIRCUITS AND SIGNAL PROCESSING, 2018, 96 (02) : 337 - 351
  • [29] Emotion recognition in Arabic speech
    Samira Klaylat
    Ziad Osman
    Lama Hamandi
    Rached Zantout
    [J]. Analog Integrated Circuits and Signal Processing, 2018, 96 : 337 - 351
  • [30] Automatic understanding of the spontaneous Arabic speech
    Zouaghi, Anis
    Zrigui, Mounir
    Antoniadis, Georges
    [J]. TRAITEMENT AUTOMATIQUE DES LANGUES, 2008, 49 (01): : 141 - 166