ALGERIAN ARABIC SPEECH DATABASE (ALGASD): CORPUS DESIGN AND AUTOMATIC SPEECH RECOGNITION APPLICATION

被引:0
|
作者
Droua-Hamdani, Ghania [1 ]
Selouani, Sid Ahmed [2 ]
Boudraa, Malika [3 ]
机构
[1] CRSTDLA, Speech Proc Lab, Algiers, Algeria
[2] Univ Moncton, LARIHS Lab, Moncton, NB E1A 3E9, Canada
[3] USTHB Univ, Speech Commun Lab, Algiers, Algeria
来源
关键词
speech corpus; Algerian speakers; modern standard Arabic; automatic speech recognition; hidden Markov models;
D O I
暂无
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
This paper presents the Algerian Arabic Speech Database (ALGASD), a Modern Standard Arabic (MSA) speech corpus composed of utterances pronounced by 300 Algerian native speakers selected from eleven regions of Algeria. One of the objectives of this corpus design is to be representative of the regional accents of MSA spoken in Algeria. Useful information related to the speakers, such as gender, age, and education level, is provided. This paper also reports the results of the Automatic Speech Recognition (ASR) application of the corpus and outlines an original global monophone recognition model designed to handle linguistic variability. The global phone recognition rate for this ASR reference system is satisfactory and may constitute a useful baseline ASR system dedicated to MSA.
引用
收藏
页码:157 / 166
页数:10
相关论文
共 50 条
  • [41] Using Automatic Speech Recognition in Spoken Corpus Curation
    Gorisch, Jan
    Gref, Michael
    Schmidt, Thomas
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 6423 - 6428
  • [42] Modern Arabic speech corpus for Text to Speech synthesis
    Oumaima, Zine
    Meziane, Abdelouafi
    2020 IEEE INTERNATIONAL CONFERENCE ON TECHNOLOGY MANAGEMENT, OPERATIONS AND DECISIONS (ICTMOD), 2020,
  • [43] AUTOMATIC SPEECH RECOGNITION AND ITS APPLICATION
    BRUNDAGE, WJ
    CONTROL ENGINEERING, 1983, 30 (04) : 117 - 117
  • [44] IITG-HingCoS corpus: A Hinglish code-switching database for automatic speech recognition
    Ganji, Sreeram
    Dhawan, Kunal
    Sinha, Rohit
    SPEECH COMMUNICATION, 2019, 110 : 76 - 89
  • [45] Impact of Emotional Speech to Automatic Speaker Recognition - Experiments on GEES Speech Database
    Jokic, Ivan
    Jokic, Stevan
    Delic, Vlado
    Peric, Zoran
    SPEECH AND COMPUTER, 2014, 8773 : 268 - 275
  • [46] QSDAS: NEW QURANIC SPEECH DATABASE FOR ARABIC SPEAKER RECOGNITION
    Harrag, A.
    Mohamadi, T.
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2010, 35 (2C) : 7 - 19
  • [47] PaSCoNT - Parallel Speech Corpus of Northern-central Thai for automatic speech recognition
    Taerungruang, Supawat
    Taninpong, Phimphaka
    Chunwijitra, Vataya
    Thatphithakkul, Sumonmas
    Kasuriya, Sawit
    Inthanon, Viroj
    Paksaranuwat, Pawat
    Thumronglaohapun, Salinee
    Nakharutai, Nawapon
    Inkeaw, Papangkorn
    Bootkrajang, Jakramate
    COMPUTER SPEECH AND LANGUAGE, 2025, 89
  • [48] Speech corpus recycling for acoustic cross-domain environments for automatic speech recognition
    Ichikawa, Osamu
    Rennie, Steven J.
    Fukuda, Takashi
    Willett, Daniel
    ACOUSTICAL SCIENCE AND TECHNOLOGY, 2016, 37 (02) : 55 - 65
  • [49] Arabic Intonatif Speech database
    Boukadida, F
    Ellouze, N
    2004 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY (ICIT), VOLS. 1- 3, 2004, : 1408 - 1411
  • [50] RODIGITS - A ROMANIAN CONNECTED-DIGITS SPEECH CORPUS FOR AUTOMATIC SPEECH AND SPEAKER RECOGNITION
    Georgescu, Alexandru Lucian
    Caranica, Alexandru
    Cucu, Horia
    Burileanu, Corneliu
    UNIVERSITY POLITEHNICA OF BUCHAREST SCIENTIFIC BULLETIN SERIES C-ELECTRICAL ENGINEERING AND COMPUTER SCIENCE, 2018, 80 (03): : 45 - 62