Modern Standard Arabic Speech Corpora: A Systematic Review

被引:2
|
作者
Alqadasi, Ammar Mohammed Ali [1 ,2 ]
Abdulghafor, Rawad [1 ]
Sunar, Mohd Shahrizal [3 ,4 ,5 ]
Salam, Md. Sah Bin H. J. [4 ]
机构
[1] Int Islamic Univ Malaysia, Fac Informat & Commun Technol, Natl Dept Comp Sci, Kuala Lumpur 53100, Malaysia
[2] Int Islamic Univ Malaysia, Fac Informat & Commun Technol, Natl Dept Comp Sci, Kuala Lumpur, Malaysia
[3] Arab Open Univ Oman, Fac Comp Studies FCS, Muscat 130, Oman
[4] Univ Teknol Malaysia, Fac Comp, Johor Baharu 81310, Malaysia
[5] Univ Teknol Malaysia, Inst Human Ctr Engn, Media & Game Innovat Ctr Excellence, Johor Baharu 81310, Malaysia
关键词
Databases; Speech processing; Standards; Speech recognition; Market research; Distributed databases; Systematics; Speech corpus; speech database; modern standard Arabic; MSA corpora; speech recognition; Arabic recognition; RECOGNITION SYSTEM; FEATURE-EXTRACTION; CORPUS; TEXT; CLASSIFICATION; RHYTHM; TRANSCRIPTION; IMPACT;
D O I
10.1109/ACCESS.2023.3282259
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Speech processing applications have become integral components across various domains of modern life. The design and preparation of a reliable recognition system rely heavily on the availability of suitable speech databases. While numerous speech databases exist for English and other languages, the availability of comprehensive resources for Arabic language remains limited. In light of this, we conducted a systematic review aiming to identify, analyse, and classify existing Modern Standard Arabic speech databases. Through our review, we identified 27 publicly available databases and analysed an additional 80 subjective databases. These databases were thoroughly studied, classified based on their characteristics, and subjected to a detailed analysis of research trends in the field. This paper provides a comprehensive discussion on the diverse speech databases developed for various speech processing applications. It sheds light on the purposes and unique characteristics of Arabic speech databases, enabling researchers to easily access suitable resources for their specific applications. The findings of this review contribute to bridging the gap in available Arabic speech databases and serve as a valuable resource for researchers in the field.
引用
收藏
页码:55771 / 55796
页数:26
相关论文
共 50 条
  • [21] A Student Grammar of Modern Standard Arabic
    Melaine, Hamida
    [J]. LANGUAGE LEARNING JOURNAL, 2005, 32 (01): : 80 - 80
  • [22] A Student Grammar of Modern Standard Arabic
    Kaye, Alan S.
    [J]. JOURNAL OF NEAR EASTERN STUDIES, 2009, 68 (03) : 233 - 234
  • [23] Polysyllabic shortening in Modern Standard Arabic
    Abu Guba, Mohammed Nour
    Mashaqba, Bassil
    Huneety, Anas
    [J]. JOURNAL OF SEMITIC STUDIES, 2023, 68 (02) : 759 - 770
  • [24] Shared Arguments in Modern Standard Arabic
    Alotaibi, Yasir Hameed
    [J]. INTERNATIONAL JOURNAL OF ENGLISH LINGUISTICS, 2018, 8 (01) : 164 - 183
  • [25] FOCUS TRANSFORMATION OF MODERN STANDARD ARABIC
    ANSHEN, F
    SCHREIBER, PA
    [J]. LANGUAGE, 1968, 44 (04) : 792 - 797
  • [26] Textual Entailment for Modern Standard Arabic
    Alabbas, Maytham
    [J]. INFORMATICA-AN INTERNATIONAL JOURNAL OF COMPUTING AND INFORMATICS, 2021, 45 (04): : 653 - 654
  • [27] A Reference Grammar of Modern Standard Arabic
    Barry, Sandra
    [J]. LANGUAGE LEARNING JOURNAL, 2006, 34 (01): : 79 - 80
  • [28] Modern Standard Arabic Readability Prediction
    Nassiri, Naoual
    Lakhouaja, Abdelhak
    Cavalli-Sforza, Violetta
    [J]. ARABIC LANGUAGE PROCESSING: FROM THEORY TO PRACTICE, 2018, 782 : 120 - 133
  • [29] The influence of English on Modern Standard Arabic speech reporting styles: A corpus-based study
    Al-Wahy, Ahmed Seddik
    [J]. LINGUA, 2021, 259
  • [30] Rhythmic Features across Modern Standard Arabic and Arabic Dialects
    Droua-Hamdani, Ghania
    Alotaibi, Yousef A.
    Selouani, Sid-Ahmed
    Boudraa, Malika
    [J]. LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014,