A Panoramic Survey of Natural Language Processing in the Arab World

Cited by: 35
Authors
Darwish, Kareem [1 ]
Habash, Nizar [2 ]
Abbas, Mourad [3 ]
Al-Khalifa, Hend [4 ]
Al-Natsheh, Hussein T. [5 ]
Bouamor, Houda [6 ]
Bouzoubaa, Karim [7 ]
Cavalli-Sforza, Violetta [8 ]
El-Beltagy, Samhaa R. [9 ]
El-Hajj, Wassim [10 ]
Jarrar, Mustafa [11 ]
Mubarak, Hamdy [1 ]
Affiliations
[1] Hamad Bin Khalifa Univ, Qatar Comp Res Inst, Doha, Qatar
[2] New York Univ Abu Dhabi, Abu Dhabi, U Arab Emirates
[3] Ctr Sci & Tech Res Dev Arab Language CRSTDLA, Bouzareah, Algeria
[4] King Saud Univ, Riyadh, Saudi Arabia
[5] Mawdoo3, Amman, Jordan
[6] Carnegie Mellon Univ, Doha, Qatar
[7] Mohammed V Univ, Rabat, Morocco
[8] Al Akhawayn Univ, Ifrane, Morocco
[9] Newgiza Univ, Cairo, Egypt
[10] Amer Univ Beirut, Beirut, Lebanon
[11] Birzeit Univ, Birzeit, Palestine
DOI
10.1145/3447735
Chinese Library Classification
TP3 [Computing Technology, Computer Technology];
Discipline classification code
0812;
Abstract
The term natural language refers to any system of symbolic communication (spoken, signed, or written) without intentional human planning and design. This distinguishes natural languages such as Arabic and Japanese from artificially constructed languages such as Esperanto or Python. Natural language processing (NLP) is the sub-field of artificial intelligence (AI) focused on modeling natural languages to build applications such as speech recognition and synthesis, machine translation, optical character recognition (OCR), sentiment analysis (SA), question answering, dialogue systems, etc. NLP is a highly interdisciplinary field with connections to computer science, linguistics, cognitive science, psychology, mathematics, and others. Some of the earliest AI applications were in NLP (e.g., machine translation), and the last decade (2010-2020) in particular has witnessed an incredible increase in quality, matched with a rise in public awareness, use, and expectations of what may have seemed like science fiction in the past. NLP researchers pride themselves on developing language-independent models and tools that can be applied to all human languages; e.g., machine translation systems can be built for a variety of languages using the same basic mechanisms and models. However, the reality is that some languages (e.g., English and Chinese) do get more attention than others (e.g., Hindi and Swahili). Arabic, the primary language of the Arab world and the religious language of millions of non-Arab Muslims, is somewhere in the middle of this continuum. Though Arabic NLP has many challenges, it has seen many successes and developments. Next, we discuss Arabic's main challenges as a necessary background, and we present a brief history of Arabic NLP. We then survey a number of its research areas, and close with a critical discussion of the future of Arabic NLP.
Pages: 72-81
Number of pages: 10