A Panoramic Survey of Natural Language Processing in the Arab World

被引:35
|
作者
Darwish, Kareem [1 ]
Habash, Nizar [2 ]
Abbas, Mourad [3 ]
Al-Khalifa, Hend [4 ]
Al-Natsheh, Huseein T. [5 ]
Bouamor, Houda [6 ]
Bouzoubaa, Karim [7 ]
Cavalli-Sforza, Violetta [8 ]
El-Beltagy, Samhaa R. [9 ]
El-Hajj, Wassim [10 ]
Jarrar, Mustafa [11 ]
Mubarak, Hamdy [1 ]
机构
[1] Hamad Bin Khalifa Univ, Qatar Comp Res Inst, Doha, Qatar
[2] New York Univ Abu Dhabi, Abu Dhabi, U Arab Emirates
[3] Ctr Sci & Tech Res Dev Arab Language CRSTDLA, Bouzareah, Algeria
[4] King Saud Univ, Riyadh, Saudi Arabia
[5] Mawdoo3, Amman, Jordan
[6] Carnegie Mellon Univ, Doha, Qatar
[7] Mohammed V Univ, Rabat, Morocco
[8] Al Akhawayn Univ, Ifrane, Morocco
[9] Newgiza Univ, Cairo, Egypt
[10] Amer Univ Beirut, Beirut, Lebanon
[11] Birzeit Univ, Birzeit, Palestine
关键词
10;
D O I
10.1145/3447735
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The term natural language refers to any system of symbolic communication (spoken, signed or written) without intentional human planning and design. This distinguishes natural languages such as Arabic and Japanese from artificially constructed languages such as Esperanto or Python. Natural language processing (NLP) is the sub-field of artificial intelligence (AI) focused on modeling natural languages to build applications such as speech recognition and synthesis, machine translation, optical character recognition (OCR), sentiment analysis (SA), question answering, dialogue systems, etc. NLP is a highly interdisciplinary field with connections to computer science, linguistics, cognitive science, psychology, mathematics and others. Some of the earliest AI applications were in NLP (e.g., machine translation); and the last decade (2010-2020) in particular has witnessed an incredible increase in quality, matched with a rise in public awareness, use, and expectations of what may have seemed like science fiction in the past. NLP researchers pride themselves on developing language independent models and tools that can be applied to all human languages, e.g. machine translation systems can be built for a variety of languages using the same basic mechanisms and models. However, the reality is that some languages do get more attention (e.g., English and Chinese) than others (e.g., Hindi and Swahili). Arabic, the primary language of the Arab world and the religious language of millions of non-Arab Muslims is somewhere in the middle of this continuum. Though Arabic NLP has many challenges, it has seen many successes and developments. Next we discuss Arabic's main challenges as a necessary background, and we present a brief history of Arabic NLP. We then survey a number of its research areas, and close with a critical discussion of the future of Arabic NLP.
引用
收藏
页码:72 / 81
页数:10
相关论文
共 50 条
  • [41] Pre-trained models for natural language processing: A survey
    Qiu XiPeng
    Sun TianXiang
    Xu YiGe
    Shao YunFan
    Dai Ning
    Huang XuanJing
    SCIENCE CHINA-TECHNOLOGICAL SCIENCES, 2020, 63 (10) : 1872 - 1897
  • [42] XNLP: A Living Survey for XAI Research in Natural Language Processing
    Qian, Kun
    Danilevsky, Marina
    Katsis, Yannis
    Kawas, Ban
    Oduor, Erick
    Popa, Lucian
    Li, Yunyao
    26TH INTERNATIONAL CONFERENCE ON INTELLIGENT USER INTERFACES (IUI '21 COMPANION), 2021, : 78 - 80
  • [43] A Survey of Natural Language Processing Implementation for Data Query Systems
    Wong, Albert
    Joiner, Dakota
    Chiu, Chunyin
    Elsayed, Mohamed
    Pereira, Keegan
    Khmelevsky, Youry
    Mahony, Joe
    IEEE INTERNATIONAL CONFERENCE ON RECENT ADVANCES IN SYSTEMS SCIENCE AND ENGINEERING (IEEE RASSE 2021), 2021,
  • [44] Survey: Finite-state technology in natural language processing
    Maletti, Andreas
    THEORETICAL COMPUTER SCIENCE, 2017, 679 : 2 - 17
  • [45] Adversarial attack and defense technologies in natural language processing: A survey
    Qiu, Shilin
    Liu, Qihe
    Zhou, Shijie
    Huang, Wen
    NEUROCOMPUTING, 2022, 492 : 278 - 307
  • [46] Applying Deep Learning and Natural Language Processing in Cancer: A Survey
    AbuSamra, Aiman Ahmad
    Al-Madhoun, Areej M. R.
    2021 PALESTINIAN INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY (PICICT 2021), 2021, : 103 - 115
  • [47] Pre-trained models for natural language processing: A survey
    QIU XiPeng
    SUN TianXiang
    XU YiGe
    SHAO YunFan
    DAI Ning
    HUANG XuanJing
    Science China(Technological Sciences), 2020, (10) : 1872 - 1897
  • [48] Pre-trained models for natural language processing: A survey
    XiPeng Qiu
    TianXiang Sun
    YiGe Xu
    YunFan Shao
    Ning Dai
    XuanJing Huang
    Science China Technological Sciences, 2020, 63 : 1872 - 1897
  • [49] Survey on Emerging Research on the Use of Natural Language Processing in Clinical Language Assessment of Children
    Solorio, Thamar
    LANGUAGE AND LINGUISTICS COMPASS, 2013, 7 (12): : 633 - 646
  • [50] Survey of Adversarial Attack, Defense and Robustness Analysis for Natural Language Processing
    Zheng H.
    Chen J.
    Zhang Y.
    Zhang X.
    Ge C.
    Liu Z.
    Ouyang Y.
    Ji S.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2021, 58 (08): : 1727 - 1750