A Panoramic Survey of Natural Language Processing in the Arab World

被引:35
|
作者
Darwish, Kareem [1 ]
Habash, Nizar [2 ]
Abbas, Mourad [3 ]
Al-Khalifa, Hend [4 ]
Al-Natsheh, Huseein T. [5 ]
Bouamor, Houda [6 ]
Bouzoubaa, Karim [7 ]
Cavalli-Sforza, Violetta [8 ]
El-Beltagy, Samhaa R. [9 ]
El-Hajj, Wassim [10 ]
Jarrar, Mustafa [11 ]
Mubarak, Hamdy [1 ]
机构
[1] Hamad Bin Khalifa Univ, Qatar Comp Res Inst, Doha, Qatar
[2] New York Univ Abu Dhabi, Abu Dhabi, U Arab Emirates
[3] Ctr Sci & Tech Res Dev Arab Language CRSTDLA, Bouzareah, Algeria
[4] King Saud Univ, Riyadh, Saudi Arabia
[5] Mawdoo3, Amman, Jordan
[6] Carnegie Mellon Univ, Doha, Qatar
[7] Mohammed V Univ, Rabat, Morocco
[8] Al Akhawayn Univ, Ifrane, Morocco
[9] Newgiza Univ, Cairo, Egypt
[10] Amer Univ Beirut, Beirut, Lebanon
[11] Birzeit Univ, Birzeit, Palestine
关键词
10;
D O I
10.1145/3447735
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The term natural language refers to any system of symbolic communication (spoken, signed or written) without intentional human planning and design. This distinguishes natural languages such as Arabic and Japanese from artificially constructed languages such as Esperanto or Python. Natural language processing (NLP) is the sub-field of artificial intelligence (AI) focused on modeling natural languages to build applications such as speech recognition and synthesis, machine translation, optical character recognition (OCR), sentiment analysis (SA), question answering, dialogue systems, etc. NLP is a highly interdisciplinary field with connections to computer science, linguistics, cognitive science, psychology, mathematics and others. Some of the earliest AI applications were in NLP (e.g., machine translation); and the last decade (2010-2020) in particular has witnessed an incredible increase in quality, matched with a rise in public awareness, use, and expectations of what may have seemed like science fiction in the past. NLP researchers pride themselves on developing language independent models and tools that can be applied to all human languages, e.g. machine translation systems can be built for a variety of languages using the same basic mechanisms and models. However, the reality is that some languages do get more attention (e.g., English and Chinese) than others (e.g., Hindi and Swahili). Arabic, the primary language of the Arab world and the religious language of millions of non-Arab Muslims is somewhere in the middle of this continuum. Though Arabic NLP has many challenges, it has seen many successes and developments. Next we discuss Arabic's main challenges as a necessary background, and we present a brief history of Arabic NLP. We then survey a number of its research areas, and close with a critical discussion of the future of Arabic NLP.
引用
收藏
页码:72 / 81
页数:10
相关论文
共 50 条
  • [21] A Survey on Backdoor Attack and Defense in Natural Language Processing
    Sheng, Xuan
    Han, Zhaoyang
    Li, Piji
    Chang, Xiangmao
    2022 IEEE 22ND INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY, QRS, 2022, : 809 - 820
  • [22] Graph Neural Networks for Natural Language Processing: A Survey
    Wu, Lingfei
    Chen, Yu
    Shen, Kai
    Guo, Xiaojie
    Gao, Hanning
    Li, Shucheng
    Pei, Jian
    Long, Bo
    FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2023, 16 (02): : 119 - 329
  • [23] A Survey of the Usages of Deep Learning for Natural Language Processing
    Otter, Daniel W.
    Medina, Julian R.
    Kalita, Jugal K.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (02) : 604 - 624
  • [24] Word embeddings for biomedical natural language processing: A survey
    Chiu, Billy
    Baker, Simon
    LANGUAGE AND LINGUISTICS COMPASS, 2020, 14 (12):
  • [25] A Survey on Using Gaze Behaviour for Natural Language Processing
    Mathias, Sandeep
    Kanojia, Diptesh
    Mishra, Abhijit
    Bhattacharya, Pushpak
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 4907 - 4913
  • [26] Local Interpretations for Explainable Natural Language Processing: A Survey
    Luo, Siwen
    Ivison, Hamish
    Han, Soyeon Caren
    Poon, Josiah
    ACM COMPUTING SURVEYS, 2024, 56 (09)
  • [27] Data augmentation approaches in natural language processing: A survey
    Li, Bohan
    Hou, Yutai
    Che, Wanxiang
    AI OPEN, 2022, 3 : 71 - 90
  • [28] A Survey of the State of Explainable AI for Natural Language Processing
    Danilevsky, Marina
    Qian, Kun
    Aharonov, Ranit
    Katsis, Yannis
    Kawas, Ban
    Sen, Prithviraj
    1ST CONFERENCE OF THE ASIA-PACIFIC CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 10TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (AACL-IJCNLP 2020), 2020, : 447 - 459
  • [29] SECNLP: A survey of embeddings in clinical natural language processing
    Kalyan, Katikapalli Subramanyam
    Sangeetha, S.
    JOURNAL OF BIOMEDICAL INFORMATICS, 2020, 101 (101)
  • [30] Processing natural language without natural language processing
    Brill, E
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, PROCEEDINGS, 2003, 2588 : 360 - 369