A Panoramic Survey of Natural Language Processing in the Arab World

Cited by: 35
Authors
Darwish, Kareem [1 ]
Habash, Nizar [2 ]
Abbas, Mourad [3 ]
Al-Khalifa, Hend [4 ]
Al-Natsheh, Hussein T. [5 ]
Bouamor, Houda [6 ]
Bouzoubaa, Karim [7 ]
Cavalli-Sforza, Violetta [8 ]
El-Beltagy, Samhaa R. [9 ]
El-Hajj, Wassim [10 ]
Jarrar, Mustafa [11 ]
Mubarak, Hamdy [1 ]
Affiliations
[1] Hamad Bin Khalifa Univ, Qatar Comp Res Inst, Doha, Qatar
[2] New York Univ Abu Dhabi, Abu Dhabi, U Arab Emirates
[3] Ctr Sci & Tech Res Dev Arab Language CRSTDLA, Bouzareah, Algeria
[4] King Saud Univ, Riyadh, Saudi Arabia
[5] Mawdoo3, Amman, Jordan
[6] Carnegie Mellon Univ, Doha, Qatar
[7] Mohammed V Univ, Rabat, Morocco
[8] Al Akhawayn Univ, Ifrane, Morocco
[9] Newgiza Univ, Cairo, Egypt
[10] Amer Univ Beirut, Beirut, Lebanon
[11] Birzeit Univ, Birzeit, Palestine
DOI
10.1145/3447735
Chinese Library Classification
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
The term natural language refers to any system of symbolic communication (spoken, signed or written) without intentional human planning and design. This distinguishes natural languages such as Arabic and Japanese from artificially constructed languages such as Esperanto or Python. Natural language processing (NLP) is the sub-field of artificial intelligence (AI) focused on modeling natural languages to build applications such as speech recognition and synthesis, machine translation, optical character recognition (OCR), sentiment analysis (SA), question answering, dialogue systems, etc. NLP is a highly interdisciplinary field with connections to computer science, linguistics, cognitive science, psychology, mathematics and others. Some of the earliest AI applications were in NLP (e.g., machine translation); and the last decade (2010-2020) in particular has witnessed an incredible increase in quality, matched with a rise in public awareness, use, and expectations of what may have seemed like science fiction in the past. NLP researchers pride themselves on developing language-independent models and tools that can be applied to all human languages, e.g., machine translation systems can be built for a variety of languages using the same basic mechanisms and models. However, the reality is that some languages do get more attention (e.g., English and Chinese) than others (e.g., Hindi and Swahili). Arabic, the primary language of the Arab world and the religious language of millions of non-Arab Muslims, is somewhere in the middle of this continuum. Though Arabic NLP has many challenges, it has seen many successes and developments. Next, we discuss Arabic's main challenges as a necessary background, and we present a brief history of Arabic NLP. We then survey a number of its research areas, and close with a critical discussion of the future of Arabic NLP.
Pages: 72 - 81
Number of pages: 10
Related papers
50 in total
  • [31] Modeling Language Variation and Universals: A Survey on Typological Linguistics for Natural Language Processing
    Ponti, Edoardo Maria
    O'Horan, Helen
    Berzak, Yevgeni
    Vulic, Ivan
    Reichart, Roi
    Poibeau, Thierry
    Shutova, Ekaterina
    Korhonen, Anna
    COMPUTATIONAL LINGUISTICS, 2019, 45 (03) : 559 - 601
  • [32] Understanding poetry using natural language processing tools: a survey
    De Sisto, Mirella
    Hernandez-Lorenzo, Laura
    de la Rosa, Javier
    Ros, Salvador
    Gonzalez-Blanco, Elena
    DIGITAL SCHOLARSHIP IN THE HUMANITIES, 2024, 39 (02) : 500 - 521
  • [33] SURVEY OF NATURAL LANGUAGE PROCESSING AND MACHINE TRANSLATION IN JAPAN.
    Nagao, Makoto
    Japan Annual Reviews in Electronics, Computers & Telecommunications: Computer Science & Technologi, 1982, : 64 - 70
  • [34] Pre-trained models for natural language processing: A survey
    QIU XiPeng
    SUN TianXiang
    XU YiGe
    SHAO YunFan
    DAI Ning
    HUANG XuanJing
    Science China(Technological Sciences), 2020, 63 (10) : 1872 - 1897
  • [35] Computational Politeness in Natural Language Processing: A Survey
    Priya, Priyanshu
    Firdaus, Mauajama
    Ekbal, Asif
    ACM COMPUTING SURVEYS, 2024, 56 (09)
  • [36] Natural Language Processing Meets Quantum Physics: A Survey and Categorization
    Wu, Sixuan
    Li, Jian
    Zhang, Peng
    Zhang, Yue
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 3172 - 3182
  • [37] A Survey on the Integration of Blockchain Smart Contracts and Natural Language Processing
    Song, Zikai
    Shen, Pengxu
    Liu, Chuan
    Liu, Chao
    Gao, Haoyu
    Lei, Hong
    PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND NETWORKS, VOL III, CENET 2023, 2024, 1127 : 467 - 477
  • [38] Natural Language Processing (NLP) based Text Summarization - A Survey
    Awasthi, Ishitva
    Gupta, Kuntal
    Bhogal, Prabjot Singh
    Anand, Sahejpreet Singh
    Soni, Piyush Kumar
    PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT 2021), 2021, : 1310 - 1317
  • [39] Natural language processing for similar languages, varieties, and dialects: A survey
    Zampieri, Marcos
    Nakov, Preslav
    Scherrer, Yves
    NATURAL LANGUAGE ENGINEERING, 2020, 26 (06) : 595 - 612
  • [40] Identification of Causal Dependencies by using Natural Language Processing: A Survey
    Nazaruka, Erika
    PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON EVALUATION OF NOVEL APPROACHES TO SOFTWARE ENGINEERING (ENASE), 2019, : 603 - 613