Towards a Cascade of Morpho-syntactic Tools for Arabic Natural Language Processing

被引:0
|
作者
Mesfar, Slim [1 ]
机构
[1] Univ Manouba, RIADI, Manouba, Tunisia
关键词
Arabic language; lexical analysis; agglutinative morphology; automatic vocalization; Named Entities Recognition; NooJ linguistic platform;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a cascade of morpho-syntactic tools to deal with Arabic natural language processing. It begins with the description of a large coverage formalization of the Arabic lexicon. The built electronic dictionary, named "El-DicAr", which stands for "Electronic Dictionary for Arabic", links inflectional, morphological, and syntactic-semantic information to the list of lemmas. Automated inflectional and derivational routines are applied to each lemma producing over 3 million inflected forms. El-DicAr represents the linguistic engine for the automatic analyzer, built through a lexical analysis module, and a cascade of morpho-syntactic tools including: a morphological analyzer, a spell-checker, a named entity recognition tool, an automatic annotator and tools for linguistic research and contextual exploration. The morphological analyzer identifies the component morphemes of the agglutinative forms using large coverage morphological grammars. The spell-checker corrects the most frequent typographical errors. The lexical analysis module handles the different vocalization statements in Arabic written texts. Finally, the named entity recognition tool is based on a combination of the morphological analysis results and a set of rules represented as local grammars.
引用
收藏
页码:150 / 162
页数:13
相关论文
共 50 条
  • [1] French language in America: morpho-syntactic approaches
    Auger, Julie
    [J]. FRENCH REVIEW, 2007, 80 (05): : 1173 - 1174
  • [2] A LINK-BASED MORPHO-SYNTACTIC PARSING SYSTEM FOR ARABIC
    Sadek, Ahmed
    Sakr, Mohamed
    Kouta, Mohamad
    Al-Raghi, Abdo
    [J]. PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER THEORY AND ENGINEERING (ICACTE 2009), VOLS 1 AND 2, 2009, : 1181 - 1188
  • [3] Morpho-Syntactic Tagging System Based on the Patterns Words for Arabic Texts
    El-Jihad, Abdelhamid
    Yousfi, Abdellah
    Si-Lhoussain, Aouragh
    [J]. INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2011, 8 (04) : 350 - 354
  • [5] The Evolution of Morpho-Syntactic Mood in Arabic: A View from early Christian Arabic Gospel Manuscripts
    Stokes, Phillip W.
    [J]. JOURNAL OF SEMITIC STUDIES, 2024, 69 (01) : 361 - 413
  • [6] On-line Morpho-Syntactic Processing in the Healthy and Aphasic Brain
    Schneider, Laurence
    Toepel, Ulrike
    Murray, Micah M.
    Clarke, Stephanie
    [J]. AOA2010, 48TH ACADEMY OF APHASIA PROCEEDINGS, 2010, 6 : 43 - +
  • [7] Morpho-syntactic processing of Arabic plurals after aphasia: dissecting lexical meaning from morpho-syntax within word boundaries
    Khwaileh, Tariq
    Body, Richard
    Herbert, Ruth
    [J]. COGNITIVE NEUROPSYCHOLOGY, 2015, 32 (06) : 340 - 367
  • [8] Morpho-Syntactic Abilities of Unbalanced Bilingual Children: A Closer Look at the Weaker Language
    Meir, Natalia
    [J]. FRONTIERS IN PSYCHOLOGY, 2018, 9
  • [9] Sentence Shortening via Morpho-Syntactic Annotated Data in Historical Language Learning
    Moritz, Maria
    Pavlek, Barbara
    Franzini, Greta
    Crane, Gregory
    [J]. ACM JOURNAL ON COMPUTING AND CULTURAL HERITAGE, 2016, 9 (01):
  • [10] LEFT ANTERIOR NEGATIVITIES (LANS) AND MORPHO-SYNTACTIC PROCESSING IN ERP READING STUDIES
    Marcinek, Bradley T.
    Ullman, Michael T.
    Drury, John E.
    [J]. JOURNAL OF COGNITIVE NEUROSCIENCE, 2013, : 253 - 253