Explicit Fine Grained Syntactic and Semantic Annotation of the Idafa Construction in Arabic

Authors
Hawwari, Abdelati [1]
Attia, Mohammed [2]
Ghoneim, Mahmoud [1]
Diab, Mona [1]
Affiliations
[1] George Washington Univ, Dept Comp Sci, Washington, DC 20052 USA
[2] Google Inc, Mountain View, CA USA
Keywords
Arabic; construct state; Idafa; annotation; Treebank; syntax; semantics
DOI
Not available
Chinese Library Classification
H [Languages, Writing]
Discipline classification code
05
Abstract
Idafa in traditional Arabic grammar is an umbrella construction that covers several phenomena, including what is expressed in English as noun-noun compounds and Saxon and Norman genitives. Idafa also participates in other constructions, such as quantifiers, quasi-prepositions, and adjectives. Identifying the various types of the Idafa construction (IC) is important for Natural Language Processing (NLP) applications. Noun-noun compounds exhibit special behaviour in most languages, which affects their semantic interpretation, so distinguishing them could have an impact on downstream NLP applications. The most comprehensive computational syntactic representation of the Arabic language is found in the LDC Arabic Treebank (ATB). Despite the ATB's coverage, ICs are not explicitly labeled in it, and there is no clear distinction between ICs of noun-noun relations and other traditional ICs. Hence, we devise a detailed syntactic and semantic typification process for the IC phenomenon in Arabic, targeting the ATB as a platform for this classification. We annotate the ATB with explicit IC labels and with further semantic characterization, which is useful for syntactic, semantic, and cross-language processing. Our typification of IC comprises three main syntactic IC types: False Idafas (FIC), Grammatical Idafas (GIC), and True Idafas (TIC), which are further divided into 10 syntactic subclasses. The TIC group is further classified into semantic relations. We devise a method for automatic IC labeling and compare its yield against the CATiB Treebank. Our evaluation shows that we achieve the same level of accuracy, but with the additional fine-grained classification into the various syntactic and semantic types.
Pages: 3569-3577
Number of pages: 9