Parsing Modern Standard Arabic using Treebank Resources

Authors
Al-Emran, Mostafa [1, 2]
Zaza, Sarween [2]
Shaalan, Khaled [2]
Affiliations
[1] Al Buraimi Univ Coll, Al Buraimi, Oman
[2] British Univ Dubai, Dubai, U Arab Emirates
Keywords
Statistical Parsing; Treebank; Arabic
DOI
Not available
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
A Treebank is a linguistic resource composed of a large collection of sentences that have been syntactically analyzed, manually annotated, and verified. Statistical Natural Language Processing (NLP) approaches have successfully used these annotations to develop basic NLP tasks such as tokenization, diacritization, part-of-speech tagging, and parsing, among others. In this paper, we address the problem of exploiting Treebank resources for statistical parsing of Modern Standard Arabic (MSA) sentences. Statistical parsing is significant for NLP tasks that take parsed text as input, such as Information Retrieval and Machine Translation. We conducted an experiment on the Penn Arabic Treebank (PATB); the parsing performance obtained in terms of Precision, Recall, and F-measure was 82.4%, 86.6%, and 84.4%, respectively.
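The three reported scores can be related with a short sketch. It assumes that the F-measure is the standard balanced F1, the harmonic mean of Precision and Recall; the abstract does not state which variant was used. The function name `f_measure` and its `beta` parameter are illustrative and do not come from the paper.

```python
def f_measure(precision: float, recall: float, beta: float = 1.0) -> float:
    """F-beta score; beta = 1.0 gives the harmonic mean of precision and recall.

    Assumption: the abstract's "F-measure" is the balanced F1 (beta = 1).
    """
    if precision < 0 or recall < 0:
        raise ValueError("precision and recall must be non-negative")
    denominator = beta ** 2 * precision + recall
    if denominator == 0:
        # Both scores are zero, so the F-measure is defined as zero here.
        return 0.0
    return (1 + beta ** 2) * precision * recall / denominator


# Precision and Recall as reported in the abstract.
score = f_measure(0.824, 0.866)
print(f"{score:.1%}")  # prints 84.4%
```

Under that assumption, the reported Precision and Recall reproduce the reported F-measure of 84.4%.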
Pages: 80-83
Page count: 4