Parsing Modern Standard Arabic using Treebank Resources

被引:0
|
作者
Al-Emran, Mostafa [1 ,2 ]
Zaza, Sarween [2 ]
Shaalan, Khaled [2 ]
机构
[1] Al Buraimi Univ Coll, Al Buraimi, Oman
[2] British Univ Dubai, Dubai, U Arab Emirates
关键词
Statistical Parsing; Treebank; Arabic;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A Treebank is a linguistic resource that is composed of a large collection of manually annotated and verified syntactically analyzed sentences. Statistical Natural Language Processing ( NLP) approaches have been successful in using these annotations for developing basic NLP tasks such as tokenization, diacritization, part-of-speech tagging, parsing, among others. In this paper, we address the problem of exploiting Treebank resources for statistical parsing of Modern Standard Arabic ( MSA) sentences. Statistical parsing is significant for NLP tasks that use parsed text as an input such as Information Retrieval, and Machine Translation. We conducted an experiment on Pen Arabic Treebank ( PATB) and the parsing performance obtained in terms of Precision, Recall, and F-measure was 82.4%, 86.6%, 84.4%, respectively.
引用
收藏
页码:80 / 83
页数:4
相关论文
共 50 条
  • [21] Textual Entailment for Modern Standard Arabic
    Alabbas, Maytham
    [J]. INFORMATICA-AN INTERNATIONAL JOURNAL OF COMPUTING AND INFORMATICS, 2021, 45 (04): : 653 - 654
  • [22] A Reference Grammar of Modern Standard Arabic
    Barry, Sandra
    [J]. LANGUAGE LEARNING JOURNAL, 2006, 34 (01): : 79 - 80
  • [23] Modern Standard Arabic Readability Prediction
    Nassiri, Naoual
    Lakhouaja, Abdelhak
    Cavalli-Sforza, Violetta
    [J]. ARABIC LANGUAGE PROCESSING: FROM THEORY TO PRACTICE, 2018, 782 : 120 - 133
  • [24] Rhythmic Features across Modern Standard Arabic and Arabic Dialects
    Droua-Hamdani, Ghania
    Alotaibi, Yousef A.
    Selouani, Sid-Ahmed
    Boudraa, Malika
    [J]. LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014,
  • [26] Parsing the Penn Chinese Treebank with semantic knowledge
    Xiong, DY
    Li, SL
    Liu, Q
    Lin, SX
    Qian, YL
    [J]. NATURAL LANGUAGE PROCESSING - IJCNLP 2005, PROCEEDINGS, 2005, 3651 : 70 - 81
  • [27] Cross-Lingual Dependency Parsing Using Code-Mixed TreeBank
    Zhang, Meishan
    Zhang, Yue
    Fu, Guohong
    [J]. 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 997 - 1006
  • [28] Camel Treebank: An Open Multi-genre Arabic Dependency Treebank
    Habash, Nizar
    AbuOdeh, Muhammed
    Taji, Dima
    Faraj, Reem
    El Gizuli, Jamila
    Kallas, Omar
    [J]. LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 2672 - 2681
  • [29] The Leeds Arabic Discourse Treebank: Annotating Discourse Connectives for Arabic
    Al-Saif, Amal
    Markert, Katja
    [J]. LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : 2046 - 2053
  • [30] Sentential object complements in Modern Standard Arabic
    Kaye, AS
    [J]. BULLETIN OF THE SCHOOL OF ORIENTAL AND AFRICAN STUDIES-UNIVERSITY OF LONDON, 2003, 66 : 89 - 90