Automatic Discourse Parsing of Arabic Texts: the Case of Attachments

被引:0
|
作者
Bettaieb, Hela [1 ]
Boujelben, Ines [2 ]
Keskes, Iskandar [3 ]
机构
[1] Miracl Lab Sfax, FSEGS, Fac Econ & Management Sfax, Sfax, Tunisia
[2] Gabes Univ, Miracl Lab Sfax, Gabes, Tunisia
[3] Gafsa Univ, Miracl Lab Sfax, ANLP Res Grp, Gafsa, Tunisia
关键词
Discourse Attachments; Discourse relations; Discourse analysis; Segmented discourse Reresentation Theory (SDRT); Elementary Discourse Units (EDUs);
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The study of the structure of the Arabic texts is considered as a modern concern. Its importance lies in its ability to determine the semantic and rhetorical meaning of the discourse. Through a coherent structural graph consisting of text units and rhetorical relations linking them. It also highlights its importance by employing it in several applications from the natural language processing field, for example the question/answer system, the automatic translation and the automated text summary system and the Acquisition of the Arabic terminology. The rhetorical analysis is based on three important pillars. The first pillar is to divide the text into text units. The second pillar is to look for structural links between different text units. The third pillar connects these units to each other through rhetorical relations with semantic meanings. In this context, our task of the automatic construction of discourse structure: the case of attachments falls within the third pillar of rhetorical analysis. This approach of rhetorical analysis is based on the segmented discourse representation theory (SDRT) within our proposed method and on the classifier RandomForest. Our method was tested on the corpus of test, where the F-measure was 73%..
引用
收藏
页码:128 / 132
页数:5
相关论文
共 50 条
  • [1] Annotation for and robust parsing of discourse structure on unrestricted texts
    Baldridge, Jason
    Asher, Nicholas
    Hunter, Julie
    [J]. ZEITSCHRIFT FUR SPRACHWISSENSCHAFT, 2007, 26 (02): : 213 - 239
  • [2] Automatic Structuring of Arabic Normative Texts
    Berrazega, Ines
    Faiz, Rim
    [J]. APPLIED COMPUTATIONAL INTELLIGENCE AND MATHEMATICAL METHODS: COMPUTATIONAL METHODS IN SYSTEMS AND SOFTWARE 2017, VOL. 2, 2018, 662 : 238 - 249
  • [3] Automatic minimal diacritization of Arabic texts
    Alnefaie, Rehab
    Azmi, Aqil M.
    [J]. ARABIC COMPUTATIONAL LINGUISTICS (ACLING 2017), 2017, 117 : 169 - 174
  • [4] PARSING ARABIC TEXTS USING REAL PATTERNS OF SYNTACTIC TREES
    Ben Fraj, F.
    Zribi, C. Ben Othmane
    Ben Ahmed, M.
    [J]. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2010, 35 (2C) : 83 - 101
  • [5] Events Automatic Extraction from Arabic Texts
    Hkiri, Emna
    Mallat, Souheyl
    Zrigui, Mounir
    [J]. INTERNATIONAL JOURNAL OF INFORMATION RETRIEVAL RESEARCH, 2016, 6 (01) : 36 - 51
  • [6] Clause-based Discourse Segmentation of Arabic Texts
    Keskes, Iskandar
    Benamara, Farah
    Belguith, Lamia Hadrich
    [J]. LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 2826 - 2832
  • [7] AUTOMATIC MERGING OF MARKED UP TEXTS FOR DICTIONARY ENTRY PARSING
    Moruz, Mihai Alex
    [J]. PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE 'LINQUISTIC RESOURCES AND TOOLS FOR PROCESSING THE ROMANIAN LANGUAGE', 2016, : 163 - 171
  • [8] Information Structure Parsing for Chinese Legal Texts: A Discourse Analysis Perspective
    Sun, Bo
    [J]. INTERNATIONAL JOURNAL OF TECHNOLOGY AND HUMAN INTERACTION, 2019, 15 (01) : 46 - 64
  • [9] AUTOMATIC SUMMARIZATION OF ARABIC TEXTS BASED ON RST TECHNIQUE
    Maaloul, Mohamed Hedi
    Keskes, Iskandar
    Belguith, Lamia Hadrich
    Blache, Philippe
    [J]. ICEIS 2010: PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS, VOL 2: ARTIFICIAL INTELLIGENCE AND DECISION SUPPORT SYSTEMS, 2010, : 434 - 437
  • [10] Integration of data sources in an automatic corrector of Arabic texts
    Outifa, Jaouad
    Aouragh, Si Lhoussain
    Ouatik, Said El Alaoui
    [J]. 2016 4TH IEEE INTERNATIONAL COLLOQUIUM ON INFORMATION SCIENCE AND TECHNOLOGY (CIST), 2016, : 344 - 348