Parsa: An open information extraction system for Persian

被引:3
|
作者
Rahat, Mahmoud [1 ]
Talebpour, Alireza [1 ]
机构
[1] Shahid Beheshti Univ, Fac Comp Sci & Engn, Tehran, Iran
关键词
D O I
10.1093/llc/fqy003
中图分类号
C [社会科学总论];
学科分类号
03 ; 0303 ;
摘要
This article presents Parsa as an open information extraction (OIE) system for Persian. Comparing with advanced English approaches, OIE has just started to develop in other languages. Existing systems apply information about the grammar and syntactic structures of the target language to gain domain independence (which is a key goal in OIE). To improve modeling these complex structures, Parsa introduces a novel set of Patterns based on tree format. The patterns also enable Parsa to define POS tags, and lexical constraints to reduce incorrect matches. Each Tree Pattern is placed inside a Package based on its type and priority. The Packages help Parsa to alleviate some challenges in processing Persian like null-subject problem and uninformative extraction. To make the extraction process simple and coherent, we separate matching template from extraction template. An efficient algorithm for matching patterns inside dependency parse of a sentence is presented as well. Our experiments showed that Parsa achieves better performance than the state of the art systems in Persian, and highly comparable with the existing approaches in English.
引用
收藏
页码:874 / 893
页数:20
相关论文
共 50 条
  • [41] A modular information extraction system
    Feldman, Ronen
    Regev, Yizhar
    Gorodetsky, Maya
    [J]. INTELLIGENT DATA ANALYSIS, 2008, 12 (01) : 51 - 71
  • [42] Acoustics Open Information System
    Shamaev, V. G.
    Gorshkov, A. B.
    [J]. ACOUSTICAL PHYSICS, 2019, 65 (06) : 778 - 786
  • [43] Acoustics Open Information System
    V. G. Shamaev
    A. B. Gorshkov
    [J]. Acoustical Physics, 2019, 65 : 778 - 786
  • [44] MILIE: Modular & Iterative Multilingual Open Information Extraction
    Kotnis, Bhushan
    Gashteovski, Kiril
    Onoro-Rubio, Daniel
    Shaker, Ammar
    Rodriguez-Tembras, Vanesa
    Takamoto, Makoto
    Niepert, Mathias
    Lawrence, Carolin
    [J]. PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 6939 - 6950
  • [45] Exploiting Duality in Open Information Extraction with Predicate Prompt
    Chen, Zhen
    Liu, Jingping
    Yang, Deqing
    Xiao, Yanghua
    Xu, Huimin
    Wang, Zongyu
    Xie, Rui
    Xian, Yunsen
    [J]. PROCEEDINGS OF THE 17TH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, WSDM 2024, 2024, : 125 - 133
  • [46] Dependency Parsing Representation Learning for Open Information Extraction
    Li Zekun
    Ning Nianwen
    Peng Chengcheng
    Wu Bin
    [J]. KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT I, 2021, 12815 : 433 - 444
  • [47] PragmaticOIE: a pragmatic open information extraction for Portuguese language
    Cleiton Fernando Lima Sena
    Daniela Barreiro Claro
    [J]. Knowledge and Information Systems, 2020, 62 : 3811 - 3836
  • [48] MinScIE: Citation-centered Open Information Extraction
    Lauscher, Anne
    Song, Yide
    Gashteovski, Kiril
    [J]. 2019 ACM/IEEE JOINT CONFERENCE ON DIGITAL LIBRARIES (JCDL 2019), 2019, : 386 - 387
  • [49] Event Detection Based on Open Information Extraction and Ontology
    Sahnoun, Sihem
    Elloumi, Samir
    Ben Yahia, Sadok
    [J]. COMPUTATIONAL COLLECTIVE INTELLIGENCE, PT I, 2019, 11683 : 244 - 255
  • [50] Improving Open Information Extraction with Distant Supervision Learning
    Han, Jiabao
    Wang, Hongzhi
    [J]. NEURAL PROCESSING LETTERS, 2021, 53 (05) : 3287 - 3306