Parsa: An open information extraction system for Persian

被引:3
|
作者
Rahat, Mahmoud [1 ]
Talebpour, Alireza [1 ]
机构
[1] Shahid Beheshti Univ, Fac Comp Sci & Engn, Tehran, Iran
关键词
D O I
10.1093/llc/fqy003
中图分类号
C [社会科学总论];
学科分类号
03 ; 0303 ;
摘要
This article presents Parsa as an open information extraction (OIE) system for Persian. Comparing with advanced English approaches, OIE has just started to develop in other languages. Existing systems apply information about the grammar and syntactic structures of the target language to gain domain independence (which is a key goal in OIE). To improve modeling these complex structures, Parsa introduces a novel set of Patterns based on tree format. The patterns also enable Parsa to define POS tags, and lexical constraints to reduce incorrect matches. Each Tree Pattern is placed inside a Package based on its type and priority. The Packages help Parsa to alleviate some challenges in processing Persian like null-subject problem and uninformative extraction. To make the extraction process simple and coherent, we separate matching template from extraction template. An efficient algorithm for matching patterns inside dependency parse of a sentence is presented as well. Our experiments showed that Parsa achieves better performance than the state of the art systems in Persian, and highly comparable with the existing approaches in English.
引用
收藏
页码:874 / 893
页数:20
相关论文
共 50 条
  • [31] A systematic mapping study on open information extraction
    Glauber, Rafael
    Claro, Daniela Barreiro
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2018, 112 : 372 - 387
  • [32] A meta learning approach for open information extraction
    Jiabao Han
    Hongzhi Wang
    [J]. Neural Computing and Applications, 2022, 34 : 12681 - 12694
  • [33] Transformer based network for Open Information Extraction
    Han, Jiabao
    Wang, Hongzhi
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 102
  • [34] Generative adversarial networks for open information extraction
    Jiabao Han
    Hongzhi Wang
    [J]. Advances in Computational Intelligence, 2021, 1 (4):
  • [35] A Lexicalized Tree Kernel for Open Information Extraction
    Xu, Ying
    Ringlstetter, Christoph
    Kim, Mi-Young
    Goebel, Randy
    Kondrak, Grzegorz
    Miyao, Yusuke
    [J]. PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL) AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (IJCNLP), VOL 2, 2015, : 279 - 284
  • [36] Open Information Extraction with Global Structure Constraints
    Zhu, Qi
    Ren, Xiang
    Shang, Jingbo
    Zhang, Yu
    Xu, Frank F.
    Han, Jiawei
    [J]. COMPANION PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2018 (WWW 2018), 2018, : 57 - 58
  • [37] Pattern Learning for Chinese Open Information Extraction
    Li, Yang
    Miao, Qingliang
    Guo, Tong
    Geng, Ji
    Hu, Changjian
    Xu, Feiyu
    [J]. KNOWLEDGE GRAPH AND SEMANTIC COMPUTING: KNOWLEDGE COMPUTING AND LANGUAGE UNDERSTANDING (CCKS 2018), 2019, 957 : 78 - 90
  • [38] A modular information extraction system
    Feldman, Ronen
    Regev, Yizhar
    Gorodetsky, Maya
    [J]. INTELLIGENT DATA ANALYSIS, 2008, 12 (01) : 51 - 71
  • [39] Data Augmentation and Preparation Process of PerInfEx: A Persian Chatbot With the Ability of Information Extraction
    Safari, Pegah
    Shamsfard, Mehrnoush
    [J]. IEEE ACCESS, 2024, 12 : 19158 - 19180
  • [40] Acoustics Open Information System
    V. G. Shamaev
    A. B. Gorshkov
    [J]. Acoustical Physics, 2019, 65 : 778 - 786