Bootstrapping Multilingual Relation Discovery Using English Wikipedia and Wikimedia-Induced Entity Extraction

被引:1
|
作者
Schone, Patrick [1 ]
Allison, Tim [2 ]
Giannella, Chris [2 ]
Pfeifer, Craig [2 ]
机构
[1] FamilySearch, 50 E N Temple St, Salt Lake City, UT 84150 USA
[2] MITRE Corp, Annapolis Jct, MD 20701 USA
关键词
multilingual relation extraction; Wikipedia;
D O I
10.1109/ICTAI.2011.163
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Relation extraction has been a subject of significant study over the past decade. Most relation extractors have been developed by combining the training of complex computational systems on large volumes of annotations with extensive rule-writing by language experts. Moreover, many relation extractors are reliant on other non-trivial NLP technologies which themselves are developed through significant human efforts, such as entity tagging, parsing, etc. Due to the high cost of creating and assembling the required resources, relation extractors have typically been developed for only high-resourced languages. In this paper, we describe a near-zero-cost methodology to build relation extractors for significantly distinct non-English languages using only freely available Wikipedia and other web documents, and some knowledge of English. We apply our method to build alma-mater, birthplace, father, occupation, and spouse relation extractors in Greek, Spanish, Russian, and Chinese. We conduct evaluations of induced relations at the file level - the most refined we have seen in the literature.
引用
收藏
页码:944 / 951
页数:8
相关论文
共 40 条
  • [1] Multilingual Entity and Relation Extraction Dataset and Model
    Seganti, Alessandro
    Firlag, Klaudia
    Skowronska, Helena
    Satlawa, Michal
    Andruszkiewicz, Piotr
    [J]. 16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 1946 - 1955
  • [2] Bootstrapping Joint Entity and Relation Extraction with Reinforcement Learning
    Xia, Min
    Cheng, Xiang
    Su, Sen
    Kuang, Ming
    Li, Gang
    [J]. WEB INFORMATION SYSTEMS ENGINEERING - WISE 2022, 2022, 13724 : 418 - 432
  • [3] Reducing Semantic Drift in Bootstrapping for Entity Relation Extraction
    Chen Sijia
    Li Yan
    Chen Guang
    [J]. PROCEEDINGS 2013 INTERNATIONAL CONFERENCE ON MECHATRONIC SCIENCES, ELECTRIC ENGINEERING AND COMPUTER (MEC), 2013, : 1947 - 1950
  • [4] A named entity relation extraction method based on bootstrapping
    He Tingting
    Xu Chao
    Li Jing
    Zhao Junzhe
    [J]. 2005 INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND TECHNOLOGY, PROCEEDINGS, 2005, : 758 - 763
  • [5] Multilingual Entity, Relation, Event and Human Value Extraction
    Li, Manling
    Lin, Ying
    Hoover, Joseph
    Whitehead, Spencer
    Voss, Clare R.
    Dehghani, Morteza
    Ji, Heng
    [J]. NAACL HLT 2019: THE 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES: PROCEEDINGS OF THE DEMONSTRATIONS SESSION, 2019, : 110 - 115
  • [6] Named Entity Relation Mining Using Wikipedia
    Iftene, Adrian
    Balahur-Dobrescu, Alexandra
    [J]. SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 763 - 766
  • [7] RoRED: Bootstrapping labeling rule discovery for robust relation extraction
    Hou, Wenjun
    Hong, Liang
    Xu, Haoshuai
    Yin, Wei
    [J]. INFORMATION SCIENCES, 2023, 629 : 62 - 76
  • [8] Relation Extraction and Discovery from Free Texts via Bootstrapping
    Yang, Yunlong
    Luo, Jie
    [J]. 2017 10TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 2, 2017, : 116 - 121
  • [9] A Bootstrapping Algorithm for Geo-Entity Relation Extraction from Online Encyclopedia
    Yu, Li
    Lu, Feng
    [J]. 2015 23RD INTERNATIONAL CONFERENCE ON GEOINFORMATICS, 2015,
  • [10] Using Graph Based Method to Improve Bootstrapping Relation Extraction
    Li, Haibo
    Bollegala, Danushka
    Matsuo, Yutaka
    Ishizuka, Mitsuru
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, PT II, 2011, 6609 : 127 - 138