RoRED: Bootstrapping labeling rule discovery for robust relation extraction

Cited by: 2
Authors
Hou, Wenjun [1]
Hong, Liang [1]
Xu, Haoshuai [1]
Yin, Wei [1]
Affiliations
[1] Wuhan University, School of Information Management, Wuhan, People's Republic of China
Funding
National Natural Science Foundation of China
Keywords
Relation extraction; Rule discovery; Bootstrapping;
DOI
10.1016/j.ins.2023.01.132
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Labeling rules can be leveraged to produce training data by matching sentences in a corpus. However, the robustness of relation extraction is reduced by noisy labels arising from incorrectly matched sentences and from sentences the rules fail to match. To address this problem, we propose a bootstrapping labeling rule discovery method for robust relation extraction (RoRED). Specifically, we first define PN-rules, which filter incorrectly matched sentences based on positive (P) and negative (N) rules. Second, we design a semantic-matching mechanism that matches the missing sentences based on semantic associations between rules, words, and sentences. Moreover, we present a co-training-based rule verification approach to refine the labels of matched sentences and improve the overall quality of bootstrapped rule discovery. Experiments on a real-world dataset indicate that RoRED achieves at least a 20% gain in F1 score compared to state-of-the-art methods.
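
The abstract does not give the formal definition of a PN-rule, so the following is only a minimal illustrative sketch of the general idea it describes: a positive pattern proposes a label, and negative patterns veto sentences that were matched incorrectly. The class name `PNRule`, the use of regular expressions as patterns, and the example relation `founder_of` are all assumptions made for illustration and are not taken from the paper.

```python
import re
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class PNRule:
    """Hypothetical PN-rule: one positive pattern plus optional negative patterns."""
    relation: str                 # label assigned when the rule fires
    positive: str                 # regex that must match the sentence
    negatives: List[str] = field(default_factory=list)  # regexes that veto a match

    def label(self, sentence: str) -> Optional[str]:
        # The positive (P) part decides whether the sentence is a candidate at all.
        if not re.search(self.positive, sentence, re.IGNORECASE):
            return None
        # The negative (N) part filters out candidates that were matched incorrectly.
        if any(re.search(n, sentence, re.IGNORECASE) for n in self.negatives):
            return None
        return self.relation


# Assumed example rule: "founded" suggests a founder relation,
# unless the sentence only states a founding year.
rule = PNRule(
    relation="founder_of",
    positive=r"\bfounded\b",
    negatives=[r"\bfounded in \d{4}\b"],
)

sentences = [
    "Steve Jobs founded Apple.",
    "The company was founded in 1976.",
    "Steve Jobs visited the campus.",
]
labels = [rule.label(s) for s in sentences]
```

In this sketch the second sentence matches the positive pattern but is rejected by the negative one, which is the filtering behaviour the abstract attributes to PN-rules. The semantic-matching and co-training steps mentioned in the abstract are not modelled here.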
Pages: 62-76
Page count: 15