Revisiting the Negative Data of Distantly Supervised Relation Extraction

被引:0
|
作者
Xie, Chenhao [1 ,2 ]
Liang, Jiaqing [1 ,2 ]
Liu, Jingping [1 ]
Huang, Chengsong [1 ]
Huang, Wenhao [1 ]
Xiao, Yanghua [1 ,3 ]
机构
[1] Fudan Univ, Sch Comp Sci, Shanghai Key Lab Data Sci, Shanghai, Peoples R China
[2] Shuyan Technol Inc, Shanghai, Peoples R China
[3] Fudan Aishu Cognit Intelligence Joint Res Ctr, Shanghai, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Distantly supervision automatically generates plenty of training samples for relation extraction. However, it also incurs two major problems: noisy labels and imbalanced training data. Previous works focus more on reducing wrongly labeled relations (false positives) while few explore the missing relations that are caused by incompleteness of knowledge base (false negatives). Furthermore, the quantity of negative labels overwhelmingly surpasses the positive ones in previous problem formulations. In this paper, we first provide a thorough analysis of the above challenges caused by negative data. Next, we formulate the problem of relation extraction into as a positive unlabeled learning task to alleviate false negative problem. Thirdly, we propose a pipeline approach, dubbed RERE, that first performs sentence classification with relational labels and then extracts the subjects/objects. Experimental results show that the proposed method consistently outperforms existing approaches and remains excellent performance even learned with a large quantity of false positive samples. Source code is available online(1).
引用
收藏
页码:3572 / 3581
页数:10
相关论文
共 50 条
  • [1] Exploring Long Tail Data in Distantly Supervised Relation Extraction
    Gui, Yaocheng
    Liu, Qian
    Zhu, Man
    Gao, Zhiqiang
    [J]. NATURAL LANGUAGE UNDERSTANDING AND INTELLIGENT APPLICATIONS (NLPCC 2016), 2016, 10102 : 514 - 522
  • [2] Distantly Supervised Relation Extraction in Federated Settings
    Sui, Dianbo
    Chen, Yubo
    Liu, Kang
    Zhao, Jun
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 569 - 583
  • [3] Survey on Distantly-Supervised Relation Extraction
    Yang S.-Z.
    Liu Y.-X.
    Zhang K.-W.
    Hong Y.
    Huang H.
    [J]. Jisuanji Xuebao/Chinese Journal of Computers, 2021, 44 (08): : 1636 - 1660
  • [4] Bias Modeling for Distantly Supervised Relation Extraction
    Xiang, Yang
    Zhang, Yaoyun
    Wang, Xiaolong
    Qin, Yang
    Han, Wenying
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2015, 2015
  • [5] Reinforcement Learning for Distantly Supervised Relation Extraction
    Sun, Tingting
    Zhang, Chunhong
    Ji, Yang
    Hu, Zheng
    [J]. IEEE ACCESS, 2019, 7 : 98023 - 98033
  • [6] Learning labeling functions in distantly supervised relation extraction
    Gui, Yaocheng
    Liu, Qian
    Gao, Zhiqiang
    [J]. INTELLIGENT DATA ANALYSIS, 2020, 24 (02) : 427 - 443
  • [7] Incorporating Instance Correlations in Distantly Supervised Relation Extraction
    Zhang, Luhao
    Hu, Linmei
    Shi, Chuan
    [J]. SEMANTIC TECHNOLOGY, JIST 2019: PROCEEDINGS, 2020, 12032 : 177 - 191
  • [8] Dynamic Chunkwise CNN for Distantly Supervised Relation Extraction
    Liu, Fangbing
    Wang, Qing
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 738 - 747
  • [9] Distantly Supervised Neural Network Model for Relation Extraction
    Wang, Zhen
    Chang, Baobao
    Sui, Zhifang
    [J]. CHINESE COMPUTATIONAL LINGUISTICS AND NATURAL LANGUAGE PROCESSING BASED ON NATURALLY ANNOTATED BIG DATA (CCL 2015), 2015, 9427 : 253 - 266
  • [10] Semantic piecewise convolutional neural network with adaptive negative training for distantly supervised relation extraction
    Yu, Mei
    Chen, Yunke
    Zhao, Mankun
    Xu, Tianyi
    Yu, Jian
    Yu, Ruiguo
    Liu, Hongwei
    Li, Xuewei
    [J]. NEUROCOMPUTING, 2023, 537 : 12 - 21