Distant supervision for relation extraction with hierarchical attention-based networks

Cited by: 5
Authors
Zhang, Jing [1]
Cao, Meilin [2]
Affiliations
[1] Southeast Univ, Sch Cyber Sci & Engn, 2 SEU Rd, Nanjing 211189, Peoples R China
[2] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, 200 Xiaolingwei St, Nanjing 210094, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Distant supervision; Relation extraction; Multi-instance learning; Attention mechanism;
DOI
10.1016/j.eswa.2023.119727
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Distant supervision uses external knowledge bases to label corpora automatically. The labeled sentences are usually grouped into bags and used to train relation extraction models under a multi-instance learning paradigm. Because the labeling is automatic, it inevitably introduces label noise. Previous studies that de-noised with sentence-level attention considered neither the correlations among sentences within a bag nor the correlations among bags. As a result, a large amount of useful supervision information is lost, which degrades the performance of the learned relation extraction models. Moreover, these methods overlook the scarcity of feature information in bags with few sentences (especially one-sentence bags). To address these issues, this paper proposes hierarchical attention-based networks that de-noise at both the sentence and bag levels. When computing a bag representation, sentence-level attention that accounts for correlations among the sentences in each bag assigns weights to the sentence representations. Bag-level attention then combines similar bags according to their correlations, which enriches target bags that carry little feature information and yields more appropriate weights when computing the bag-group representation. Both attention levels make fuller use of the supervision information to improve model performance. The proposed method was compared with nine state-of-the-art methods on the New York Times datasets and the Google IISc Distant Supervision dataset, and the experimental results show clear advantages in relation extraction tasks.
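The two attention levels described in the abstract can be illustrated with a minimal NumPy sketch. The abstract does not give the scoring functions, so everything here is an assumption made for illustration: correlation is taken to be dot-product similarity, weights come from a softmax, and the function names `sentence_level_attention` and `bag_level_attention` are hypothetical. It is not the authors' network, which would use learned encoders and parameters.

```python
import numpy as np


def softmax(x):
    """Numerically stable softmax over a 1-D array."""
    e = np.exp(x - np.max(x))
    return e / e.sum()


def sentence_level_attention(sentences):
    """Weight the sentences of one bag by their correlation with each other.

    sentences: array of shape (n, d), one row per sentence representation.
    Returns (bag_vector of shape (d,), weights of shape (n,)).
    Assumption: a sentence's score is its mean dot-product similarity
    to the other sentences in the same bag.
    """
    n = sentences.shape[0]
    if n == 1:
        # A one-sentence bag has nothing to correlate with: weight is 1.
        scores = np.zeros(1)
    else:
        sim = sentences @ sentences.T                      # (n, n)
        scores = (sim.sum(axis=1) - np.diag(sim)) / (n - 1)
    weights = softmax(scores)
    return weights @ sentences, weights


def bag_level_attention(bag_vectors, target_index):
    """Enrich a target bag with similar bags from the same group.

    bag_vectors: array of shape (m, d), one row per bag representation.
    Assumption: each bag is scored by its dot product with the target bag.
    Returns (group_vector of shape (d,), weights of shape (m,)).
    """
    scores = bag_vectors @ bag_vectors[target_index]       # (m,)
    weights = softmax(scores)
    return weights @ bag_vectors, weights


# Toy usage: three bags with 1, 3, and 5 random sentence vectors each.
rng = np.random.default_rng(0)
bags = [rng.normal(size=(k, 8)) for k in (1, 3, 5)]
bag_vectors = np.stack([sentence_level_attention(b)[0] for b in bags])
group_vector, bag_weights = bag_level_attention(bag_vectors, target_index=0)
```

In this sketch the one-sentence bag (index 0) keeps its single sentence unchanged at the sentence level, and the bag-level step then mixes in information from the other bags in proportion to their similarity, which is the role the abstract assigns to bag-level attention for bags with few sentences.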
Pages: 9