Tell me your position: Distantly supervised biomedical entity relation extraction using entity position marker

Cited by: 2
Authors
Zhu, Jiran [1]
Dong, Jikun [1]
Du, Hongyun [1]
Geng, Yanfang [1]
Fan, Shengyu [1]
Yu, Hui [1]
Shao, Zengzhen [2]
Wang, Xia [3]
Yang, Yaping [3]
Xu, Weizhi [1]
Affiliations
[1] Shandong Normal Univ, Sch Informat Sci & Engn, Jinan, Peoples R China
[2] Shandong Womens Univ, Sch Data & Comp Sci, Jinan, Peoples R China
[3] AiLife Diagnost, Pearland, TX USA
Funding
National Natural Science Foundation of China
Keywords
Deep neural network; Natural language processing; Distant supervision; Biomedical entity relation extraction; Position marker;
DOI
10.1016/j.neunet.2023.09.043
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Advances in biomedical technology have recently produced a large amount of textual data in the biomedical domain. Distant supervision makes it possible to obtain large-scale biomedical data automatically. However, the noisy data that distant supervision introduces makes relation extraction more difficult. Previous work has focused mainly on restoring mislabeled relations and has paid little attention to the importance of labeled entity positions for relation extraction. In this paper, we present a "four-stage" model based on BioBERT and multi-instance learning that uses entity position markers. First, each sentence is annotated with position markers. Second, BioBERT, a biomedical pre-trained language model, encodes the sentence, and the final sentence feature vector is built not only from the global position marker but also from the start and end markers of both the head and tail entities. Third, the sentence vectors in a bag are aggregated into a bag-level feature vector using three aggregation methods, and the performance of different sentence feature vectors combined with different bag encoding methods is discussed. Finally, relation classification is performed at the bag level. Experimental results show that the presented model significantly outperforms all baseline models and contributes to noise reduction. In addition, each bag encoding method needs to be paired with a matching sentence encoding representation to achieve the best performance.
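The first and third stages described in the abstract (marking entity positions, then aggregating sentence vectors into a bag vector) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the marker strings `[E1]`, `[/E1]`, `[E2]`, `[/E2]`, the function names `mark_entities` and `aggregate_bag`, and the choice of mean, max, and dot-product attention as the three aggregation methods are all assumptions, since the abstract names none of them. The BioBERT encoding stage is omitted; the sentence vectors here are plain lists of floats standing in for encoder output.

```python
import math
from typing import Dict, List, Optional, Sequence, Tuple

# Hypothetical marker tokens; the abstract does not say which strings are used.
HEAD_START, HEAD_END = "[E1]", "[/E1]"
TAIL_START, TAIL_END = "[E2]", "[/E2]"


def mark_entities(tokens: Sequence[str],
                  head: Tuple[int, int],
                  tail: Tuple[int, int]) -> List[str]:
    """Wrap two non-overlapping spans (start inclusive, end exclusive) in markers."""
    inserts: Dict[int, List[str]] = {}  # token index -> markers placed before it
    for (start, end), (open_m, close_m) in ((head, (HEAD_START, HEAD_END)),
                                            (tail, (TAIL_START, TAIL_END))):
        inserts.setdefault(start, []).append(open_m)
        # Closing markers go first so adjacent spans nest correctly.
        inserts.setdefault(end, []).insert(0, close_m)
    marked: List[str] = []
    for i in range(len(tokens) + 1):
        marked.extend(inserts.get(i, []))
        if i < len(tokens):
            marked.append(tokens[i])
    return marked


def aggregate_bag(sentence_vecs: Sequence[Sequence[float]],
                  method: str = "mean",
                  query: Optional[Sequence[float]] = None) -> List[float]:
    """Combine the sentence vectors of one bag into a single bag vector."""
    dim = len(sentence_vecs[0])
    if method == "mean":
        n = len(sentence_vecs)
        return [sum(v[d] for v in sentence_vecs) / n for d in range(dim)]
    if method == "max":
        return [max(v[d] for v in sentence_vecs) for d in range(dim)]
    if method == "attention":
        if query is None:
            raise ValueError("attention aggregation needs a query vector")
        # Dot-product scores, then a numerically stable softmax over the bag.
        scores = [sum(q * x for q, x in zip(query, v)) for v in sentence_vecs]
        top = max(scores)
        exps = [math.exp(s - top) for s in scores]
        total = sum(exps)
        weights = [e / total for e in exps]
        return [sum(w * v[d] for w, v in zip(weights, sentence_vecs))
                for d in range(dim)]
    raise ValueError(f"unknown aggregation method: {method}")


if __name__ == "__main__":
    sentence = ["aspirin", "inhibits", "COX-1", "activity"]
    print(mark_entities(sentence, head=(0, 1), tail=(2, 3)))
    bag = [[1.0, 4.0], [3.0, 2.0]]  # stand-ins for encoded sentences
    print(aggregate_bag(bag, "mean"), aggregate_bag(bag, "max"))
```

In a full pipeline the marked token sequence would be passed to the encoder, and the bag vector would feed a relation classifier. Which sentence representation pairs best with which aggregation method is the empirical question the abstract raises.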
Pages: 531-538
Number of pages: 8
Related papers
50 in total
  • [21] Distantly supervised biomedical relation extraction using piecewise attentive convolutional neural network and reinforcement learning
    Zhu, Tiantian
    Qin, Yang
    Xiang, Yang
    Hu, Baotian
    Chen, Qingcai
    Peng, Weihua
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2021, 28 (12) : 2571 - 2581
  • [22] Research on the Entity Relation Extraction of Field based on Semi-Supervised
    Guo, Jianyi
    Zhao, Jun
    Yu, Zhengtao
    Su, Lei
    Xian, Yantuan
    Tian, Wei
    ADVANCED RESEARCH ON AUTOMATION, COMMUNICATION, ARCHITECTONICS AND MATERIALS, PTS 1 AND 2, 2011, 225-226 (1-2) : 1292 - 1300
  • [23] Semi-supervised Entity Relation Extraction Based on Trigger Word
    Tai, Liting
    Guo, Fenzhuo
    Qin, Sujuan
    PROCEEDINGS OF 2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2017 : 497 - 501
  • [24] A neural joint model for entity and relation extraction from biomedical text
    Li, Fei
    Zhang, Meishan
    Fu, Guohong
    Ji, Donghong
    BMC BIOINFORMATICS, 2017, 18
  • [25] RETRACTED: Utilizing Entity-Based Gated Convolution and Multilevel Sentence Attention to Improve Distantly Supervised Relation Extraction (Retracted Article)
    Yi, Qian
    Zhang, Guixuan
    Zhang, Shuwu
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2021, 2021
  • [26] Entity Fusion Contrastive Inference Network for Biomedical Document Relation Extraction
    Cai, Huixian
    Yuan, Jianyuan
    Sang, Guoming
    Liu, Zhi
    Lin, Hongfei
    Zhang, Yijia
    HEALTH INFORMATION PROCESSING, CHIP 2023, 2023, 1993 : 145 - 163
  • [27] A Data-driven Approach for Noise Reduction in Distantly Supervised Biomedical Relation Extraction
    Amin, Saadullah
    Dunfield, Katherine Ann
    Vechkaeva, Anna
    Neumann, Guenter
    19TH SIGBIOMED WORKSHOP ON BIOMEDICAL LANGUAGE PROCESSING (BIONLP 2020), 2020 : 187 - 194
  • [28] A neural joint model for entity and relation extraction from biomedical text
    Fei Li
    Meishan Zhang
    Guohong Fu
    Donghong Ji
    BMC Bioinformatics, 18
  • [29] Research on Pattern Representation and Reliability in Semi-Supervised Entity Relation Extraction
    Ye, Feiyue
    Tang, Nan
    ADVANCES IN SWARM INTELLIGENCE, ICSI 2016, PT II, 2016, 9713 : 289 - 297
  • [30] Using Dilated Residual Network to Model Distantly Supervised Relation Extraction
    Zhan, Lei
    Yang, Yan
    Zhu, Pinpin
    He, Liang
    Yu, Zhou
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2019, 11448 : 500 - 504