Large-scale entity representation learning for biomedical relationship extraction

被引:7
|
作者
Saenger, Mario [1 ]
Leser, Ulf [1 ]
机构
[1] Humboldt Univ, Comp Sci Dept, Knowledge Management Bioinformat, D-10099 Berlin, Germany
关键词
TOOL;
D O I
10.1093/bioinformatics/btaa674
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: The automatic extraction of published relationships between molecular entities has important applications in many biomedical fields, ranging from Systems Biology to Personalized Medicine. Existing works focused on extracting relationships described in single articles or in single sentences. However, a single record is rarely sufficient to judge upon the biological correctness of a relation, as experimental evidence might be weak or only valid in a certain context. Furthermore, statements may be more speculative than confirmative, and different articles often contradict each other. Experts therefore always take the complete literature into account to take a reliable decision upon a relationship. It is an open research question how to do this effectively in an automatic manner. Results: We propose two novel relation extraction approaches which use recent representation learning techniques to create comprehensive models of biomedical entities or entity-pairs, respectively. These representations are learned by considering all publications from PubMed mentioning an entity or a pair. They are used as input for a neural network for classifying relations globally, i.e. the derived predictions are corpus-based, not sentence- or article based as in prior art. Experiments on the extraction of mutation-disease, drug-disease and drug-drug relationships show that the learned embeddings indeed capture semantic information of the entities under study and outperform traditional methods by 4-29% regarding F1 score.
引用
收藏
页码:236 / 242
页数:7
相关论文
共 50 条
  • [1] Learning entity-oriented representation for biomedical relation extraction
    Hu, Ying
    Chen, Yanping
    Qin, Yongbin
    Huang, Ruizhang
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2023, 147
  • [2] Representation Learning of Large-Scale Knowledge Graphs via Entity Feature Combinations
    Tan, Zhen
    Zhao, Xiang
    Wang, Wei
    [J]. CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, : 1777 - 1786
  • [3] Large-scale neural biomedical entity linking with layer overwriting
    Tsujimura, Tomoki
    Miwa, Makoto
    Sasaki, Yutaka
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2023, 143
  • [4] Active Learning for Large-Scale Entity Resolution
    Qian, Kun
    Popa, Lucian
    Sen, Prithviraj
    [J]. CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, : 1379 - 1388
  • [5] Large-scale Graph Representation Learning
    Leskovec, Jure
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 4 - 4
  • [6] Large-Scale Entity Extraction from Enterprise Data
    Gupta, Rajeev
    Kondapally, Ranganath
    [J]. SECOND INTERNATIONAL CONFERENCE ON AIML SYSTEMS 2022, 2022,
  • [7] Large-scale Entity Extraction and Probabilistic Record Linkage
    Villanustre, Flavio
    [J]. PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON COLLABORATION TECHNOLOGIES AND SYSTEMS (CTS), 2014, : 85 - 93
  • [8] BioRel: A Large-Scale Dataset for Biomedical Relation Extraction
    Xing, Rui
    Luo, Jie
    Song, Tengwei
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2019, : 1801 - 1808
  • [9] BioRel: towards large-scale biomedical relation extraction
    Xing, Rui
    Luo, Jie
    Song, Tengwei
    [J]. BMC BIOINFORMATICS, 2020, 21 (Suppl 16)
  • [10] BioRel: towards large-scale biomedical relation extraction
    Rui Xing
    Jie Luo
    Tengwei Song
    [J]. BMC Bioinformatics, 21