Community knowledge graph abstraction for enhanced link prediction: A study on PubMed knowledge graph

Cited by: 0
Authors
Zhao, Yang [1]
Bollegala, Danushka [2]
Hirose, Shunsuke [1]
Jin, Yingzi [1]
Kozu, Tomotake [1]
Affiliations
[1] Deloitte Touche Tohmatsu LLC, Deloitte Analyt R&D, 3-2-3 Marunouchi, Chiyoda Ku, Tokyo 1008360, Japan
[2] Univ Liverpool, Dept Comp Sci, Liverpool L69 3BX, England
Keywords
PKG; CKG; KGE; Entity distance-based method; Link prediction; Backtracking process;
DOI
10.1016/j.jbi.2024.104725
Chinese Library Classification (CLC) number
TP39 [Computer applications];
Discipline classification code
081203; 0835;
Abstract
Objective: New biomedical knowledge is produced so quickly that existing biomedical Knowledge Graphs (KGs) cannot be manually updated in a timely manner. Previous work in Natural Language Processing (NLP) has leveraged link prediction to infer missing knowledge in general-purpose KGs. Inspired by this, we propose applying link prediction to existing biomedical KGs to infer missing knowledge. Although Knowledge Graph Embedding (KGE) methods are effective in link prediction tasks, they are less capable of capturing relations between communities of entities with specific attributes (Fanourakis et al., 2023).
Methods: To address this challenge, we propose an entity distance-based method for abstracting a Community Knowledge Graph (CKG) from a simplified version of the pre-existing PubMed Knowledge Graph (PKG) (Xu et al., 2020). For link prediction on the abstracted CKG, we propose an extension of existing KGE models that links the information in the PKG to the abstracted CKG. The applicability of this extension was demonstrated with six well-known KGE models: TransE, TransH, DistMult, ComplEx, SimplE, and RotatE. Link prediction performance was assessed with Mean Rank (MR), Mean Reciprocal Rank (MRR), and Hits@k. In addition, we present a backtracking process that traces the results of CKG link prediction back to the PKG scale for further comparison.
Results: Six different CKGs were abstracted from the PKG using the embeddings of the six KGE methods. Link prediction results on these abstracted CKGs indicate that the proposed extension improves the existing KGE methods, achieving a top-10 accuracy of 0.69 compared to 0.5 for TransE, 0.7 compared to 0.54 for TransH, 0.67 compared to 0.6 for DistMult, 0.73 compared to 0.57 for ComplEx, 0.73 compared to 0.63 for SimplE, and 0.85 compared to 0.76 for RotatE on their respective CKGs. These improvements also highlight the wide applicability of the extension approach.
Conclusion: This study offers novel insights into abstracting CKGs from the PKG. The extension approach enhanced the performance of the existing KGE methods and is broadly applicable. As future work, we plan to conduct link prediction for entities that are newly introduced to the PKG.
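The abstract reports Mean Rank (MR), Mean Reciprocal Rank (MRR), and Hits@k. As a minimal sketch of how these rank-based metrics are conventionally computed, the snippet below assumes each test triple has already been assigned the 1-based rank of its correct entity among all candidates. The function name `rank_metrics` and the example ranks are hypothetical and are not taken from the paper, and the paper's exact ranking protocol (for example, filtered versus raw ranking) is not stated in the abstract.

```python
from typing import Dict, Sequence


def rank_metrics(ranks: Sequence[int], ks: Sequence[int] = (1, 3, 10)) -> Dict[str, float]:
    """Compute MR, MRR, and Hits@k from 1-based ranks of the correct entities.

    `ranks` is assumed to hold one rank per test triple (1 = best).
    """
    if not ranks:
        raise ValueError("ranks must not be empty")
    if any(r < 1 for r in ranks):
        raise ValueError("ranks must be 1-based positive integers")

    n = len(ranks)
    metrics = {
        # Mean Rank: average position of the correct entity (lower is better).
        "MR": sum(ranks) / n,
        # Mean Reciprocal Rank: average of 1/rank (higher is better).
        "MRR": sum(1.0 / r for r in ranks) / n,
    }
    for k in ks:
        # Hits@k: fraction of test triples whose correct entity is in the top k.
        metrics[f"Hits@{k}"] = sum(1 for r in ranks if r <= k) / n
    return metrics


if __name__ == "__main__":
    # Hypothetical ranks for five test triples, for illustration only.
    example_ranks = [1, 2, 4, 10, 50]
    for name, value in rank_metrics(example_ranks).items():
        print(f"{name}: {value:.3f}")
```

Under this convention, the "top-10 accuracy" figures quoted in the Results correspond to Hits@10: the share of test triples for which the correct entity is ranked among the first ten candidates.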
Pages: 11
Related papers
50 in total
  • [1] Building a PubMed knowledge graph
    Jian Xu
    Sunkyu Kim
    Min Song
    Minbyul Jeong
    Donghyeon Kim
    Jaewoo Kang
    Justin F. Rousseau
    Xin Li
    Weijia Xu
    Vetle I. Torvik
    Yi Bu
    Chongyan Chen
    Islam Akef Ebeid
    Daifeng Li
    Ying Ding
    Scientific Data, 7
  • [2] Building a PubMed knowledge graph
    Xu, Jian
    Kim, Sunkyu
    Song, Min
    Jeong, Minbyul
    Kim, Donghyeon
    Kang, Jaewoo
    Rousseau, Justin F.
    Li, Xin
    Xu, Weijia
    Torvik, Vetle I.
    Bu, Yi
    Chen, Chongyan
    Ebeid, Islam Akef
    Li, Daifeng
    Ding, Ying
    SCIENTIFIC DATA, 2020, 7 (01)
  • [3] Community Enhanced Knowledge Graph for Recommendation
    He, Zhen-Yu
    Wang, Chang-Dong
    Wang, Jinfeng
    Lai, Jian-Huang
    Tang, Yong
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (05) : 5789 - 5802
  • [4] Granular concept-enhanced relational graph convolution networks for link prediction in knowledge graph
    Dai, Yuhao
    Yan, Mengyu
    Li, Jinhai
    INFORMATION SCIENCES, 2025, 694
  • [5] A Knowledge Selective Adversarial Network for Link Prediction in Knowledge Graph
    Hu, Kairong
    Liu, Hai
    Hao, Tianyong
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING (NLPCC 2019), PT I, 2019, 11838 : 171 - 183
  • [6] Geography-Enhanced Link Prediction Framework for Knowledge Graph Completion
    Wang, Yashen
    Zhang, Huanhuan
    Xie, Haiyong
    KNOWLEDGE GRAPH AND SEMANTIC COMPUTING: KNOWLEDGE COMPUTING AND LANGUAGE UNDERSTANDING, 2019, 1134 : 198 - 210
  • [7] Fuzzy Search of Knowledge Graph with Link Prediction
    Ugai, Takanori
    PROCEEDINGS OF THE 10TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE GRAPHS (IJCKG 2021), 2021 : 121 - 125
  • [8] A Survey on Knowledge Graph Embeddings for Link Prediction
    Wang, Meihong
    Qiu, Linling
    Wang, Xiaoli
    SYMMETRY-BASEL, 2021, 13 (03)
  • [9] Link prediction of the knowledge graph in the CTD database
    Jeon, J.
    Woo, G.
    Kim, K.
    Cho, S.
    Shin, W.
    Kim, D.
    Choi, J.
    TOXICOLOGY LETTERS, 2024, 399 : S140 - S140
  • [10] Numerical Knowledge Representation Learning and Link Prediction over Knowledge Graph
    Huang, Zhen
    Qiu, Xue
    Liu, Yu
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT XIII, ICIC 2024, 2024, 14874 : 371 - 378