Anytime bottom-up rule learning for large-scale knowledge graph completion

Cited by: 1
Authors
Meilicke, Christian [1]
Chekol, Melisachew Wudage [2]
Betz, Patrick [1]
Fink, Manuel [1]
Stuckenschmidt, Heiner [1]
Affiliations
[1] Univ Mannheim, Mannheim, Germany
[2] Univ Utrecht, Utrecht, Netherlands
Source
VLDB JOURNAL | 2024 / Vol. 33 / Issue 01
Keywords
Knowledge graph completion; Link prediction; Rule learning
DOI
10.1007/s00778-023-00800-5
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Knowledge graph completion is the task of predicting correct facts that can be expressed by the vocabulary of a given knowledge graph but are not explicitly stated in that graph. Broadly, there are two main approaches for solving the knowledge graph completion problem. Sub-symbolic approaches embed the nodes and/or edges of a given graph into a low-dimensional vector space and use a scoring function to determine the plausibility of a given fact. Symbolic approaches learn a model that remains within the primary representation of the given knowledge graph. Rule-based approaches are well-known examples. One such approach is AnyBURL. It works by sampling random paths, which are generalized into Horn rules. Previously published results show that the prediction quality of AnyBURL is close to the current state of the art, with the additional benefit of offering an explanation for a predicted fact. In this paper, we propose several improvements and extensions of AnyBURL. In particular, we focus on AnyBURL's capability to be successfully applied to large and very large datasets. Overall, we propose four separate extensions: (i) We add to each rule a set of pairwise inequality constraints which enforces that different variables cannot be grounded by the same entities, which results in more appropriate confidence estimations. (ii) We introduce reinforcement learning to guide path sampling in order to use available computational resources more efficiently. (iii) We propose an efficient sampling strategy to approximate the confidence of a rule instead of computing its exact value. (iv) We develop a new multithreaded AnyBURL, which incorporates all previously mentioned modifications. In an experimental study, we show that our approach outperforms both symbolic and sub-symbolic approaches in large-scale knowledge graph completion. It has a higher prediction quality and requires significantly less time and computational resources.
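Illustrative sketch (not taken from the paper): extensions (i) and (iii) can be pictured on a toy graph. The triples, the rule colleague(X, Y) <- works_at(X, A), works_at(Y, A), and every function name below are invented for illustration. Confidence is assumed here to be the fraction of body groundings whose head is in the graph, and the sampler is plain Monte Carlo, which may differ from the definition and the sampling strategy actually used in AnyBURL.

```python
import random
from collections import defaultdict

# Toy knowledge graph as (subject, relation, object) triples; all data is invented.
TRIPLES = {
    ("ann", "works_at", "acme"),
    ("bob", "works_at", "acme"),
    ("cat", "works_at", "acme"),
    ("dan", "works_at", "initech"),
    ("eve", "works_at", "initech"),
    ("ann", "colleague", "bob"),
    ("bob", "colleague", "ann"),
    ("ann", "colleague", "cat"),
    ("dan", "colleague", "eve"),
}

def staff_by_employer(triples):
    """Index works_at facts: employer -> list of employees (sorted for determinism)."""
    index = defaultdict(list)
    for s, r, o in sorted(triples):
        if r == "works_at":
            index[o].append(s)
    return index

def exact_confidence(triples, distinct):
    """Enumerate every body grounding (X, Y, A) and count how many heads hold.

    distinct=True applies the inequality constraint X != Y.
    """
    hits = total = 0
    for staff in staff_by_employer(triples).values():
        for x in staff:
            for y in staff:
                if distinct and x == y:
                    continue
                total += 1
                hits += (x, "colleague", y) in triples
    return hits / total

def sampled_confidence(triples, distinct, n_samples=20000, seed=0):
    """Estimate the same ratio from uniformly sampled body groundings."""
    rng = random.Random(seed)
    groups = list(staff_by_employer(triples).values())
    # Weight each employer by its number of groundings so samples are uniform.
    weights = [len(g) * (len(g) - 1 if distinct else len(g)) for g in groups]
    hits = 0
    for _ in range(n_samples):
        staff = rng.choices(groups, weights=weights)[0]
        if distinct:
            x, y = rng.sample(staff, 2)  # ordered pair with x != y
        else:
            x, y = rng.choice(staff), rng.choice(staff)
        hits += (x, "colleague", y) in triples
    return hits / n_samples

if __name__ == "__main__":
    for distinct in (False, True):
        print(distinct, exact_confidence(TRIPLES, distinct),
              sampled_confidence(TRIPLES, distinct))
```

On this toy graph the exact confidence is 4/13 without the constraint and 4/8 with it: groundings with X = Y can never yield a correct colleague fact, so excluding them removes a systematic underestimate. The sampled value approaches the exact one as n_samples grows, without enumerating all groundings.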
Pages: 131-161
Page count: 31