Anytime bottom-up rule learning for large-scale knowledge graph completion

被引：1

作者：

Meilicke, Christian ^{[1
]}

Chekol, Melisachew Wudage ^{[2
]}

Betz, Patrick ^{[1
]}

Fink, Manuel ^{[1
]}

Stuckeschmidt, Heiner ^{[1
]}

机构：

[1] Univ Mannheim, Mannheim, Germany

[2] Univ Utrecht, Utrecht, Netherlands

来源：

VLDB JOURNAL | 2024年 / 33卷 / 01期

关键词：

Knowledge graph completion; Link prediction; Rule learning;

D O I：

10.1007/s00778-023-00800-5

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Knowledge graph completion is the task of predicting correct facts that can be expressed by the vocabulary of a given knowledge graph, which are not explicitly stated in that graph. Broadly, there are two main approaches for solving the knowledge graph completion problem. Sub-symbolic approaches embed the nodes and/or edges of a given graph into a low-dimensional vector space and use a scoring function to determine the plausibility of a given fact. Symbolic approaches learn a model that remains within the primary representation of the given knowledge graph. Rule-based approaches are well-known examples. One such approach is AnyBURL. It works by sampling random paths, which are generalized into Horn rules. Previously published results show that the prediction quality of AnyBURL is close to current state of the art with the additional benefit of offering an explanation for a predicted fact. In this paper, we propose several improvements and extensions of AnyBURL. In particular, we focus on AnyBURL's capability to be successfully applied to large and very large datasets. Overall, we propose four separate extensions: (i) We add to each rule a set of pairwise inequality constraints which enforces that different variables cannot be grounded by the same entities, which results into more appropriate confidence estimations. (ii) We introduce reinforcement learning to guide path sampling in order to use available computational resources more efficiently. (iii) We propose an efficient sampling strategy to approximate the confidence of a rule instead of computing its exact value. (iv) We develop a new multithreaded AnyBURL, which incorporates all previously mentioned modifications. In an experimental study, we show that our approach outperforms both symbolic and sub-symbolic approaches in large-scale knowledge graph completion. It has a higher prediction quality and requires significantly less time and computational resources.

引用

下载

页码：131 / 161

页数：31

共 50 条

[21] Including bottom-up emission abatement technologies in a large-scale global economic model for policy assessments
Weitzel, Matthias
Saveyn, Bert
Vandyck, Toon
ENERGY ECONOMICS, 2019, 83 : 254 - 263
[22] Zooplankton and forage fish species off Peru: Large-scale bottom-up forcing and local-scale depletion
Ayon, Patricia
Swartzman, Gordon
Bertrand, Arnaud
Gutierrez, Mariano
Bertrand, Sophie
PROGRESS IN OCEANOGRAPHY, 2008, 79 (2-4) : 208 - 214
[23] Large-scale knowledge graph representations of disease processes
Hoch, Matti
Gupta, Shailendra
Wolkenhauer, Olaf
CURRENT OPINION IN SYSTEMS BIOLOGY, 2024, 38
[24] Leveraging Semantics for Large-Scale Knowledge Graph Evaluation
Rashid, Sabbir M.
Viswanathan, Amar
Gross, Ian
Kendall, Elisa
McGuinness, Deborah L.
PROCEEDINGS OF THE 2017 ACM WEB SCIENCE CONFERENCE (WEBSCI '17), 2017, : 437 - 442
[25] A New Graph-Partitioning Algorithm for Large-Scale Knowledge Graph
Zhong, Jiang
Wang, Chen
Li, Qi
Li, Qing
ADVANCED DATA MINING AND APPLICATIONS, ADMA 2018, 2018, 11323 : 434 - 444
[26] Large-Scale, Bottom-Up Synthesis of Binary Metal-Organic Framework Nanosheets for Efficient Water Oxidation
Li, Fei-Long
Wang, Pengtang
Huang, Xiaoqing
Young, David James
Wang, Hui-Fang
Braunstein, Pierre
Lang, Jian-Ping
ANGEWANDTE CHEMIE-INTERNATIONAL EDITION, 2019, 58 (21) : 7051 - 7056
[27] Bottom-Up Electrodeposition of Large-Scale Nanotwinned Copper within 3D Through Silicon Via
Sun, Fu-Long
Liu, Zhi-Quan
Li, Cai-Fu
Zhu, Qing-Sheng
Zhang, Hao
Suganuma, Katsuaki
MATERIALS, 2018, 11 (02):
[28] Ursgal, Universal Python']Python Module Combining Common Bottom-Up Proteomics Tools for Large-Scale Analysis
Kremer, Lukas P. M.
Leufken, Johannes
Oyunchimeg, Purevdulam
Schulze, Stefan
Fufezan, Christian
JOURNAL OF PROTEOME RESEARCH, 2016, 15 (03) : 788 - 794
[29] AnySCAN: An Efficient Anytime Framework with Active Learning for Large-scale Network Clustering
Zhao, Weizhong
Chen, Gang
Xu, Xiaowei
2017 17TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2017, : 665 - 674
[30] A bottom-up model of skill learning
Sun, R
Merrill, E
Peterson, T
PROCEEDINGS OF THE TWENTIETH ANNUAL CONFERENCE OF THE COGNITIVE SCIENCE SOCIETY, 1998, : 1037 - 1042

← 1 2 3 4 5 →