Rule-based data augmentation for knowledge graph embedding

被引:3
|
作者
Li, Guangyao
Sun, Zequn
Qian, Lei [1 ,2 ]
Guo, Qiang [1 ,2 ]
Hu, Wei [1 ,2 ]
机构
[1] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing, Peoples R China
[2] State Key Lab Math Engn & Adv Comp, Wuxi, Peoples R China
来源
AI OPEN | 2021年 / 2卷
基金
中国国家自然科学基金;
关键词
Knowledge graph embedding; Data augmentation; Logical rules;
D O I
10.1016/j.aiopen.2021.09.003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Knowledge graph (KG) embedding models suffer from the incompleteness issue of observed facts. Different from existing solutions that incorporate additional information or employ expressive and complex embedding techniques, we propose to augment KGs by iteratively mining logical rules from the observed facts and then using the rules to generate new relational triples. We incrementally train KG embeddings with the coming of new augmented triples, and leverage the embeddings to validate these new triples. To guarantee the quality of the augmented data, we filter out the noisy triples based on a propagation mechanism during the validation. The mined rules and rule groundings are human -understandable, and can make the augmentation procedure reliable. Our KG augmentation framework is applicable to any KG embedding models with no need to modify their embedding techniques. Our experiments on two popular embedding -based tasks (i.e., entity alignment and link prediction) show that the proposed framework can bring significant improvement to existing KG embedding models on most benchmark datasets.
引用
收藏
页码:186 / 196
页数:11
相关论文
共 50 条
  • [41] A rule-based data warehouse model
    Favre, Cecile
    Bentayeb, Fadila
    Boussaid, Omar
    FLEXIBLE AND EFFICIENT INFORMATION HANDLING, 2006, 4042 : 274 - 277
  • [42] Rule-Based Conditioning of Probabilistic Data
    van Keulen, Maurice
    Kaminski, Benjamin L.
    Matheja, Christoph
    Katoen, Joost-Pieter
    SCALABLE UNCERTAINTY MANAGEMENT (SUM 2018), 2018, 11142 : 290 - 305
  • [43] Graph theory for rule-based modeling of biochemical networks
    Blinov, Michael L.
    Yang, Jin
    Faeder, James R.
    Hlavacek, William S.
    TRANSACTIONS ON COMPUTATIONAL SYSTEMS BIOLOGY VII, 2006, 4230 : 89 - 106
  • [44] A Rule-Based Approach to Embedding Techniques for Text Document Classification
    Aubaid, Asmaa M.
    Mishra, Alok
    APPLIED SCIENCES-BASEL, 2020, 10 (11):
  • [45] Knowledge Graph Reasoning Combining Rule Inference Patterns and Fact Embedding
    Shan, Xiaohuan
    Jiang, Jiantao
    Chen, Ze
    Song, Baoyan
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2024, 37 (10): : 923 - 935
  • [46] Combining RDF Graph Data and Embedding Models for an Augmented Knowledge Graph
    Nikolov, Andriy
    Haase, Peter
    Herzig, Daniel M.
    Trame, Johannes
    Kozlov, Artem
    COMPANION PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2018 (WWW 2018), 2018, : 977 - 980
  • [47] Domain Knowledge Graph Question Answering Based on Semantic Analysis and Data Augmentation
    Hu, Shulin
    Zhang, Huajun
    Zhang, Wanying
    APPLIED SCIENCES-BASEL, 2023, 13 (15):
  • [48] Knowledge Graph Embedding for Hyper-Relational Data
    Chunhong Zhang
    Miao Zhou
    Xiao Han
    Zheng Hu
    Yang Ji
    Tsinghua Science and Technology, 2017, 22 (02) : 185 - 197
  • [49] Data Poisoning Attack against Knowledge Graph Embedding
    Zhang, Hengtong
    Zheng, Tianhang
    Gao, Jing
    Miao, Chenglin
    Su, Lu
    Li, Yaliang
    Ren, Kui
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 4853 - 4859
  • [50] Knowledge Graph Embedding for Hyper-Relational Data
    Chunhong Zhang
    Miao Zhou
    Xiao Han
    Zheng Hu
    Yang Ji
    Tsinghua Science and Technology, 2017, (02) : 185 - 197