Estimation-based optimizations for the semantic compression of RDF knowledge bases

Cited by: 1
Authors
Wang, Ruoyu [1]
Wong, Raymond [1]
Sun, Daniel [2]
Affiliations
[1] Univ New South Wales, Sch Comp Sci & Engn, Sydney, NSW, Australia
[2] UGAiForge LLC, Canberra, ACT, Australia
Keywords
Knowledge bases; Semantic compression; Negative sampling; Statistical estimation; Optimization; Rule mining; SOCIAL QUESTION; GRAPH; RULES;
DOI
10.1016/j.ipm.2024.103799
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
Structured knowledge bases are critical for the interpretability of AI techniques. RDF KBs, which are the dominant representation of structured knowledge, are expanding extremely fast to increase their knowledge coverage, enhancing the capability of knowledge reasoning while bringing heavy burdens to downstream applications. Recent studies employ semantic compression to detect and remove knowledge redundancies via semantic models and use the induced model for further applications, such as knowledge completion and error detection. However, semantic models that are sufficiently expressive for semantic compression cannot be efficiently induced, especially for large-scale KBs, due to the hardness of logic induction. In this article, we present estimation-based optimizations for the semantic compression of RDF KBs from the perspectives of input and intermediate data involved in the induction of first-order logic rules. The negative sampling technique selects a representative subset of all negative tuples with respect to the closed-world assumption, reducing the cost of evaluating the quality of a logic rule used for knowledge inference. The number of logic inference operations used during a compression procedure is reduced by a statistical estimation technique that prunes logic rules of low quality. The evaluation results show that the two techniques are feasible for the purpose of semantic compression and accelerate the compression algorithm by up to 47x compared to the state-of-the-art system.
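The negative sampling idea in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: the uniform rejection sampling, the precision-style quality score, and the names `sample_negatives` and `estimate_rule_quality` are assumptions made for illustration only. Under the closed-world assumption, every entity pair absent from the KB counts as a negative tuple; the sketch scores a rule against a sample of those negatives and scales the result up, rather than enumerating all of them.

```python
import random


def sample_negatives(entities, positives, k, rng):
    """Draw up to k distinct entity pairs that are absent from `positives`.

    Under the closed-world assumption, any pair not recorded in the KB
    is treated as a negative tuple. Uniform rejection sampling is an
    assumption of this sketch, not the strategy used in the paper.
    `positives` is assumed to contain only pairs over `entities`.
    """
    total_neg = len(entities) ** 2 - len(positives)
    k = min(k, total_neg)
    negatives = set()
    while len(negatives) < k:
        pair = (rng.choice(entities), rng.choice(entities))
        if pair not in positives:
            negatives.add(pair)
    return negatives


def estimate_rule_quality(predicted, positives, sampled_neg, total_neg):
    """Estimate a precision-style score for a rule (hypothetical metric).

    `predicted` is the set of pairs the rule entails. Hits on the
    sampled negatives are scaled up to the full negative set.
    """
    pos_hits = len(predicted & positives)
    if sampled_neg:
        est_neg_hits = len(predicted & sampled_neg) * total_neg / len(sampled_neg)
    else:
        est_neg_hits = 0.0
    denom = pos_hits + est_neg_hits
    return pos_hits / denom if denom else 0.0
```

A smaller `k` lowers the cost of each rule evaluation at the price of a noisier estimate; when `k` covers all negatives, the score equals the exact value.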
Pages: 20
Related papers
50 in total
  • [1] EvoPat - Pattern-Based Evolution and Refactoring of RDF Knowledge Bases
    Riess, Christoph
    Heino, Norman
    Tramp, Sebastian
    Auer, Soeren
    SEMANTIC WEB-ISWC 2010, PT I, 2010, 6496 : 647 - 662
  • [2] RelFinder: Revealing Relationships in RDF Knowledge Bases
    Heim, Philipp
    Hellmann, Sebastian
    Lehmann, Jens
    Lohmann, Steffen
    Stegemann, Timo
    SEMANTIC MULTIMEDIA, PROCEEDINGS, 2009, 5887 : 182 - +
  • [3] On Computing Deltas of RDF/S Knowledge Bases
    Zeginis, Dimitris
    Tzitzikas, Yannis
    Christophides, Vassilis
    ACM TRANSACTIONS ON THE WEB, 2011, 5 (03)
  • [4] Interactive Exploration of Fuzzy RDF Knowledge Bases
    Manolis, Nikos
    Tzitzikas, Yannis
    SEMANTIC WEB: RESEARCH AND APPLICATIONS, PT I, 2011, 6643 : 1 - 16
  • [5] DistSim - Scalable Distributed in-Memory Semantic Similarity Estimation for RDF Knowledge Graphs
    Draschner, Carsten Felix
    Lehmann, Jens
    Jabeen, Hajira
    2021 IEEE 15TH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC 2021), 2021, : 333 - 336
  • [6] Estimation of FAQ knowledge bases by using semantic expressions for questions and answers
    Harada, Jun
    Fuketa, Masao
    Morita, Kazuhiro
    Sumitomo, Touru
    Hiraishi, Wataru
    Atlam, El-Sayed
    Aoe, Jun-Ichi
    INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2008, 32 (01) : 69 - 81
  • [7] Reasoning over RDF Knowledge Bases: Where We Are
    Colucci, Simona
    Donini, Francesco M.
    Di Sciascio, Eugenio
    AI*IA 2017 ADVANCES IN ARTIFICIAL INTELLIGENCE, 2017, 10640 : 243 - 255
  • [8] Mapping RDF knowledge bases using exchange samples
    Rivero, Carlos R.
    Hernandez, Inma
    Ruiz, David
    Corchuelo, Rafael
    KNOWLEDGE-BASED SYSTEMS, 2016, 93 : 47 - 66
  • [9] A Learning-based Semantic Approximate Query over RDF Knowledge Graph
    Ge, Zhangpeng
    Wang, Yuxiang
    Yan, Haijiang
    Xu, Xiaoliang
    2018 SIXTH INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD), 2018, : 135 - 141
  • [10] Datalog Reasoning over Compressed RDF Knowledge Bases
    Hu, Pan
    Urbani, Jacopo
    Motik, Boris
    Horrocks, Ian
    PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 2065 - 2068