Estimation-based optimizations for the semantic compression of RDF knowledge bases

被引:1
|
作者
Wang, Ruoyu [1 ]
Wong, Raymond [1 ]
Sun, Daniel [2 ]
机构
[1] Univ New South Wales, Sch Comp Sci & Engn, Sydney, NSW, Australia
[2] UGAiForge LLC, Canberra, ACT, Australia
关键词
Knowledge bases; Semantic compression; Negative sampling; Statistical estimation; Optimization; Rule mining; SOCIAL QUESTION; GRAPH; RULES;
D O I
10.1016/j.ipm.2024.103799
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Structured knowledge bases are critical for the interpretability of AI techniques. RDF KBs, which are the dominant representation of structured knowledge, are expanding extremely fast to increase their knowledge coverage, enhancing the capability of knowledge reasoning while bringing heavy burdens to downstream applications. Recent studies employ semantic compression to detect and remove knowledge redundancies via semantic models and use the induced model for further applications, such as knowledge completion and error detection. However, semantic models that are sufficiently expressive for semantic compression cannot be efficiently induced, especially for large-scale KBs, due to the hardness of logic induction. In this article, we present estimation-based optimizations for the semantic compression of RDF KBs from the perspectives of input and intermediate data involved in the induction of first-order logic rules. The negative sampling technique selects a representative subset of all negative tuples with respect to the closed-world assumption, reducing the cost of evaluating the quality of a logic rule used for knowledge inference. The number of logic inference operations used during a compression procedure is reduced by a statistical estimation technique that prunes logic rules of low quality. The evaluation results show that the two techniques are feasible for the purpose of semantic compression and accelerate the compression algorithm by up to 47x compared to the state-of-the-art system.
引用
收藏
页数:20
相关论文
共 50 条
  • [31] Framework and Schema for Semantic Web Knowledge Bases
    McGlothlin, James P.
    PROCEEDINGS OF THE TWENTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-10), 2010, : 1992 - 1993
  • [32] SEMANTIC KNOWLEDGE BASES FOR FINANCIAL ACCOUNTING STANDARDS
    GANGOLLY, JS
    HEDLEY, TP
    WONG, CT
    EXPERT SYSTEMS WITH APPLICATIONS, 1991, 3 (01) : 117 - 128
  • [33] Towards Context-Semantic Knowledge Bases
    Goczyla, Krzysztof
    Waloszek, Aleksander
    Waloszek, Wojciech
    2012 FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (FEDCSIS), 2012, : 475 - 482
  • [34] Learning with Semantic Kernels for Clausal Knowledge Bases
    Fanizzi, Nicola
    d'Amato, Claudia
    FOUNDATIONS OF INTELLIGENT SYSTEMS, 2011, 6804 : 250 - 259
  • [35] Semantic web and knowledge bases in learning and teaching
    Oinonen, Kari
    SECOND INTERNATIONAL WORKSHOP ON SEMANTIC MEDIA ADAPTATION AND PERSONALIZATION, PROCEEDINGS, 2007, : 50 - 55
  • [36] Applying Grammar-Based Compression to RDF
    Roder, Michael
    Frerk, Philip
    Conrads, Felix
    Ngomo, Axel-Cyrille Ngonga
    SEMANTIC WEB, ESWC 2021, 2021, 12731 : 93 - 108
  • [37] Exploring Term Networks for Semantic Search over RDF Knowledge Graphs
    Marx, Edgard
    Hoeffner, Konrad
    Shekarpour, Saeedeh
    Ngomo, Axel-Cyrille Ngonga
    Lehmann, Jens
    Auer, Soeren
    METADATA AND SEMANTICS RESEARCH, MTSR 2016, 2016, 672 : 249 - 261
  • [38] A semantic framework for condition monitoring in Industry 4.0 based on evolving knowledge bases
    Giustozzi, Franco
    Saunier, Julien
    Zanni-Merk, Cecilia
    SEMANTIC WEB, 2024, 15 (02) : 583 - 611
  • [39] Predicate Invention Based RDF Data Compression
    Zhu, Man
    Wu, Weixin
    Pan, Jeff Z.
    Han, Jingyu
    Huang, Pengfei
    Liu, Qian
    SEMANTIC TECHNOLOGY (JIST 2018), 2018, 11341 : 153 - 161
  • [40] Graph Pattern Based RDF Data Compression
    Pan, Jeff Z.
    Gomez Perez, Jose Manuel
    Ren, Yuan
    Wu, Honghan
    Wang, Haofen
    Zhu, Man
    SEMANTIC TECHNOLOGY (JIST 2014), 2015, 8943 : 239 - 256