Summarization of Massive RDF Graphs Using Identifier Classification

被引:0
|
作者
dos Santos, Andre Fernandes [1 ]
Leal, Jose Paulo
机构
[1] Univ Porto, CRAGS, Porto, Portugal
关键词
knowledge graphs; graph summarization; namespaces; RDF;
D O I
10.1007/978-3-031-40960-8_8
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The size of massive knowledge graphs (KGs) and the lack of prior information regarding the schemas, ontologies and vocabularies they use frequently makes them hard to understand and visualize. Graph summarization techniques can help by abstracting details of the original graph to produce a reduced summary that can more easily be explored. Identifiers often carry latent information which could be used for classification of the entities they represent. Particularly, IRI namespaces can be used to classify RDF resources. Namespaces, used in some RDF serialization formats as a shortening mechanism for resource IRIs, have no role in the semantics of RDF. Nevertheless, there is often a hidden meaning behind the decision of grouping resources under a common prefix and assigning an alias to it. We improved on previous work on a namespace-based approach to KG summarization that classifies resources using their namespaces. Producing the summary graph is fast, light on computing resources and requires no previous domain knowledge. The summary graph can be used to analyze the namespace interdependencies of the original graph. We also present chilon, a tool for calculating namespace-based KG summaries. Namespaces are gathered from explicit declarations in the graph serialization, community contributions or resource IRI prefix analysis. We applied chilon to publicly available KGs, used it to generate interactive visualizations of the summaries, and discuss the results obtained.
引用
收藏
页码:89 / 103
页数:15
相关论文
共 50 条
  • [1] Parallel Quotient Summarization of RDF Graphs
    Guzewicz, Pawel
    Manolescu, Ioana
    [J]. PROCEEDINGS OF THE INTERNATIONAL WORKSHOP ON SEMANTIC BIG DATA (SBD 2019), 2019,
  • [2] Query-Oriented Summarization of RDF Graphs
    Cebiric, Sejla
    Goasdoue, Francois
    Manolescu, Ioana
    [J]. DATA SCIENCE, 2015, 9147 : 87 - 91
  • [3] Query-Oriented Summarization of RDF Graphs
    Cebiric, Sejla
    Goasdoue, Francois
    Manolescu, Ioana
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2015, 8 (12): : 2013 - 2016
  • [4] SSumM: Sparse Summarization of Massive Graphs
    Lee, Kyuhan
    Jo, Hyeonsoo
    Ko, Jihoon
    Lim, Sungsu
    Shin, Kijung
    [J]. KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 144 - 154
  • [5] SLUGGER: Lossless Hierarchical Summarization of Massive Graphs
    Lee, Kyuhan
    Ko, Jihoon
    Shin, Kijung
    [J]. 2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022), 2022, : 472 - 484
  • [6] Translating Sentences Using RDF Graphs
    Beniwal, Rohit
    Jain, Minni
    Aggarwal, Rishabh
    Rawat, Manish
    [J]. 2018 INTERNATIONAL CONFERENCE ON AUTOMATION AND COMPUTATIONAL ENGINEERING (ICACE), 2018, : 145 - 149
  • [7] RDF Digest: Efficient Summarization of RDF/S KBs
    Troullinou, Georgia
    Kondylakis, Haridimos
    Daskalaki, Evangelia
    Plexousakis, Dimitris
    [J]. SEMANTIC WEB: LATEST ADVANCES AND NEW DOMAINS, ESWC 2015, 2015, 9088 : 119 - 134
  • [8] Quality metrics for RDF graph summarization
    Zneika, Mussab
    Vodislav, Dan
    Kotzinos, Dimitris
    [J]. SEMANTIC WEB, 2019, 10 (03) : 555 - 584
  • [9] Signing RDF graphs
    Carroll, JJ
    [J]. SEMANTIC WEB - ISWC 2003, 2003, 2870 : 369 - 384
  • [10] Trust in RDF Graphs
    Tomaszuk, Dominik
    Pak, Karol
    Rybinski, Henryk
    [J]. ADVANCES IN DATABASES AND INFORMATION SYSTEMS, 2013, 186 : 273 - +