Graph Pattern Based RDF Data Compression

被引:13
|
作者
Pan, Jeff Z. [1 ]
Gomez Perez, Jose Manuel [2 ]
Ren, Yuan [1 ]
Wu, Honghan [1 ,3 ]
Wang, Haofen [4 ]
Zhu, Man [5 ]
机构
[1] Univ Aberdeen, Dept Comp Sci, Aberdeen, Scotland
[2] ISOCO, Barcelona, Spain
[3] Nanjing Univ Informat & Technol, Nanjing, Jiangsu, Peoples R China
[4] E China Univ Sci & Technol, Shanghai 200237, Peoples R China
[5] Southeast Univ, Sch Comp Sci, Nanjing, Jiangsu, Peoples R China
来源
关键词
D O I
10.1007/978-3-319-15615-6_18
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The growing volume of RDF documents and their inter-linking raise a challenge on the storage and transferring of such documents. One solution to this problem is to reduce the size of RDF documents via compression. Existing approaches either apply well-known generic compression technologies but seldom exploit the graph structure of RDF documents. Or, they focus on minimized compact serialisations leaving the graph nature inexplicit, which leads obstacles for further applying higher level compression techniques. In this paper we propose graph pattern based technologies, which on the one hand can reduce the numbers of triples in RDF documents and on the other hand can serialise RDF graph in a data pattern based way, which can deal with syntactic redundancies which are not eliminable to existing techniques. Evaluation on real world datasets shows that our approach can substantially reduce the size of RDF documents by complementing the abilities of existing approaches. Furthermore, the evaluation results on rule mining operations show the potentials of the proposed serialisation format in supporting efficient data access.
引用
收藏
页码:239 / 256
页数:18
相关论文
共 50 条
  • [41] Applying Grammar-Based Compression to RDF
    Roder, Michael
    Frerk, Philip
    Conrads, Felix
    Ngomo, Axel-Cyrille Ngonga
    SEMANTIC WEB, ESWC 2021, 2021, 12731 : 93 - 108
  • [42] RDF-star2Vec: RDF-star Graph Embeddings for Data Mining
    Egami, Shusaku
    Ugai, Takanori
    Oota, Masateru
    Matsushita, Kyoumoto
    Kawamura, Takahiro
    Kozaki, Kouji
    Fukuda, Ken
    IEEE ACCESS, 2023, 11 : 142030 - 142042
  • [43] Query of Marine Big Data Based on Graph Compression and Views
    Zhao, Danfeng
    Zhang, Yeyi
    Lin, Junchen
    Song, Wei
    Liotta, Antonio
    Huang, Dongmei
    2018 18TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2018, : 252 - 257
  • [44] Exploiting RDF Open Data Using NoSQL Graph Databases
    Bouhali, Raouf
    Laurent, Anne
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, 2015, 458 : 177 - 190
  • [45] A DISTRIBUTIONAL STRUCTURED SEMANTIC SPACE FOR QUERYING RDF GRAPH DATA
    Freitas, Andre
    Curry, Edward
    Gabriel Oliveira, Joao
    O'Riain, Sean
    INTERNATIONAL JOURNAL OF SEMANTIC COMPUTING, 2011, 5 (04) : 433 - 462
  • [46] Storing RDF as a graph
    Bönström, V
    Hinze, A
    Schweppe, H
    FIRST LATIN AMERICAN WEB CONGRESS, PROCEEDINGS, 2003, : 27 - 36
  • [47] RDF Graph Summarization Based on Node Characteristic and Centrality
    Guo, Jimao
    Wang, Yi
    JOURNAL OF WEB ENGINEERING, 2022, 21 (07): : 2073 - 2094
  • [48] Effective Dictionary Based Data Compression and Pattern Searching in Dictionary Based Compressed Data
    Jain, Pooja
    Jain, Anurag
    Agrawal, Chetan
    2013 FOURTH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATIONS AND NETWORKING TECHNOLOGIES (ICCCNT), 2013,
  • [49] Power quality data compression based on pattern similarity measurement
    Huang, Nantian
    Xu, Dianguo
    Liu, Xiaosheng
    Lin, Lin
    Diangong Jishu Xuebao/Transactions of China Electrotechnical Society, 2011, 26 (10): : 39 - 46
  • [50] A suboptimal lossy data compression based on approximate pattern matching
    Luczak, T
    Szpankowski, W
    IEEE TRANSACTIONS ON INFORMATION THEORY, 1997, 43 (05) : 1439 - 1451