Graph Pattern Based RDF Data Compression

被引:13
|
作者
Pan, Jeff Z. [1 ]
Gomez Perez, Jose Manuel [2 ]
Ren, Yuan [1 ]
Wu, Honghan [1 ,3 ]
Wang, Haofen [4 ]
Zhu, Man [5 ]
机构
[1] Univ Aberdeen, Dept Comp Sci, Aberdeen, Scotland
[2] ISOCO, Barcelona, Spain
[3] Nanjing Univ Informat & Technol, Nanjing, Jiangsu, Peoples R China
[4] E China Univ Sci & Technol, Shanghai 200237, Peoples R China
[5] Southeast Univ, Sch Comp Sci, Nanjing, Jiangsu, Peoples R China
来源
关键词
D O I
10.1007/978-3-319-15615-6_18
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The growing volume of RDF documents and their inter-linking raise a challenge on the storage and transferring of such documents. One solution to this problem is to reduce the size of RDF documents via compression. Existing approaches either apply well-known generic compression technologies but seldom exploit the graph structure of RDF documents. Or, they focus on minimized compact serialisations leaving the graph nature inexplicit, which leads obstacles for further applying higher level compression techniques. In this paper we propose graph pattern based technologies, which on the one hand can reduce the numbers of triples in RDF documents and on the other hand can serialise RDF graph in a data pattern based way, which can deal with syntactic redundancies which are not eliminable to existing techniques. Evaluation on real world datasets shows that our approach can substantially reduce the size of RDF documents by complementing the abilities of existing approaches. Furthermore, the evaluation results on rule mining operations show the potentials of the proposed serialisation format in supporting efficient data access.
引用
下载
收藏
页码:239 / 256
页数:18
相关论文
共 50 条
  • [1] Graph-based Large Scale RDF Data Compression
    Zhang, Wei Emma
    SIGIR'14: PROCEEDINGS OF THE 37TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2014, : 1276 - 1276
  • [2] Graph-Based RDF Data Management
    Zou L.
    Özsu M.T.
    Data Science and Engineering, 2017, 2 (1) : 56 - 70
  • [3] Predicate Invention Based RDF Data Compression
    Zhu, Man
    Wu, Weixin
    Pan, Jeff Z.
    Han, Jingyu
    Huang, Pengfei
    Liu, Qian
    SEMANTIC TECHNOLOGY (JIST 2018), 2018, 11341 : 153 - 161
  • [4] Provenance compression scheme based on graph patterns for large RDF documents
    Bok, Kyoungsoo
    Han, Jieun
    Lim, Jongtae
    Yoo, Jaesoo
    JOURNAL OF SUPERCOMPUTING, 2020, 76 (08): : 6376 - 6398
  • [5] Provenance compression scheme based on graph patterns for large RDF documents
    Kyoungsoo Bok
    Jieun Han
    Jongtae Lim
    Jaesoo Yoo
    The Journal of Supercomputing, 2020, 76 : 6376 - 6398
  • [6] Pattern-Based Keyword Search on RDF Data
    Ouksili, Hanane
    Kedad, Zoubida
    Lopes, Stephane
    Nugier, Sylvaine
    SEMANTIC WEB, ESWC 2016, 2016, 9989 : 30 - 34
  • [7] Pattern Matching Based Algorithms for Graph Compression
    Chatterjee, Amlan
    Shah, Rushabh Jitendrakumar
    Sen, Soumya
    2018 FOURTH IEEE INTERNATIONAL CONFERENCE ON RESEARCH IN COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS (ICRCICN), 2018, : 93 - 97
  • [8] Graph-based Indexing Method for Searching in RDF Data
    Kyu, Khin Myat
    Oo, Aung Nway
    2019 INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION TECHNOLOGIES (ICAIT), 2019, : 96 - 101
  • [9] Distributed subgraph query for RDF graph data based on MapReduce
    Su, Qianxiang
    Huang, Qingrong
    Wu, Nan
    Pan, Ying
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 102
  • [10] Horn-Rule Based Compression Technique for RDF Data
    Gayathri, V
    Kumar, P. Sreenivasa
    30TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, VOLS I AND II, 2015, : 396 - 401