Efficient Provenance Storage for RDF Dataset in Semantic Web Environment

被引:4
|
作者
Sharma, Kumar [1 ]
Marjit, Ujjal [2 ]
Biswas, Utpal [1 ]
机构
[1] Univ Kalyani, Dept Comp Sci & Engn, Kalyani, W Bengal, India
[2] Univ Kalyani, Ctr Informat Resource Management, Kalyani, W Bengal, India
关键词
provenance; provenance storage; rdf; linked data; semantic web;
D O I
10.1109/ICIT.2015.21
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Over the web, excessive growth of RDF (Resource Description Framework) data leads the data authentication issues such as trouble in trusting data and verifying data qualities. Oftentimes, data into the web are published along with provenance for measuring trust and quality. Provenance provides authentication information, like, who modified and published the data, from where the data are originated and what manipulations have been applied to the data. During the last decade, various approaches have been evolved for representing and storing provenance information of RDF datasets. Most of them have followed the annotation-based approach. The annotation-based approach suggests that the metadata should be salted away in separate documents to maintain a rich set of provenance. While doing this, it demands more space for storing coarse-grained and fine-grained provenance. Nevertheless, they do not consider the techniques for redundancy reduction for duplicate provenance values exist in the dataset, in particular, when provenance is defined at the instance level. As a consequence, the size of the provenance may outsize the actual data size itself. In such instances, there should be provisions for reducing the space occupied by the duplicate provenance values. In this article, an approach has been proposed to store provenance information efficiently based on inheritance techniques. The provenance is pre-computed during the generation of the RDF data. It is then stored in an optimized way while minimizing the storage area for repeated provenance values. The result is quite promising in the sense that, the storage size is reduced considerably, as compared to the normally computed provenance.
引用
收藏
页码:94 / 100
页数:7
相关论文
共 50 条
  • [1] RDF and the Semantic Web
    Anon
    [J]. Database and Network Journal, 2003, 33 (05):
  • [2] Hybrid storage scheme for RDF data management in Semantic Web
    Kim, Sung Wan
    [J]. Journal of Digital Information Management, 2006, 4 (01): : 32 - 36
  • [3] Benchmarking RDF schemas for the Semantic Web
    Magkanaraki, A
    Alexaki, S
    Christophides, V
    Plexousakis, D
    [J]. SEMANTIC WEB - ISWC 2002, 2002, 2342 : 132 - 146
  • [4] Web queries in Protoform and RDF semantic
    Tseng, C
    Ng, P
    [J]. Proceedings of the 8th Joint Conference on Information Sciences, Vols 1-3, 2005, : 1437 - 1440
  • [5] The semantic Web:: The roles of XML and RDF
    Decker, S
    Melnik, S
    Van Harmelen, F
    Fensel, D
    Klein, M
    Broekstra, J
    Erdmann, M
    Horrocks, I
    [J]. IEEE INTERNET COMPUTING, 2000, 4 (05) : 63 - 74
  • [6] TOWARDS AN EFFICIENT RDF DATASET SLICING
    Marx, Edgard
    Soru, Tommaso
    Shekarpour, Saeedeh
    Auer, Soren
    Ngomo, Axel-Cyrille Ngonga
    Breitman, Karin
    [J]. INTERNATIONAL JOURNAL OF SEMANTIC COMPUTING, 2013, 7 (04) : 455 - 477
  • [7] Framework for the semantic Web:: An RDF tutorial
    Decker, S
    Mitra, P
    Melnik, S
    [J]. IEEE INTERNET COMPUTING, 2000, 4 (06) : 68 - 73
  • [8] Translating Topic Maps to RDF/RDF Schema for The Semantic Web
    Shin, Shinae
    Jeong, Dongwon
    Baik, Doo-Kwon
    [J]. JOURNAL OF RESEARCH AND PRACTICE IN INFORMATION TECHNOLOGY, 2009, 41 (03): : 223 - 238
  • [9] Using provenance in the Semantic Web
    Gil, Yolanda
    Groth, Paul
    [J]. JOURNAL OF WEB SEMANTICS, 2011, 9 (02): : 147 - 148
  • [10] Analysis of RDF Syntaxes for Semantic Web Development
    Gryaznov, Yevgeny
    Rusakov, Pavel
    [J]. APPLIED COMPUTER SYSTEMS, 2015, 18 (01) : 33 - 42