Efficient Provenance Storage for RDF Dataset in Semantic Web Environment

被引：4

作者：

Sharma, Kumar ^{[1
]}

Marjit, Ujjal ^{[2
]}

Biswas, Utpal ^{[1
]}

机构：

[1] Univ Kalyani, Dept Comp Sci & Engn, Kalyani, W Bengal, India

[2] Univ Kalyani, Ctr Informat Resource Management, Kalyani, W Bengal, India

来源：

2015 14TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY (ICIT 2015) | 2015年

关键词：

provenance; provenance storage; rdf; linked data; semantic web;

D O I：

10.1109/ICIT.2015.21

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Over the web, excessive growth of RDF (Resource Description Framework) data leads the data authentication issues such as trouble in trusting data and verifying data qualities. Oftentimes, data into the web are published along with provenance for measuring trust and quality. Provenance provides authentication information, like, who modified and published the data, from where the data are originated and what manipulations have been applied to the data. During the last decade, various approaches have been evolved for representing and storing provenance information of RDF datasets. Most of them have followed the annotation-based approach. The annotation-based approach suggests that the metadata should be salted away in separate documents to maintain a rich set of provenance. While doing this, it demands more space for storing coarse-grained and fine-grained provenance. Nevertheless, they do not consider the techniques for redundancy reduction for duplicate provenance values exist in the dataset, in particular, when provenance is defined at the instance level. As a consequence, the size of the provenance may outsize the actual data size itself. In such instances, there should be provisions for reducing the space occupied by the duplicate provenance values. In this article, an approach has been proposed to store provenance information efficiently based on inheritance techniques. The provenance is pre-computed during the generation of the RDF data. It is then stored in an optimized way while minimizing the storage area for repeated provenance values. The result is quite promising in the sense that, the storage size is reduced considerably, as compared to the normally computed provenance.

引用

页码：94 / 100

页数：7

共 50 条

[1] RDF and the Semantic Web
Anon
[J]. Database and Network Journal, 2003, 33 (05):
[2] Hybrid storage scheme for RDF data management in Semantic Web
Kim, Sung Wan
[J]. Journal of Digital Information Management, 2006, 4 (01): : 32 - 36
[3] Benchmarking RDF schemas for the Semantic Web
Magkanaraki, A
Alexaki, S
Christophides, V
Plexousakis, D
[J]. SEMANTIC WEB - ISWC 2002, 2002, 2342 : 132 - 146
[4] Web queries in Protoform and RDF semantic
Tseng, C
Ng, P
[J]. Proceedings of the 8th Joint Conference on Information Sciences, Vols 1-3, 2005, : 1437 - 1440
[5] The semantic Web:: The roles of XML and RDF
Decker, S
Melnik, S
Van Harmelen, F
Fensel, D
Klein, M
Broekstra, J
Erdmann, M
Horrocks, I
[J]. IEEE INTERNET COMPUTING, 2000, 4 (05) : 63 - 74
[6] TOWARDS AN EFFICIENT RDF DATASET SLICING
Marx, Edgard
Soru, Tommaso
Shekarpour, Saeedeh
Auer, Soren
Ngomo, Axel-Cyrille Ngonga
Breitman, Karin
[J]. INTERNATIONAL JOURNAL OF SEMANTIC COMPUTING, 2013, 7 (04) : 455 - 477
[7] Framework for the semantic Web:: An RDF tutorial
Decker, S
Mitra, P
Melnik, S
[J]. IEEE INTERNET COMPUTING, 2000, 4 (06) : 68 - 73
[8] Translating Topic Maps to RDF/RDF Schema for The Semantic Web
Shin, Shinae
Jeong, Dongwon
Baik, Doo-Kwon
[J]. JOURNAL OF RESEARCH AND PRACTICE IN INFORMATION TECHNOLOGY, 2009, 41 (03): : 223 - 238
[9] Using provenance in the Semantic Web
Gil, Yolanda
Groth, Paul
[J]. JOURNAL OF WEB SEMANTICS, 2011, 9 (02): : 147 - 148
[10] Analysis of RDF Syntaxes for Semantic Web Development
Gryaznov, Yevgeny
Rusakov, Pavel
[J]. APPLIED COMPUTER SYSTEMS, 2015, 18 (01) : 33 - 42

← 1 2 3 4 5 →