MuSe: a multi-level storage scheme for big RDF data using MapReduce

被引:6
|
作者
Chawla, Tanvi [1 ]
Singh, Girdhari [1 ]
Pilli, Emmanuel S. [1 ]
机构
[1] Malaviya Natl Inst Technol, Dept Comp Sci & Engn, Jaipur, India
关键词
RDF; SPARQL; Hadoop; HDFS; MapReduce; Storage; BENCHMARK;
D O I
10.1186/s40537-021-00519-6
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Resource Description Framework (RDF) model owing to its flexible structure is increasingly being used to represent Linked data. The rise in amount of Linked data and Knowledge graphs has resulted in an increase in the volume of RDF data. RDF is used to model metadata especially for social media domains where the data is linked. With the plethora of RDF data sources available on the Web, scalable RDF data management becomes a tedious task. In this paper, we present MuSe-an efficient distributed RDF storage scheme for storing and querying RDF data with Hadoop MapReduce. In MuSe, the Big RDF data is stored at two levels for answering the common triple patterns in SPARQL queries. MuSe considers the type of frequently occuring triple patterns and optimizes RDF storage to answer such triple patterns in minimum time. It accesses only the tables that are sufficient for answering a triple pattern instead of scanning the whole RDF dataset. The extensive experiments on two synthetic RDF datasets i.e. LUBM and WatDiv, show that MuSe outperforms the compared state-of-the art frameworks in terms of query execution time and scalability.
引用
收藏
页数:26
相关论文
共 50 条
  • [1] MuSe: a multi-level storage scheme for big RDF data using MapReduce
    Tanvi Chawla
    Girdhari Singh
    Emmanuel S. Pilli
    [J]. Journal of Big Data, 8
  • [2] A Multi-level Access Control Scheme Based on Attribute Encryption for Big Data
    Li, Ruixia
    Peng, Wei
    [J]. 2019 4TH INTERNATIONAL CONFERENCE ON MECHANICAL, CONTROL AND COMPUTER ENGINEERING (ICMCCE 2019), 2019, : 694 - 698
  • [3] Distributed SPARQL over Big RDF Data A Comparative Analysis using Presto and MapReduce
    Mammo, Mulugeta
    Bansal, Srividya K.
    [J]. 2015 IEEE INTERNATIONAL CONGRESS ON BIG DATA - BIGDATA CONGRESS 2015, 2015, : 33 - 40
  • [4] Holographic storage of multi-level digital data
    Fang, LH
    Chen, M
    Gong, YQ
    Chen, XG
    Wang, Q
    [J]. ADVANCES IN OPTICAL DATA STORAGE TECHNOLOGY, 2005, 5643 : 334 - 341
  • [5] Implementation of Data Search in Multi-Level NAND Flash Memory by Complementary Storage Scheme
    Wang, Fei
    Feng, Yang
    Zhan, Xuepeng
    Chen, Bing
    Chen, Jiezhi
    [J]. IEEE ELECTRON DEVICE LETTERS, 2020, 41 (08) : 1189 - 1192
  • [6] Multi-level Ontological Model of Big Data Processing
    Bova, Victoria V.
    Kureichik, Vladimir V.
    Scheglov, Sergey N.
    Kureichik, Liliya V.
    [J]. PROCEEDINGS OF THE THIRD INTERNATIONAL SCIENTIFIC CONFERENCE INTELLIGENT INFORMATION TECHNOLOGIES FOR INDUSTRY (IITI'18), VOL 1, 2019, 874 : 171 - 181
  • [7] Multi-level metadata management scheme for cloud storage system
    [J]. Ko, Y. W. (yuko@hallym.ac.kr), 1600, Science and Engineering Research Support Society, 20 Virginia Court, Sandy Bay, Tasmania, Australia (09):
  • [8] Demodulation of Multi-Level Data Using Convolutional Neural Network in Holographic Data Storage
    Katano, Yutaro
    Muroi, Tetsuhiko
    Kinoshita, Nobuhiro
    Ishii, Norihiko
    [J]. 2018 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA), 2018, : 728 - 732
  • [9] Storage and Retrieval of Large RDF Graph Using Hadoop and MapReduce
    Husain, Mohammad Farhan
    Doshi, Pankil
    Khan, Latifur
    Thuraisingham, Bhavani
    [J]. CLOUD COMPUTING, PROCEEDINGS, 2009, 5931 : 680 - 686
  • [10] New Error Correction Scheme for Multi-level Optical Storage System
    Zhang Xiaotian
    Pei Jing
    Xu Haizheng
    [J]. PROCEEDINGS OF 2ND CONFERENCE ON LOGISTICS, INFORMATICS AND SERVICE SCIENCE (LISS 2012), VOLS 1 AND 2, 2013,