Efficient Parallel Processing of Analytical Queries on Linked Data

被引:0
|
作者
Hagedorn, Stefan [1 ]
Sattler, Kai-Uwe [1 ]
机构
[1] Ilmenau Univ Technol, Ilmenau, Germany
关键词
linked data; parallel query processing; micro benchmark; SEMANTIC WEB; JOIN;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Linked data has become one of the most successful movements of the Semantic Web community. RDF and SPARQL have been established as de-facto standards for representing and querying linked data and there exists quite a number of RDF stores and SPARQL engines that can be used to work with the data. However, for many types of queries on linked data these stores are not the best choice regarding query execution times. For example, users are interested in analytical tasks such as profiling or finding correlated entities in their datasets. In this paper we argue that currently available RDF stores are not optimal for such scan-intensive tasks. In order to address this issue, we discuss query evaluation techniques for linked data exploiting the features of modern hardware architectures such as big memory and multi-core processors. Particularly, we describe parallelization techniques as part of our CameLOD system. Furthermore, we compare our system with the well-known linked data stores Virtuoso and RDF-3X by running different analytical queries on the DBpedia dataset and show that we can outperform these systems significantly.
引用
收藏
页码:452 / 469
页数:18
相关论文
共 50 条
  • [21] Efficient execution of parallel aggregate data cube queries in data warehouse environments
    Tan, RBN
    Taniar, D
    Lu, G
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING, 2003, 2690 : 709 - 716
  • [22] Efficient Batch Processing for Multiple Keyword Queries on Graph Data
    Chen, Lu
    Liu, Chengfei
    Yang, Xiaochun
    Wang, Bin
    Li, Jianxin
    Zhou, Rui
    CIKM'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2016, : 1261 - 1270
  • [23] SPHLU: An Efficient Algorithm for Processing PRkNN Queries on Uncertain Data
    Wang Shengsheng
    Li Yang
    Chai Sheng
    Bolou, Bolou Dickson
    CHINESE JOURNAL OF ELECTRONICS, 2016, 25 (03) : 403 - 406
  • [24] EFFICIENT SECONDARY MEMORY PROCESSING OF WINDOW QUERIES ON SPATIAL DATA
    NARDELLI, E
    PROIETTI, G
    INFORMATION SCIENCES, 1995, 84 (1-2) : 67 - 83
  • [25] Parallel Processing of Dynamic Continuous Queries over Streaming Data Flows
    Deng, Ze
    Wu, Xiaoming
    Wang, Lizhen
    Chen, Xiaodao
    Ranjan, Rajiv
    Zomaya, Albert
    Chen, Dan
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2015, 26 (03) : 834 - 846
  • [26] SPHLU:An Efficient Algorithm for Processing PRkNN Queries on Uncertain Data
    WANG Shengsheng
    LI Yang
    CHAI Sheng
    BOLOU Bolou Dickson
    Chinese Journal of Electronics, 2016, 25 (03) : 403 - 406
  • [27] Efficient Processing of Skyline Group Queries over a Data Stream
    Xi Guo
    Hailing Li
    Aziguli Wulamu
    Yonghong Xie
    Yajing Fu
    Tsinghua Science and Technology, 2016, 21 (01) : 29 - 39
  • [28] Efficient Processing of Skyline Group Queries over a Data Stream
    Guo, Xi
    Li, Hailing
    Wulamu, Aziguli
    Xie, Yonghong
    Fu, Yajing
    TSINGHUA SCIENCE AND TECHNOLOGY, 2016, 21 (01) : 29 - 39
  • [29] Rewriting Complex SPARQL Analytical Queries for Efficient Cloud-based Processing
    Ravindra, Padmashree
    Kim, HyeongSik
    Anyanwu, Kemafor
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2015, : 32 - 37
  • [30] HPPQ: A Parallel Package Queries Processing Approach for Large-Scale Data
    Shi, Meihui
    Shen, Derong
    Nie, Tiezheng
    Kou, Yue
    Yu, Ge
    BIG DATA MINING AND ANALYTICS, 2018, 1 (02): : 146 - 159