Parallel SPARQL Query Optimization

被引:5
|
作者
Wu, Buwen [1 ,2 ]
Zhou, Yongluan [2 ]
Jin, Hai [1 ]
Deshpande, Amol [3 ]
机构
[1] Huazhong Univ Sci & Technol, Wuhan, Peoples R China
[2] Univ Southern Denmark, Odense, Denmark
[3] Univ Maryland, College Pk, MD 20742 USA
关键词
RDF;
D O I
10.1109/ICDE.2017.110
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Existing parallel SPARQL query optimizers assume hash-based data partitioning and adopt plan enumeration algorithms with unnecessarily high complexity. Therefore, they cannot easily accommodate other partitioning methods and only consider an unnecessarily limited plan space. To address these problems, we first define a generic RDF data partitioning model to capture the common structure of various state-of-the-art RDF data partitioning methods. Then we propose a query plan enumeration algorithm that not only has an optimal efficiency, but also accommodates different data partitioning methods. Furthermore, based on a solid analysis of the complexity of the plan enumeration algorithm, we propose two new heuristic methods that can consider a much larger plan space than the existing methods, and at the same time can still confine the search space of the algorithm. An autonomous approach is proposed to choose one of the two methods by considering the structure and the size of a complex SPARQL query. We conduct extensive experiments using synthetic and a real-world dataset, which show the superiority of our algorithms in comparing to existing ones.
引用
收藏
页码:547 / 558
页数:12
相关论文
共 50 条
  • [1] SPARQL Query Parallel Processing: A Survey
    Feng, Jiaying
    Meng, Chenhong
    Song, Jiaming
    Zhang, Xiaowang
    Feng, Zhiyong
    Zou, Lei
    [J]. 2017 IEEE 6TH INTERNATIONAL CONGRESS ON BIG DATA (BIGDATA CONGRESS 2017), 2017, : 444 - 451
  • [2] The SPARQL query graph model for query optimization
    Hartig, Olaf
    Heese, Ralf
    [J]. SEMANTIC WEB: RESEARCH AND APPLICATIONS, PROCEEDINGS, 2007, 4519 : 564 - +
  • [3] SPARQL Query Optimization on Top of DHTs
    Kaoudi, Zoi
    Kyzirakos, Kostis
    Koubarakis, Manolis
    [J]. SEMANTIC WEB-ISWC 2010, PT I, 2010, 6496 : 418 - 435
  • [4] SPARQL Multi-Query Optimization
    Chen, Jiaqi
    Zhang, Fan
    Zou, Lei
    [J]. 2018 17TH IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (IEEE TRUSTCOM) / 12TH IEEE INTERNATIONAL CONFERENCE ON BIG DATA SCIENCE AND ENGINEERING (IEEE BIGDATASE), 2018, : 1419 - 1425
  • [5] Scalable Multi-Query Optimization for SPARQL
    Le, Wangchao
    Kementsietsidis, Anastasios
    Duan, Songyun
    Li, Feifei
    [J]. 2012 IEEE 28TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2012, : 666 - 677
  • [6] A Vision for SPARQL Multi-Query Optimization on MapReduce
    Anyanwu, Kemafor
    [J]. 2013 IEEE 29TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS (ICDEW), 2013, : 25 - 26
  • [7] A parallel approach for improving Geo-SPARQL query performance
    Zhao, Tian
    Zhang, Chuanrong
    Anselin, Luc
    Li, Weidong
    Chen, Ke
    [J]. INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2015, 8 (05) : 383 - 402
  • [8] Multi-Query Optimization via Common Sub Query Elimination for SPARQL
    Zhou, Xiaoyi
    Luo, Jie
    He, Tao
    [J]. 2017 10TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 2, 2017, : 213 - 218
  • [9] Some Thoughts on OWL-Empowered SPARQL Query Optimization
    Papakonstantinou, Vassilis
    Flouris, Giorgos
    Fundulaki, Irini
    Gubichev, Andrey
    [J]. SEMANTIC WEB, ESWC 2016, 2016, 9989 : 12 - 16
  • [10] A Robust Optimization Approach of SQL-to-SPARQL Query Rewriting
    Ahmed, Abatal
    Bahaj, Mohamed
    Nassima, Soussi
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (11) : 538 - 543