CliqueSquare: Flat Plans for Massively Parallel RDF Queries

被引:0
|
作者
Goasdoue, Francois [1 ,2 ]
Kaoudi, Zoi [3 ]
Manolescu, Ioana [2 ,4 ]
Quiane-Ruiz, Jorge-Arnulfo [5 ]
Zampetakis, Stamatis [2 ,4 ]
机构
[1] Univ Rennes 1, Rennes, France
[2] INRIA, Rocquencourt, France
[3] Athena Res Ctr, IMIS, Maroussi, Greece
[4] Univ Paris Sud, Orsay, France
[5] QCRI, Doha, Qatar
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As increasing volumes of RDF data are being produced and analyzed, many massively distributed architectures have been proposed for storing and querying this data. These architectures are characterized first, by their RDF partitioning and storage method, and second, by their approach for distributed query optimization, i.e., determining which operations to execute on each node in order to compute the query answers. We present CliqueSquare, a novel optimization approach for evaluating conjunctive RDF queries in a massively parallel environment. We focus on reducing query response time, and thus seek to build flat plans, where the number of joins encountered on a root-to-leaf path in the plan is minimized. We present a family of optimization algorithms, relying on n-ary (star) equality joins to build flat plans, and compare their ability to find the flattest possibles. We have deployed our algorithms in a MapReduce-based RDF platform and demonstrate experimentally the interest of the flat plans built by our best algorithms.
引用
收藏
页码:771 / 782
页数:12
相关论文
共 50 条
  • [1] CliqueSquare in Action: Flat Plans for Massively Parallel RDF Queries
    Djahandideh, Benjamin
    Goasdoue, Francois
    Kaoudi, Zoi
    Manolescu, Ioana
    Quiane-Ruiz, Jorge-Arnulfo
    Zampetakis, Stamatis
    [J]. 2015 IEEE 31ST INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2015, : 1432 - 1435
  • [2] Towards Parallel Processing of RDF Queries in DHTs
    Lohrmann, Bjoern
    Battre, Dominic
    Kao, Odej
    [J]. DATA MANAGEMENT IN GRID AND PEER-TO-PEER SYSTEMS, PROCEEDINGS, 2009, 5697 : 36 - 47
  • [3] Efficient Massively Parallel Join Optimization for Large Queries
    Mancini, Riccardo
    Karthik, Srinivas
    Chandra, Bikash
    Mageirakos, Vasilis
    Ailamaki, Anastasia
    [J]. PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA (SIGMOD '22), 2022, : 122 - 135
  • [4] Massively Parallel Nearest Neighbor Queries for Dynamic Point Clouds on the GPU
    Leite, Pedro
    Teixeira, Joao M.
    Farias, Thiago
    Teichrieb, Veronica
    Kelner, Judith
    [J]. PROCEEDINGS OF THE 21ST INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING, 2009, : 19 - +
  • [5] Parallel and scalable processing of spatio-temporal RDF queries using Spark
    Nikitopoulos, Panagiotis
    Vlachou, Akrivi
    Doulkeridis, Christos
    Vouros, George A.
    [J]. GEOINFORMATICA, 2021, 25 (04) : 623 - 653
  • [6] Parallel and scalable processing of spatio-temporal RDF queries using Spark
    Panagiotis Nikitopoulos
    Akrivi Vlachou
    Christos Doulkeridis
    George A. Vouros
    [J]. GeoInformatica, 2021, 25 : 623 - 653
  • [7] RDF aggregate queries and views
    Hung, E
    Deng, Y
    Subrahmanian, VS
    [J]. ICDE 2005: 21ST INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2005, : 717 - 728
  • [8] Evaluation of RDF queries via equivalence
    Weiwei Ni
    Zhihong Chong
    Hu Shu
    Jiajia Bao
    Aoying Zhou
    [J]. Frontiers of Computer Science, 2013, 7 : 20 - 33
  • [9] Web queries in Protoform and RDF semantic
    Tseng, C
    Ng, P
    [J]. Proceedings of the 8th Joint Conference on Information Sciences, Vols 1-3, 2005, : 1437 - 1440
  • [10] Query Relaxation for Star Queries on RDF
    Huang, Hai
    Liu, Chengfei
    [J]. WEB INFORMATION SYSTEM ENGINEERING-WISE 2010, 2010, 6488 : 376 - 389