Analysis and Improvement of Optimizer for Query Processing on Graph Store

被引:9
|
作者
Yao, Youyang [1 ]
Li, Jiaqi [1 ]
Chen, Rong [1 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai Key Lab Scalable Comp & Syst, Inst Parallel & Distributed Syst, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
Optimizer; Type-centric Estimation; Graph Store; Query Processing;
D O I
10.1145/3265723.3265729
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
RDF systems are widely used to store public knowledge bases and process SPARQL queries. A large number of such systems have been proposed in the recent literature to provide low latency and high throughput for concurrent query processing over large RDF data. We perform an in-depth analysis on three key components (cardinality estimation, cost model, and plan enumeration) of the query optimizer to reveal the main issues and challenges on the accuracy and performance for traditional approaches. This calls for a rethink of how to build an accurate and fast query optimizer for modern RDF systems. We introduce a type-centric approach to enhance the accuracy of cardinality estimation prominently, which naturally embeds the lineage of correlated query conditions (triple patterns) into existing type system of RDF data. The preliminary results show that our approach greatly improves the accuracy of query optimization by several orders of magnitude compared to state-of-the-art approaches and provides a better overall performance by reducing execution time or optimization time.
引用
收藏
页数:8
相关论文
共 50 条
  • [21] SHC: Distributed Query Processing for Non-Relational Data Store
    Yang, Weiqing
    Tang, Mingjie
    Yu, Yongyang
    Liang, Yanbo
    Saha, Bikas
    2018 IEEE 34TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2018, : 1465 - 1476
  • [22] Neo: A Learned Query Optimizer
    Marcus, Ryan
    Negi, Parimarjan
    Mao, Hongzi
    Zhang, Chi
    Alizadeh, Mohammad
    Kraska, Tim
    Papaemmanouil, Olga
    Tatbul, Nesime
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2019, 12 (11): : 1705 - 1718
  • [23] DeepO: A Learned Query Optimizer
    Sun, Luming
    Ji, Tao
    Li, Cuiping
    Chen, Hong
    PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA (SIGMOD '22), 2022, : 2421 - 2424
  • [24] HadoopToSQL a MapReduce Query Optimizer
    Iu, Ming-Yee
    Zwaenepoel, Willy
    EUROSYS'10: PROCEEDINGS OF THE EUROSYS 2010 CONFERENCE, 2010, : 251 - 264
  • [25] Data analysis for query processing
    Robinson, J
    Lowden, BGT
    ADVANCES IN INTELLIGENT DATA ANALYSIS: REASONING ABOUT DATA, 1997, 1280 : 447 - 458
  • [26] The Vertica Query Optimizer: The Case for Specialized Query Optimizers
    Tran, Nga
    Lamb, Andrew
    Shrinivas, Lakshmikant
    Bodagala, Sreenath
    Dave, Jaimin
    2014 IEEE 30TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2014, : 1108 - 1119
  • [27] GRAPH/Z: A Key-Value Store Based Scalable Graph Processing System
    Li, Tonglin
    Ma, Chaoqi
    Li, Jiabao
    Zhou, Xiaobing
    Wang, Ke
    Zhao, Dongfang
    Sadooghi, Iman
    Raicu, Ioan
    2015 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING - CLUSTER 2015, 2015, : 516 - 517
  • [28] G-Store: High-Performance Graph Store for Trillion-Edge Processing
    Kumar, Pradeep
    Huang, H. Howie
    SC '16: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2016, : 830 - 841
  • [29] Plan Before You Execute: A Cost-Based Query Optimizer for Attributed Graph Databases
    Das, Soumyava
    Goyal, Ankur
    Chakravarthy, Sharma
    BIG DATA ANALYTICS AND KNOWLEDGE DISCOVERY, DAWAK 2016, 2016, 9829 : 314 - 328
  • [30] Combining Graph Exploration and Fragmentation for Scalable RDF Query Processing
    Khelil, Abdallah
    Mesmoudi, Amin
    Galicia, Jorge
    Bellatreche, Ladjel
    Hacid, Mohand-Said
    Coquery, Emmanuel
    INFORMATION SYSTEMS FRONTIERS, 2021, 23 (01) : 165 - 183