Tempura: A General Cost-Based Optimizer Framework for Incremental Data Processing

被引:5
|
作者
Wang, Zuozhi [1 ]
Zeng, Kai [2 ]
Huang, Botong [2 ]
Chen, Wei [2 ]
Cui, Xiaozong [2 ]
Wang, Bo [2 ]
Liu, Ji [2 ]
Fan, Liya [2 ]
Qu, Dachuan [2 ]
Hou, Zhenyu [2 ]
Guan, Tao [2 ]
Li, Chen [1 ]
Zhou, Jingren [2 ]
机构
[1] Univ Calif Irvine, Irvine, CA 92697 USA
[2] Alibaba Grp, Hangzhou, Peoples R China
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2020年 / 14卷 / 01期
关键词
Query processing - Data handling;
D O I
10.14778/3421424.3421427
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Incremental processing is widely-adopted in many applications, ranging from incremental view maintenance, stream computing, to recently emerging progressive data warehouse and intermittent query processing. Despite many algorithms developed on this topic, none of them can produce an incremental plan that always achieves the best performance, since the optimal plan is data dependent. In this paper, we develop a novel cost-based optimizer framework, called Tempura, for optimizing incremental data processing. We propose an incremental query planning model called TIP based on the concept of time-varying relations, which can formally model incremental processing in its most general form. We give a full specification of Tempura, which can not only unify various existing techniques to generate an optimal incremental plan, but also allow the developer to add their rewrite rules. We study how to explore the plan space and search for an optimal incremental plan. We evaluate Tempura in various incremental processing scenarios to show its effectiveness and efficiency.
引用
收藏
页码:14 / 27
页数:14
相关论文
共 50 条
  • [1] Tempura: a general cost-based optimizer framework for incremental data processing (Journal Version)
    Wang, Zuozhi
    Zeng, Kai
    Huang, Botong
    Chen, Wei
    Cui, Xiaozong
    Wang, Bo
    Liu, Ji
    Fan, Liya
    Qu, Dachuan
    Hou, Zhenyu
    Guan, Tao
    Li, Chen
    Zhou, Jingren
    VLDB JOURNAL, 2023, 32 (06): : 1315 - 1342
  • [2] Tempura: a general cost-based optimizer framework for incremental data processing (Journal Version)
    Zuozhi Wang
    Kai Zeng
    Botong Huang
    Wei Chen
    Xiaozong Cui
    Bo Wang
    Ji Liu
    Liya Fan
    Dachuan Qu
    Zhenyu Hou
    Tao Guan
    Chen Li
    Jingren Zhou
    The VLDB Journal, 2023, 32 : 1315 - 1342
  • [3] A Cost-based Optimizer for Gradient Descent Optimization
    Kaoudi, Zoi
    Quiane-Ruiz, Jorge-Arnulfo
    Thirumuruganathan, Saravanan
    Chawla, Sanjay
    Agrawal, Divy
    SIGMOD'17: PROCEEDINGS OF THE 2017 ACM INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2017, : 977 - 992
  • [4] gCBO: A Cost-based Optimizer for Graph Databases
    Yang, Linglin
    Yang, Lei
    Pang, Yue
    Zou, Lei
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 5054 - 5058
  • [5] RHEEMix in the data jungle: a cost-based optimizer for cross-platform systems
    Sebastian Kruse
    Zoi Kaoudi
    Bertty Contreras-Rojas
    Sanjay Chawla
    Felix Naumann
    Jorge-Arnulfo Quiané-Ruiz
    The VLDB Journal, 2020, 29 : 1287 - 1310
  • [6] RHEEMix in the data jungle: a cost-based optimizer for cross-platform systems
    Kruse, Sebastian
    Kaoudi, Zoi
    Contreras-Rojas, Bertty
    Chawla, Sanjay
    Naumann, Felix
    Quiane-Ruiz, Jorge-Arnulfo
    VLDB JOURNAL, 2020, 29 (06): : 1287 - 1310
  • [7] Geno: Cost-based Heterogeneous Fusion Query Optimizer
    Tu Y.-F.
    Chen X.-Q.
    Zhou S.-J.
    Bian F.-S.
    Wu F.
    Chen B.
    Ruan Jian Xue Bao/Journal of Software, 2022, 33 (03): : 774 - 796
  • [8] Cost-based Fault-tolerance for Parallel Data Processing
    Salama, Abdallah
    Binnig, Carsten
    Kraska, Tim
    Zamanian, Erfan
    SIGMOD'15: PROCEEDINGS OF THE 2015 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2015, : 285 - 297
  • [9] An Effective and Cost-Based Framework for a Qualitative Hybrid Data Deduplication
    Haruna, Charles R.
    Hou, MengShu
    Eghan, Moses J.
    Kpiebaareh, Michael Y.
    Tandoh, Lawrence
    ADVANCES IN COMPUTER COMMUNICATION AND COMPUTATIONAL SCIENCES, IC4S 2018, 2019, 924 : 511 - 520
  • [10] Towards a framework for cost-based transformation
    Skillicorn, DB
    JOURNAL OF SYSTEMS ARCHITECTURE, 1996, 42 (05) : 331 - 340