Scaling-Up In-Memory Datalog Processing: Observations and Techniques

被引:8
|
作者
Fan, Zhiwei [1 ]
Zhu, Jianqiao [1 ]
Zhang, Zuyu [1 ]
Albarghouthi, Aws [1 ]
Koutris, Paraschos [1 ]
Patel, Jignesh [1 ]
机构
[1] Univ Wisconsin, Madison, WI 53706 USA
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2019年 / 12卷 / 06期
基金
美国国家科学基金会;
关键词
SOCIALITE;
D O I
10.14778/3311880.3311886
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recursive query processing has experienced a recent resurgence, as a result of its use in many modern application domains, including data integration, graph analytics, security, program analysis, networking and decision making. Due to the large volumes of data being processed, several research efforts across multiple communities have explored how to scale up recursive queries, typically expressed in Datalog. Our experience with these tools indicate that their performance does not translate across domains e.g., a tool de- signed for large-scale graph analytics does not exhibit the same performance on program-analysis tasks, and vice versa. Starting from the above observation, we make the following two contributions. First, we perform a detailed experimental evaluation comparing a number of state-of-the-art Datalog systems on a wide spectrum of graph analytics and program-analysis tasks, and summarize the pros and cons of existing techniques. Second, we design and implement our own general-purpose Datalog engine, called RecStep, on top of a parallel single-node relational system. We outline the techniques we applied on RecStep, as well as the contribution of each technique to the overall performance. Using RecStep as a baseline, we demonstrate that it generally outperforms state-of-the-art parallel Datalog engines on complex and large-scale Datalog evaluation, by a 4-6X margin. An additional insight from our work is that it is possible to build a high-performance Datalog system on top of a relational engine, an idea that has been dismissed in past work.
引用
下载
收藏
页码:695 / 708
页数:14
相关论文
共 50 条
  • [1] Scaling-Up Robust Gradient Descent Techniques
    Holland, Matthew J.
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 7694 - 7701
  • [2] SCALING-UP
    ANDREWS, CJ
    IEEE TECHNOLOGY AND SOCIETY MAGAZINE, 1995, 14 (02) : 4 - 5
  • [3] SCALING-UP OF EMULSIONS
    HEERS, W
    FETTE SEIFEN ANSTRICHMITTEL, 1974, 76 (09): : 420 - 423
  • [4] Scaling-Up fire
    Torero, Jose L.
    PROCEEDINGS OF THE COMBUSTION INSTITUTE, 2013, 34 : 99 - 124
  • [5] Scaling-Up Distributed Processing of Data Streams for Machine Learning
    Nokleby, Matthew
    Raja, Haroon
    Bajwa, Waheed U.
    PROCEEDINGS OF THE IEEE, 2020, 108 (11) : 1984 - 2012
  • [6] Memory Processing Unit for In-Memory Processing
    Ben Hur, Rotem
    Kvatinsky, Shahar
    PROCEEDINGS OF THE 2016 IEEE/ACM INTERNATIONAL SYMPOSIUM ON NANOSCALE ARCHITECTURES (NANOARCH), 2016, : 171 - 172
  • [7] SCALING-UP PREVENTION OF STUNTING
    Ahmed, T.
    Hossain, M.
    Chisti, J.
    Mahfuz, M.
    Islam, M.
    Hossain, I.
    ANNALS OF NUTRITION AND METABOLISM, 2013, 63 : 60 - 61
  • [8] Scaling-up participation at USAID
    Corneille, F
    Shiffman, J
    PUBLIC ADMINISTRATION AND DEVELOPMENT, 2004, 24 (03) : 255 - 262
  • [9] Scaling-up phase selection
    Mohite, Aditya D.
    Blancon, Jean-Christophe
    NATURE MATERIALS, 2018, 17 (12) : 1058 - 1059
  • [10] SCALING-UP IN MATHEMATICS EDUCATION
    Gomez, Pedro
    Molina, Marta
    Jose Gonzalez, Maria
    INVESTIGACION EN EDUCACION MATEMATICA XIII, 2009, : 237 - 246