SQLoop: High Performance Iterative Processing in Data Management

被引:5
|
作者
Floratos, Sofoklis [1 ]
Zhang, Yanfeng [1 ,2 ]
Yuan, Yuan [3 ]
Lee, Rubao [1 ]
Zhang, Xiaodong [1 ]
机构
[1] Ohio State Univ, Columbus, OH 43210 USA
[2] Northeastern Univ, Boston, MA 02115 USA
[3] Google Inc, Mountain View, CA USA
基金
中国国家自然科学基金; 美国国家科学基金会;
关键词
SOCIALITE; FRAMEWORK; QUERIES;
D O I
10.1109/ICDCS.2018.00104
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Increasingly more iterative and recursive query tasks are processed in data management systems, such as graph-structured data analytics, demanding fast response time. However, existing CTE-based recursive SQL and its implementation ineffectively respond to this intensive query processing with two major drawbacks. First, its iteration execution model is based on implicit set-oriented terminating conditions that cannot express aggregation-based tasks, such as PageRank. Second, its synchronous execution model cannot perform asynchronous computing to further accelerate execution in parallel. To address these two issues, we have designed and implemented SQLoop, a framework that extends the semantics of current SQL standard in order to accommodate iterative SQL queries. SQLoop interfaces between users and different database engines with two powerful components. First, it provides an uniform SQL expression for users to access any database engine so that they do not need to mite database dependent SQL or move datasets from a target engine to process in their own sites. Second, SQLoop automatically parallelizes iterative queries that contain certain aggregate functions in both synchronous and asynchronous ways. More specifically, SQLoop is able to take advantage of intermediate results generated between different iterations and to prioritize the execution of partitions that accelerate the query processing. We have tested and evaluated SQLoop by using three popular database engines with real-world datasets and queries, and shown its effectiveness and high performance.
引用
收藏
页码:1039 / 1051
页数:13
相关论文
共 50 条
  • [1] THE DESIGN OF A HIGH PERFORMANCE EARTH IMAGERY AND RASTER DATA MANAGEMENT AND PROCESSING PLATFORM
    Xie, Qingyun
    [J]. XXIII ISPRS Congress, Commission IV, 2016, 41 (B4): : 551 - 555
  • [2] High performance data deployment and processing
    Fleming, David
    Plante, Raymond
    [J]. ASTRONOMICAL DATA ANALYSIS SOFTWARE AND SYSTEMS XVI, 2007, 376 : 38 - +
  • [3] FiM: Performance Prediction for Parallel Computation in Iterative Data Processing Applications
    Bhimani, Janki
    Mi, Ningfang
    Leeser, Miriam
    Yang, Zhengyu
    [J]. 2017 IEEE 10TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD), 2017, : 359 - 366
  • [4] Performance enhancement for iterative data computing with in-memory concurrent processing
    Wen, Yean-Fu
    Chen, Yu-Fang
    Chiu, Tse Kai
    Chen, Yen-Chou
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2020, 32 (07):
  • [5] pipsCloud: High performance cloud computing for remote sensing big data management and processing
    Wang, Lizhe
    Ma, Yan
    Yan, Jining
    Chang, Victor
    Zomaya, Albert Y.
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2018, 78 : 353 - 368
  • [6] Iterative algorithms for the post-processing of high-dimensional data
    Espig, Mike
    Hackbusch, Wolfgang
    Litvinenko, Alexander
    Matthies, Hermann G.
    Zander, Elmar
    [J]. Journal of Computational Physics, 2020, 410
  • [7] Iterative algorithms for the post-processing of high-dimensional data
    Espig, Mike
    Hackbusch, Wolfgang
    Litvinenko, Alexander
    Matthies, Hermann G.
    Zander, Elmar
    [J]. JOURNAL OF COMPUTATIONAL PHYSICS, 2020, 410
  • [8] High Performance Computing needs high performance data management
    Kleese, K
    [J]. PROCEEDINGS OF THE HIGH PERFORMANCE COMPUTING SYMPOSIUM - HPC '99, 1999, : 331 - 336
  • [9] Advanced high performance algorithms for data processing
    Bogdanov, AV
    Boukhanovsky, AV
    [J]. COMPUTATIONAL SCIENCE - ICCS 2004, PT 1, PROCEEDINGS, 2004, 3036 : 239 - 246
  • [10] High-performance data processing for image and data fusion
    Zhang, Jixian
    [J]. INTERNATIONAL JOURNAL OF IMAGE AND DATA FUSION, 2016, 7 (01) : 1 - 2