HyMJ: A Hybrid Structure Aware Approach to Distributed Multi-Way Join Query

Cited by: 1
Authors
Zhu, Guanghui [1]
Wu, Xiaoqi [1]
Yin, Liangliang [1]
Wang, Haogang [1]
Gu, Rong [1]
Yuan, Chunfeng [1]
Huang, Yihua [1]
Affiliations
[1] Nanjing Univ, Collaborat Innovat Ctr Novel Software Technol & I, Natl Key Lab Novel Software Technol, Nanjing 210023, Jiangsu, Peoples R China
Funding
China Postdoctoral Science Foundation; National Natural Science Foundation of China
Keywords
multi-way join; distributed computing; parallel query; Apache Spark;
DOI
10.1109/ICDE.2019.00183
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Subject classification code
0812
Abstract
The multi-way join query plays a fundamental role in many big data analytics scenarios. Hybrid join queries have recently become increasingly important, but existing one-round and multi-round algorithms have limitations when processing them. In this paper, we present a novel hybrid structure-aware multi-way join algorithm called HyMJ, which combines the one-round and multi-round algorithms to compute hybrid queries efficiently. First, we propose the query structure graph (QSG) to represent the internal structure of a given join query, and the query structure decomposition tree (QSDT) to represent the structure-aware query plan. Each internal node of the QSDT denotes a subquery with a cyclic or acyclic query structure. Then, we design a graph-contraction-based algorithm to construct the QSDT from the QSG. Furthermore, to select the optimal join strategy for each subquery in the QSDT, we introduce a heuristic strategy selection model. Experimental results on Apache Spark show that HyMJ outperforms both the one-round and multi-round algorithms for hybrid multi-way join queries on real-world datasets.
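To make the cyclic/acyclic distinction concrete, the sketch below separates the cyclic core of a join query from its acyclic parts. It is not HyMJ's QSG or QSDT construction, which the abstract does not specify; it uses the classic GYO reduction on the join hypergraph instead. It assumes that a "hybrid" query is one mixing cyclic and acyclic parts, and the function name `cyclic_core` and the relation names are hypothetical.

```python
from typing import Dict, Iterable, List


def cyclic_core(relations: Dict[str, Iterable[str]]) -> List[str]:
    """Run GYO reduction and return the relations that cannot be eliminated.

    An empty result means the join query is (alpha-)acyclic. A non-empty
    result names the relations that form the cyclic part of the query.
    This is an illustrative stand-in, not the algorithm from the paper.
    """
    edges = [(name, set(attrs)) for name, attrs in relations.items()]
    changed = True
    while changed:
        changed = False
        # Rule 1: drop attributes that occur in exactly one relation.
        for _, attrs in edges:
            for a in list(attrs):
                if sum(a in other for _, other in edges) == 1:
                    attrs.discard(a)
                    changed = True
        # Rule 2: drop one relation that is empty or contained in another.
        for i, (_, attrs) in enumerate(edges):
            if not attrs or any(
                i != j and attrs <= other for j, (_, other) in enumerate(edges)
            ):
                edges.pop(i)
                changed = True
                break
    return sorted(name for name, _ in edges)


# A triangle R-S-T (cyclic) with a chain U-V (acyclic) hanging off attribute c.
hybrid_query = {
    "R": ["a", "b"],
    "S": ["b", "c"],
    "T": ["a", "c"],
    "U": ["c", "d"],
    "V": ["d", "e"],
}
core = cyclic_core(hybrid_query)
```

Here the chain `U`, `V` is reduced away and the triangle `R`, `S`, `T` remains. Under the assumption above, that remaining core is the kind of subquery where a one-round strategy might be considered, while the eliminated chain could be joined round by round.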
Pages: 1726-1729
Page count: 4