Parallel processing of "GroupBy-Before-Join" queries in cluster architecture

被引:0
|
作者
Taniar, D [1 ]
Rahayu, JW [1 ]
机构
[1] Monash Univ, Sch Business Syst, Clayton, Vic 3800, Australia
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
SQL queries in the real world are replete with group by and join operations. This type of queries is often known as "GroupBy-Join" queries. In some GroupBy-Join queries, it is desirable to perform group-by, before join in order to achieve better performance. This subset of GroupBy-Join queries is called "GroupBy-Before-Join" queries. In this paper, we present a study. on parallelization of GroupBy-Before-Join queries, particularly. by, exploiting cluster architectures. Front our study,, we have learned that in parallel query optimization. processing group-by, as early, as possible is not always desirable. In many occasions, performing data distribution first before group-by offers performance advantages. In this study,, we also describe our cluster-based scheme.
引用
收藏
页码:178 / 185
页数:8
相关论文
共 50 条
  • [31] Study on Architecture of Photogrammetric Parallel Processing System Based on Cluster Computing
    Liu Hangye
    Sui Xuelian
    Zong Jingchun
    [J]. 2009 INTERNATIONAL CONFERENCE ON ENVIRONMENTAL SCIENCE AND INFORMATION APPLICATION TECHNOLOGY,VOL I, PROCEEDINGS, 2009, : 378 - +
  • [32] DHTJoin: processing continuous join queries using DHT networks
    Palma, Wenceslao
    Akbarinia, Reza
    Pacitti, Esther
    Valduriez, Patrick
    [J]. DISTRIBUTED AND PARALLEL DATABASES, 2009, 26 (2-3) : 291 - 317
  • [33] Efficient parallel spatial join processing method in a shared-nothing database cluster system
    Chung, W
    Park, SY
    Bae, HY
    [J]. EMBEDDED SOFTWARE AND SYSTEMS, 2005, 3605 : 81 - 87
  • [34] DHTJoin: processing continuous join queries using DHT networks
    Wenceslao Palma
    Reza Akbarinia
    Esther Pacitti
    Patrick Valduriez
    [J]. Distributed and Parallel Databases, 2009, 26
  • [35] An Efficient Parallel Algorithm for Evaluating Join Queries on Heterogeneous Distributed Systems
    Hassan, M. Al Hajj
    Bamha, M.
    [J]. 16TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING (HIPC), PROCEEDINGS, 2009, : 350 - 358
  • [36] Parallel Algorithms for Sparse Matrix Multiplication and Join-Aggregate Queries
    Hu, Xiao
    Yi, Ke
    [J]. PODS'20: PROCEEDINGS OF THE 39TH ACM SIGMOD-SIGACT-SIGAI SYMPOSIUM ON PRINCIPLES OF DATABASE SYSTEMS, 2020, : 411 - 425
  • [37] ONE-SHOT SEMI-JOIN EXECUTION STRATEGIES FOR PROCESSING DISTRIBUTED JOIN QUERIES
    WANG, CP
    LI, VOK
    CHEN, ALP
    [J]. COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 1993, 8 (04): : 245 - 253
  • [38] Data partitioning for parallel spatial join processing
    Zhou, XF
    Abel, DJ
    Truffet, D
    [J]. ADVANCES IN SPATIAL DATABASES, 1997, 1262 : 178 - 196
  • [39] Data Partitioning for Parallel Spatial Join Processing
    Zhou X.
    Abel D.J.
    Truffet D.
    [J]. GeoInformatica, 1998, 2 (2) : 175 - 204
  • [40] Parallel optimization of large join queries with set operators and aggregates in a parallel environment supporting pipeline
    Spiliopoulou, M
    Hatzopoulos, M
    Cotronis, Y
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1996, 8 (03) : 429 - 445