Parallel processing of "GroupBy-Before-Join" queries in cluster architecture

被引:0
|
作者
Taniar, D [1 ]
Rahayu, JW [1 ]
机构
[1] Monash Univ, Sch Business Syst, Clayton, Vic 3800, Australia
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
SQL queries in the real world are replete with group by and join operations. This type of queries is often known as "GroupBy-Join" queries. In some GroupBy-Join queries, it is desirable to perform group-by, before join in order to achieve better performance. This subset of GroupBy-Join queries is called "GroupBy-Before-Join" queries. In this paper, we present a study. on parallelization of GroupBy-Before-Join queries, particularly. by, exploiting cluster architectures. Front our study,, we have learned that in parallel query optimization. processing group-by, as early, as possible is not always desirable. In many occasions, performing data distribution first before group-by offers performance advantages. In this study,, we also describe our cluster-based scheme.
引用
收藏
页码:178 / 185
页数:8
相关论文
共 50 条
  • [1] Parallel "GroupBy-Before-Join" query processing for high performance parallel/distributed database systems
    Taniar, David
    Rahayu, Wenny
    [J]. 20TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS, VOL 1, PROCEEDINGS, 2006, : 693 - +
  • [2] Performance evaluation of parallel GroupBy-Before-Join query processing in high performance database systems
    Taniar, D
    Rahayu, JW
    Ekonomosa, H
    [J]. HIGH-PERFORMANCE COMPUTING AND NETWORKING, 2001, 2110 : 241 - 250
  • [3] An optimal evaluation of groupby-join queries in distributed architectures
    Hassan, M. Al Hajj
    Bamha, M.
    [J]. WEBIST 2007: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS AND TECHNOLOGIES, VOL IT: INTERNET TECHNOLOGY, 2007, : 246 - +
  • [4] Performance analysis of "Groupby-After-Join" query processing in parallel database systems
    Taniar, D
    Tan, RBN
    Leung, CHC
    Liu, KH
    [J]. INFORMATION SCIENCES, 2004, 168 (1-4) : 25 - 50
  • [5] Parallel processing of "Group-By Join" queries on shared nothing machines
    Hassan, M. Al Hajj
    Bamha, M.
    [J]. SOFTWARE AND DATA TECHNOLOGIES, 2008, 10 : 230 - 241
  • [6] Parallel processing of "group-by join" queries on shared nothing machines
    Hassan, M. Al Hajj
    Bamha, M.
    [J]. ICSOFT 2006: PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON SOFTWARE AND DATA TECHNOLOGIES, VOL 1, 2006, : 301 - 307
  • [7] Efficient Parallel Processing of Distance Join Queries Over Distributed Graphs
    Zhang, Xiaofei
    Chen, Lei
    Wang, Min
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (03) : 740 - 754
  • [8] Parallel processing of olap queries using a cluster of workstations
    Dehuri, S.
    Mall, R.
    [J]. INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY & DECISION MAKING, 2007, 6 (02) : 279 - 299
  • [9] A parallel processing architecture to optimize runtime in aggregated SPARQL queries
    Rabhi, Ahmed
    Fissoune, Rachida
    Tabaa, Mohamed
    Badir, Hassan
    [J]. PROCEEDINGS OF 2022 14TH INTERNATIONAL CONFERENCE ON MANAGEMENT OF DIGITAL ECOSYSTEMS, MEDES 2022, 2022, : 9 - 15
  • [10] Processing distance join queries with constraints
    Papadopoulos, Apostolos N.
    Nanopoulos, Alexandros
    Manolopoulos, Yannis
    [J]. Computer Journal, 2006, 49 (03): : 281 - 296