Parallel query processing for OLAP in grids

被引:7
|
作者
Kotowski, Nelson [1 ]
Lima, Alexandre A. B. [2 ]
Pacitti, Esther [3 ]
Valduriez, Patrick [3 ]
Mattoso, Marta [1 ]
机构
[1] Univ Fed Rio de Janeiro, COPPE, BR-21941972 Rio De Janeiro, Brazil
[2] UNIGRANRIO, Rio De Janeiro, Brazil
[3] Univ Nantes, INRIA & LINA, Nantes, Pays De Loire, France
来源
关键词
data grid; distributed autonomous databases; parallel OLAP query processing;
D O I
10.1002/cpe.1303
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
OLAP query processing is critical for enterprise grids. Capitalizing on our experience with the ParGRIES database cluster, we propose a middleware solution, GParGRES, which exploits database replication and inter- and intra-query parallelism to efficiently support OLAP queries in a grid. GParGRES is designed as a wrapper that enables the use of ParGRES in PC clusters of a grid (in our case, Grid5000). Our approach has two levels of query splitting: grid-level splitting, implemented by GParGRES, and node-level splitting, implemented by ParGRES. GParGRES has been partially implemented as database grid services compatible with existing grid solutions such as the open grid service architecture and the Web services resource framework. We give preliminary experimental results obtained with two clusters of Grid5000 using queries of the TPC-H Benchmark. The results show linear or almost linear speedup in query execution, as more nodes are added in all tested configurations. Copyright (c) 2008 John Wiley & Sons, Ltd.
引用
下载
收藏
页码:2039 / 2048
页数:10
相关论文
共 50 条
  • [21] Adaptive hybrid partitioning for OLAP query processing in a database cluster
    Computer Science Department, COPPE, Federal University of Rio de Janeiro , P.O. Box 68511, 21941-972 Rio de Janeiro, Brazil
    不详
    不详
    Int. J. High Perform. Comput. Networking, 2008, 4 (251-262):
  • [22] Parallel query processing in a polystore
    Pavlos Kranas
    Boyan Kolev
    Oleksandra Levchenko
    Esther Pacitti
    Patrick Valduriez
    Ricardo Jiménez-Peris
    Marta Patiño-Martinez
    Distributed and Parallel Databases, 2021, 39 : 939 - 977
  • [23] Reducing I/O Cost in OLAP Query Processing with MapReduce
    Kang, Woo-Lam
    Kim, Hyeon-Gyu
    Lee, Yoon-Joon
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2015, E98D (02): : 444 - 447
  • [24] Skew in Parallel Query Processing
    Beame, Paul
    Koutris, Paraschos
    Suciu, Dan
    PODS'14: PROCEEDINGS OF THE 33RD ACM SIGMOD-SIGACT-SIGART SYMPOSIUM ON PRINCIPLES OF DATABASE SYSTEMS, 2014, : 212 - 223
  • [25] Parallel query processing in a polystore
    Kranas, Pavlos
    Kolev, Boyan
    Levchenko, Oleksandra
    Pacitti, Esther
    Valduriez, Patrick
    Jimenez-Peris, Ricardo
    Patino-Martinez, Marta
    DISTRIBUTED AND PARALLEL DATABASES, 2021, 39 (04) : 939 - 977
  • [26] Adaptive parallel query processing
    Tok, WH
    Zhao, L
    Bressan, S
    PDPTA'2001: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, 2001, : 590 - 597
  • [27] Cache-Based Aggregate Query Shipping: An Efficient Scheme of Distributed OLAP Query Processing
    Liao, Hua-Ming
    Pei, Guo-Shun
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2008, 23 (06) : 905 - 915
  • [28] Allocating resources to parallel query plans in data grids
    Bose, Sumit Kumar
    Krishnamoorthy, Srikumar
    Ranade, Nilesh
    SIXTH INTERNATIONAL CONFERENCE ON GRID AND COOPERATIVE COMPUTING, PROCEEDINGS, 2007, : 210 - 217
  • [29] Cache-Based Aggregate Query Shipping: An Efficient Scheme of Distributed OLAP Query Processing
    Hua-Ming Liao
    Guo-Shun Pei
    Journal of Computer Science and Technology, 2008, 23 : 905 - 915
  • [30] Cache-Based Aggregate Query Shipping:An Efficient Scheme of Distributed OLAP Query Processing
    廖华明
    裴国顺
    Journal of Computer Science & Technology, 2008, (06) : 905 - 915