Parallel query processing for OLAP in grids

被引:7
|
作者
Kotowski, Nelson [1 ]
Lima, Alexandre A. B. [2 ]
Pacitti, Esther [3 ]
Valduriez, Patrick [3 ]
Mattoso, Marta [1 ]
机构
[1] Univ Fed Rio de Janeiro, COPPE, BR-21941972 Rio De Janeiro, Brazil
[2] UNIGRANRIO, Rio De Janeiro, Brazil
[3] Univ Nantes, INRIA & LINA, Nantes, Pays De Loire, France
来源
关键词
data grid; distributed autonomous databases; parallel OLAP query processing;
D O I
10.1002/cpe.1303
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
OLAP query processing is critical for enterprise grids. Capitalizing on our experience with the ParGRIES database cluster, we propose a middleware solution, GParGRES, which exploits database replication and inter- and intra-query parallelism to efficiently support OLAP queries in a grid. GParGRES is designed as a wrapper that enables the use of ParGRES in PC clusters of a grid (in our case, Grid5000). Our approach has two levels of query splitting: grid-level splitting, implemented by GParGRES, and node-level splitting, implemented by ParGRES. GParGRES has been partially implemented as database grid services compatible with existing grid solutions such as the open grid service architecture and the Web services resource framework. We give preliminary experimental results obtained with two clusters of Grid5000 using queries of the TPC-H Benchmark. The results show linear or almost linear speedup in query execution, as more nodes are added in all tested configurations. Copyright (c) 2008 John Wiley & Sons, Ltd.
引用
下载
收藏
页码:2039 / 2048
页数:10
相关论文
共 50 条
  • [1] OLAP parallel query processing in clouds with C-ParGRES
    Ribeiro, Marcello W. M.
    Lima, Alexandre A. B.
    de Oliveira, Daniel
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2020, 32 (07):
  • [2] Parallel OLAP query processing in database clusters with data replication
    Alexandre A. B. Lima
    Camille Furtado
    Patrick Valduriez
    Marta Mattoso
    Distributed and Parallel Databases, 2009, 25 : 97 - 123
  • [3] Parallel OLAP query processing in database clusters with data replication
    Lima, Alexandre A. B.
    Furtado, Camille
    Valduriez, Patrick
    Mattoso, Marta
    DISTRIBUTED AND PARALLEL DATABASES, 2009, 25 (1-2) : 97 - 123
  • [4] Resource scheduling for parallel query processing on computational grids
    Gounaris, A
    Sakellariou, R
    Paton, NW
    Fernandes, AAA
    FIFTH IEEE/ACM INTERNATIONAL WORKSHOP ON GRID COMPUTING, PROCEEDINGS, 2004, : 396 - 401
  • [6] A Graph-based Database Partitioning Method for Parallel OLAP Query Processing
    Nam, Yoon-Min
    Kim, Min-Soo
    Han, Donghyoung
    2018 IEEE 34TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2018, : 1025 - 1036
  • [7] OLAP query processing in a database cluster
    Lima, AAB
    Mattoso, M
    Valduriez, P
    EURO-PAR 2004 PARALLEL PROCESSING, PROCEEDINGS, 2004, 3149 : 355 - 362
  • [8] Parallel Star Join plus DataIndexes: Efficient query processing in data warehouses and OLAP
    Datta, A
    VanderMeer, D
    Ramamritham, K
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2002, 14 (06) : 1299 - 1316
  • [9] A novel approach to resource scheduling for parallel query processing on computational grids
    Anastasios Gounaris
    Rizos Sakellariou
    Norman W. Paton
    Alvaro A. A. Fernandes
    Distributed and Parallel Databases, 2006, 19 : 87 - 106
  • [10] A novel approach to resource scheduling for parallel query processing on computational grids
    Gounaris, Anastasios
    Sakellariou, Rizos
    Paton, Norman W.
    Fernandes, Alvaro A. A.
    DISTRIBUTED AND PARALLEL DATABASES, 2006, 19 (2-3) : 87 - 106