GeMDA: A multidimensional data partitioning technique for multiprocessor database systems

被引:7
|
作者
Lo, YL
Hua, KA
Young, HC
机构
[1] Chaoyang Univ Technol, Dept Informat Management, Wufeng 413, Taichung County, Taiwan
[2] Univ Cent Florida, Sch Elect Engn & Comp Sci, Orlando, FL 32816 USA
[3] IBM Corp, Almaden Res Ctr, Div Res, San Jose, CA 95120 USA
关键词
data allocation; data fragmentation; parallel database system; query processing; system utilization;
D O I
10.1023/A:1019265612794
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Several studies have repeatedly demonstrated that both the performance and scalability of a shared nothing parallel database system depend on the physical layout of data across the processing nodes of the system. Today, data is allocated in these systems using horizontal partitioning strategies. This approach has a number of drawbacks. If a query involves the partitioning attribute, then typically only a small number of the processing nodes can be used to speedup the execution of this query. On the other hand, if the predicate of a selection query includes an attribute other than the partitioning attribute, then the entire data space must be searched. Again, this results in waste of computing resources. In recent years, several multidimensional data declustering techniques have been proposed to address these problems. However, these schemes are too restrictive (e.g., FX, ECC, etc.), or optimized for a certain type of queries (e.g., DM, HCAM, etc.). In this paper, we introduce a new technique which is flexible, and performs well for general queries, We prove its optimality properties, and present experimental results showing that our scheme outperforms DM and HCAM by a significant margin.
引用
收藏
页码:211 / 236
页数:26
相关论文
共 50 条
  • [41] Simulating of query processing on multiprocessor database systems with modern coprocessors
    Besedin, Konstantin Y.
    Kostenetskiy, Pavel S.
    2014 37TH INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2014, : 1614 - 1616
  • [42] PPS - A parallel partition sort algorithm for multiprocessor database systems
    Zhao, X
    Martin, NJ
    Johnson, RG
    11TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATION, PROCEEDINGS, 2000, : 635 - 644
  • [43] Effective skew handling for parallel sorting in multiprocessor database systems
    Lo, YL
    Huang, YC
    NINTH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS, PROCEEDINGS, 2002, : 151 - 156
  • [44] Algorithms for Efficient Load Control in Multiprocessor Database Systems.
    Rahm, Erhard
    Angewandte Informatik, Applied Informatics, 1986, 28 (04): : 161 - 169
  • [45] Vertical partitioning for flash and HDD database systems
    Clementsen, Davur S.
    He, Zhen
    JOURNAL OF SYSTEMS AND SOFTWARE, 2010, 83 (11) : 2237 - 2250
  • [46] Optimal view selection for multidimensional database systems
    Soutyrina, E
    Fotouhi, F
    IDEAS '97 - INTERNATIONAL DATABASE ENGINEERING AND APPLICATIONS SYMPOSIUM, PROCEEDINGS, 1997, : 45 - 52
  • [47] A Pipeline Technique for Dynamic Data Transfer on a Multiprocessor Grid
    Stavros Souravlas
    Manos Roumeliotis
    International Journal of Parallel Programming, 2004, 32 : 361 - 388
  • [48] A pipeline technique for dynamic data transfer on a multiprocessor grid
    Souravlas, S
    Roumeliotis, M
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2004, 32 (05) : 361 - 388
  • [49] Multidimensional partitioning and bi-partitioning: analysis and application to gene expression data sets
    Kalna, Gabriela
    Vass, J. Keith
    Higham, Desmond J.
    INTERNATIONAL JOURNAL OF COMPUTER MATHEMATICS, 2008, 85 (3-4) : 475 - 485
  • [50] USING MULTIPROCESSOR SYSTEMS FOR MULTISPECTRAL DATA PROCESSING
    Nita, Iulian
    Aldea, Olga
    UNIVERSITY POLITEHNICA OF BUCHAREST SCIENTIFIC BULLETIN SERIES C-ELECTRICAL ENGINEERING AND COMPUTER SCIENCE, 2012, 74 (04): : 135 - 144