GeMDA: A multidimensional data partitioning technique for multiprocessor database systems

被引:7
|
作者
Lo, YL
Hua, KA
Young, HC
机构
[1] Chaoyang Univ Technol, Dept Informat Management, Wufeng 413, Taichung County, Taiwan
[2] Univ Cent Florida, Sch Elect Engn & Comp Sci, Orlando, FL 32816 USA
[3] IBM Corp, Almaden Res Ctr, Div Res, San Jose, CA 95120 USA
关键词
data allocation; data fragmentation; parallel database system; query processing; system utilization;
D O I
10.1023/A:1019265612794
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Several studies have repeatedly demonstrated that both the performance and scalability of a shared nothing parallel database system depend on the physical layout of data across the processing nodes of the system. Today, data is allocated in these systems using horizontal partitioning strategies. This approach has a number of drawbacks. If a query involves the partitioning attribute, then typically only a small number of the processing nodes can be used to speedup the execution of this query. On the other hand, if the predicate of a selection query includes an attribute other than the partitioning attribute, then the entire data space must be searched. Again, this results in waste of computing resources. In recent years, several multidimensional data declustering techniques have been proposed to address these problems. However, these schemes are too restrictive (e.g., FX, ECC, etc.), or optimized for a certain type of queries (e.g., DM, HCAM, etc.). In this paper, we introduce a new technique which is flexible, and performs well for general queries, We prove its optimality properties, and present experimental results showing that our scheme outperforms DM and HCAM by a significant margin.
引用
收藏
页码:211 / 236
页数:26
相关论文
共 50 条
  • [31] Adaptive data partitioning for multiprocessor implementation of MPEG2 encoders
    Zhang, N
    Wu, CH
    ISCAS '97 - PROCEEDINGS OF 1997 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS I - IV: CIRCUITS AND SYSTEMS IN THE INFORMATION AGE, 1997, : 1221 - 1224
  • [32] Design and evaluation of database multiprocessor architecture with high data availability
    Sokolinsky, LB
    12TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2001, : 115 - 120
  • [33] Energy-optimal software partitioning in heterogeneous multiprocessor embedded systems
    Goraczko, Michel
    Matic, Slobodan
    Liu, Jie
    Priyantha, Bodhi
    Lymberopoulos, Dimitrios
    Zhao, Feng
    2008 45TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, VOLS 1 AND 2, 2008, : 191 - +
  • [34] Partitioning Real-Time Tasks With Replications on Multiprocessor Embedded Systems
    Lin, Jian
    Cheng, Albert M. K.
    Gercek, Gokhan
    IEEE EMBEDDED SYSTEMS LETTERS, 2016, 8 (04) : 89 - 92
  • [35] Partitioning and Interface Synthesis in Hierarchical Multiprocessor Real-Time Systems
    Biondi, Alessandro
    Buttazzo, Giorgio
    Bertogna, Marko
    PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON REAL-TIME NETWORKS AND SYSTEMS PROCEEDINGS (RTNS 2016), 2016, : 257 - 266
  • [36] MULTIPROCESSOR ALGORITHMS FOR RELATIONAL-DATABASE OPERATORS ON HYPERCUBE SYSTEMS
    FRIEDER, O
    COMPUTER, 1990, 23 (11) : 13 - 28
  • [37] An efficiently hardware-software partitioning for embedded multiprocessor FPGA systems
    Lee, Trong-Yen
    Fan, Yang-Hsin
    Cheng, Yu-Min
    Tsai, Chia-Chun
    Hsiao, Rong-Shue
    IMECS 2007: INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, VOLS I AND II, 2007, : 346 - +
  • [38] Preference-oriented partitioning for multiprocessor real-time systems
    Xia, Qin
    Yan, Songming
    Chen, Haoxuan
    Zhu, Dakai
    Aydin, Hakan
    JOURNAL OF SYSTEMS ARCHITECTURE, 2022, 126
  • [39] ALGORITHMS FOR EFFICIENT LOAD CONTROL IN MULTIPROCESSOR DATABASE-SYSTEMS
    RAHM, E
    ANGEWANDTE INFORMATIK, 1986, (04): : 161 - 169
  • [40] Differentially Private Data Release through Multidimensional Partitioning
    Xiao, Yonghui
    Xiong, Li
    Yuan, Chun
    SECURE DATA MANAGEMENT, 2010, 6358 : 150 - +