Load balancing and data placement for multi-tiered database systems

被引:3
|
作者
Li, Wen-Syan
Zilio, Daniel C.
Batra, Vishal S.
Zuzarte, Calisto
Narang, Inderpal
机构
[1] IBM Corp, Almaden Res Ctr, Dept Comp Sci, San Jose, CA 95120 USA
[2] IBM Canada Ltd, Markham, ON L6G 1C7, Canada
[3] Indian Inst Technol, IBM India Res Lab, New Delhi 110016, India
关键词
OLAP; materialized views; data placement; performance; caching;
D O I
10.1016/j.datak.2006.11.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A materialized view or Materialized Query Table (MQT) is an auxiliary table with precomputed data that can be used to significantly improve the performance of a database query. A Materialized Query Table Advisor (MQTA) is often used to recommend and create MQTs. The state-of-the-art MQTA works in a standalone database server where MQTs are placed on the same server as that in which the base tables are located. The MQTA does not apply to a federated or scaleout scenario in which MQTs need to be placed on other servers close to applications (i.e. a frontend database server) for off-loading the workload on the backend database server. In this paper, we propose a Data Placement Advisor (DPA) and load balancing strategies for multi-tiered database systems. Built on top of the MQTA, DPA recommends MQTs and advises placement strategies for minimizing the response time for a query workload. To demonstrate the benefit of the data placement advising, we implemented a prototype of DPA that works with the MQTA in the IBM (R) DB2 (R) Universal Database(TM) (DB2 UDB) and the IBM WebSphere (R) Information Integrator (WebSphere II). The evaluation results showed substantial improvements of workload response times when MQTs are intelligently recommended and placed on a frontend database server subject to space and load characteristics for TPC-H and OLAP type workloads. (C) 2006 Elsevier B.V. All rights reserved.
引用
收藏
页码:523 / 546
页数:24
相关论文
共 50 条
  • [1] A Load-Balancing Data Caching Scheme in Multi-tiered Storage Systems
    Chang, Hsung-Pin
    Luo, Jhih-Cheng
    Chang, Da-Wei
    [J]. PROCEEDINGS OF 2016 IEEE 18TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS; IEEE 14TH INTERNATIONAL CONFERENCE ON SMART CITY; IEEE 2ND INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (HPCC/SMARTCITY/DSS), 2016, : 124 - +
  • [2] Adaptive Data Placement in Multi-Tiered Data Staging Runtime
    Jin, Tong
    Sun, Qian
    Romanus, Melissa
    Parashar, Manish
    [J]. NEW FRONTIERS IN HIGH PERFORMANCE COMPUTING AND BIG DATA, 2017, 30 : 175 - 196
  • [3] Predicting file lifetimes for data placement in multi-tiered storage systems for HPC
    Thomas, Luis
    Gougeaud, Sebastien
    Rubini, Stephane
    Deniel, Philippe
    Boukhobza, Jalil
    [J]. OPERATING SYSTEMS REVIEW, 2021, 55 (01) : 99 - 107
  • [4] Multi-tiered database clusters
    Kripac, Miroslav
    Brandejs, Michal
    [J]. 3rd International Conference on Computing, Communications and Control Technologies, Vol 1, Proceedings, 2005, : 113 - 117
  • [5] Predicting file lifetimes for data placement in multi-Tiered storage systems for HPC
    Thomas, Luis
    Gougeaud, Sebastien
    Rubini, Stéphane
    Deniel, Philippe
    Boukhobza, Jalil
    [J]. Operating Systems Review (ACM), 2021, 55 (01): : 99 - 107
  • [6] On Automated Feedback-Driven Data Placement in Multi-tiered Memory
    Effler, T. Chad
    Howard, Adam P.
    Zhou, Tong
    Jantz, Michael R.
    Doshi, Kshitij A.
    Kulkarni, Prasad A.
    [J]. ARCHITECTURE OF COMPUTING SYSTEMS, 2018, 10793 : 181 - 194
  • [7] Cost-Aware Region-Level Data Placement in Multi-Tiered Parallel I/O Systems
    He, Shuibing
    Wang, Yang
    Li, Zheng
    Sun, Xian-He
    Xu, Chenzhong
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2017, 28 (07) : 1853 - 1865
  • [8] Data Jockey: Automatic Data Management for HPC Multi-Tiered Storage Systems
    Shin, Woong
    Brumgard, Christopher D.
    Xie, Bing
    Vazhkudai, Sudharshan S.
    Ghoshal, Devarshi
    Oral, Sarp
    Ramakrishnan, Lavanya
    [J]. 2019 IEEE 33RD INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2019), 2019, : 511 - 522
  • [9] TripS: Automated Multi-tiered Data Placement in a Geo-distributed Cloud Environment
    Oh, Kwangsung
    Chandra, Abhishek
    Weissman, Jon
    [J]. SYSTOR'17: PROCEEDINGS OF THE 10TH ACM INTERNATIONAL SYSTEMS AND STORAGE CONFERENCE, 2017,
  • [10] A Prefetching Scheme for Multi-tiered Storage Systems
    Chang, Hsung-Pin
    Chen, Chia-Yu
    Liu, Chien-Yi
    [J]. 2018 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI), 2018, : 1582 - 1586