Selection of views to materialize in a data warehouse

被引:119
|
作者
Gupta, H [1 ]
Mumick, IS
机构
[1] SUNY Stony Brook, Dept Comp Sci, Stony Brook, NY 11794 USA
[2] Kirusa Inc, Edison, NJ 08817 USA
关键词
views; view selection; data warehouse; materialization;
D O I
10.1109/TKDE.2005.16
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A data warehouse stores materialized views of data from one or more sources, with the purpose of efficiently implementing decision-support or OLAP queries. One of the most important decisions in designing a data warehouse is the selection of materialized views to be maintained at the warehouse. The goal is to select an appropriate set of views that minimizes total query response time and the cost of maintaining the selected views, given a limited amount of resource, e. g., materialization time, storage space, etc. In this article, we have developed a theoretical framework for the general problem of selection of views in a data warehouse. We present polynomial-time heuristics for a selection of views to optimize total query response time under a disk-space constraint, for some important special cases of the general data warehouse scenario, viz.: 1) an AND view graph, where each query/view has a unique evaluation, e.g., when a multiple-query optimizer can be used to general a global evaluation plan for the queries, and 2) an OR view graph, in which any view can be computed from any one of its related views, e. g., data cubes. We present proofs showing that the algorithms are guaranteed to provide a solution that is fairly close to (within a constant factor ratio of) the optimal solution. We extend our heuristic to the general AND-OR view graphs. Finally, we address in detail the view-selection problem under the maintenance cost constraint and present provably competitive heuristics.
引用
收藏
页码:24 / 43
页数:20
相关论文
共 50 条
  • [1] Selection of views to materialize in a data warehouse
    Gupta, H
    [J]. DATABASE THEORY - ICDT'97, 1997, 1186 : 98 - 112
  • [2] The Model and Realization of Materialized Views Selection in Data Warehouse
    Zhou, Lijuan
    Wu, Minhua
    Ge, Xuebin
    [J]. FIFTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 5, PROCEEDINGS, 2008, : 406 - 411
  • [3] Efficient approaches for materialized views selection in a data warehouse
    Hung, Ming-Chuan
    Huang, Man-Lin
    Yang, Don-Lin
    Hsueh, Nien-Lin
    [J]. INFORMATION SCIENCES, 2007, 177 (06) : 1333 - 1348
  • [4] Selection of views to materialize using simulated annealing algorithms
    Zhou, LJ
    Liu, C
    Wang, HF
    Liu, DX
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY: THEORY, TOOLS AND TECHNOLOGY IV, 2002, 4730 : 29 - 33
  • [5] Selection of views to materialize under a maintenance cost constraint
    Gupta, H
    Mumick, IS
    [J]. DATABASE THEORY - ICDT'99, 1999, 1540 : 453 - 470
  • [6] An evolutionary approach to materialized views selection in a data warehouse environment
    Zhang, C
    Yao, X
    Yang, J
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2001, 31 (03): : 282 - 294
  • [7] Selecting materialized views in a data warehouse
    Zhou, LJ
    Liu, C
    Liu, DX
    [J]. STORAGE AND RETRIEVAL FOR MEDIA DATABASES 2003, 2003, 5021 : 456 - 461
  • [8] Algorithms for Selecting Materialized Views in a Data Warehouse
    Yousri, Noha A. R.
    Ahmed, Khalil M.
    El-Makky, Nagwa M.
    [J]. 3RD ACS/IEEE INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS, 2005, 2005,
  • [9] Research on Materialized Views Technology in Data Warehouse
    Zhou, Lijuan
    Xu, Min
    Shi, Qian
    Hao, Zhongxiao
    [J]. 2008 IEEE INTERNATIONAL SYMPOSIUM ON KNOWLEDGE ACQUISITION AND MODELING WORKSHOP PROCEEDINGS, VOLS 1 AND 2, 2008, : 1030 - +
  • [10] Materialized View Selection in the Data Warehouse
    Zhou Lijuan
    Geng Haijun
    Xu Mingsheng
    [J]. APPLIED MECHANICS AND MECHANICAL ENGINEERING, PTS 1-3, 2010, 29-32 : 1133 - 1138