Specification-based data reduction in dimensional data warehouses

被引:10
|
作者
Skyt, Janne [1 ]
Jensen, Christian S. [1 ]
Pedersen, Torben Bach [1 ]
机构
[1] Univ Aalborg, Dept Comp Sci, DK-9200 Aalborg, Denmark
关键词
data reduction; data warehousing; multidimensional data; data models; physical deletion;
D O I
10.1016/j.is.2007.06.001
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Many data warehouses contain massive amounts of data, accumulated over long periods of time. In some cases, it is necessary or desirable to either delete "old" data or to maintain the data at an aggregate level. This may be due to privacy concerns, in which case the data are aggregated to levels that ensure anonymity. Another reason is the desire to maintain a balance between the uses of data that change as the data age and the size of the data, thus avoiding overly large data warehouses. This paper presents effective techniques for data reduction that enable the gradual aggregation of detailed data as the data ages. With these techniques, data may be aggregated to higher levels as they age, enabling the maintenance of more compact, consolidated data and the compliance with privacy requirements. Special care is taken to avoid semantic problems in the aggregation process. The paper also describes the querying of the resulting data warehouses and an implementation strategy based on current database technology. (C) 2007 Elsevier BN. All rights reserved.
引用
收藏
页码:36 / 63
页数:28
相关论文
共 50 条
  • [11] Generating test data for specification-based tests via quasirandom sequences
    Chi, Hongmei
    Jones, Edward L.
    Evans, Deidre W.
    Brown, Martin
    [J]. COMPUTATIONAL SCIENCE - ICCS 2006, PT 4, PROCEEDINGS, 2006, 3994 : 773 - 780
  • [12] An architecture for specification-based detection of semantic integrity violations in kernel dynamic data
    Petroni, Nick L., Jr.
    Fraser, Timothy
    Walters, Aaron
    Arbaugh, William A.
    [J]. USENIX ASSOCIATION PROCEEDINGS OF THE 15TH USENIX SECURITY SYMPOSIUM, 2006, : 289 - 304
  • [13] Requirements specification and conceptual modeling for spatial data warehouses
    Malinowski, E.
    Zimanyi, E.
    [J]. ON THE MOVE TO MEANINGFUL INTERNET SYSTEMS 2006: OTM 2006 WORKSHOPS, PT 2, PROCEEDINGS, 2006, 4278 : 1616 - +
  • [14] System specification-based design of cloud data centre and DEVS simulation for availability evaluation
    Kim, Ji-Yeon
    Kim, Hyung-Jong
    [J]. Journal of Research and Practice in Information Technology, 2014, 46 (2-3): : 63 - 75
  • [15] APPROACHES TO SPECIFICATION-BASED TESTING
    RICHARDSON, DJ
    OMALLEY, O
    TITTLE, C
    [J]. PROCEEDINGS OF THE ACM SIGSOFT 89: THIRD SYMPOSIUM ON SOFTWARE TESTING, ANALYSIS, AND VERIFICATION ( TAV 3 ), 1989, 14 : 86 - 96
  • [16] Specification-based Protocol Obfuscation
    Duchene, Julien
    Alata, Eric
    Nicomette, Vincent
    Kaaniche, Mohamed
    Le Guernic, Colas
    [J]. 2018 48TH ANNUAL IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS (DSN), 2018, : 478 - 489
  • [17] A framework for specification-based testing
    Stocks, P
    Carrington, D
    [J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1996, 22 (11) : 777 - 793
  • [18] Specification-based testing for refinement
    Kahsai, Temesghen
    Roggenbach, Markus
    Schlingloff, Bernd-Holger
    [J]. SEFM 2007: FIFTH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND FORMAL METHODS, PROCEEDINGS, 2007, : 237 - +
  • [19] An Empirical Evaluation of Test Suite Reduction for Boolean Specification-based Testing
    Zhang, Xiaofang
    Xu, Baowen
    Chen, Zhenyu
    Nie, Changhai
    Li, Leifang
    [J]. QSIC 2008: PROCEEDINGS OF THE EIGHTH INTERNATIONAL CONFERENCE ON QUALITY SOFTWARE, 2008, : 270 - 275
  • [20] Model Driven Dimensional Modeling of Data Warehouses
    Liu, J.
    Liu, S. Z.
    [J]. ITESS: 2008 PROCEEDINGS OF INFORMATION TECHNOLOGY AND ENVIRONMENTAL SYSTEM SCIENCES, PT 2, 2008, : 977 - 982