Specification-based data reduction in dimensional data warehouses

被引:10
|
作者
Skyt, Janne [1 ]
Jensen, Christian S. [1 ]
Pedersen, Torben Bach [1 ]
机构
[1] Univ Aalborg, Dept Comp Sci, DK-9200 Aalborg, Denmark
关键词
data reduction; data warehousing; multidimensional data; data models; physical deletion;
D O I
10.1016/j.is.2007.06.001
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Many data warehouses contain massive amounts of data, accumulated over long periods of time. In some cases, it is necessary or desirable to either delete "old" data or to maintain the data at an aggregate level. This may be due to privacy concerns, in which case the data are aggregated to levels that ensure anonymity. Another reason is the desire to maintain a balance between the uses of data that change as the data age and the size of the data, thus avoiding overly large data warehouses. This paper presents effective techniques for data reduction that enable the gradual aggregation of detailed data as the data ages. With these techniques, data may be aggregated to higher levels as they age, enabling the maintenance of more compact, consolidated data and the compliance with privacy requirements. Special care is taken to avoid semantic problems in the aggregation process. The paper also describes the querying of the resulting data warehouses and an implementation strategy based on current database technology. (C) 2007 Elsevier BN. All rights reserved.
引用
收藏
页码:36 / 63
页数:28
相关论文
共 50 条
  • [41] SPECIFICATION-BASED SOFTWARE ENGINEERING WITH TAGS
    SIEVERT, GE
    MIZELL, TA
    COMPUTER, 1985, 18 (04) : 56 - 65
  • [42] Specification-based testing of user interfaces
    Paiva, ACR
    Faria, JCP
    Vidal, RFAM
    INTERACTIVE SYSTEMS: DESIGN, SPECIFICATION, AND VERIFICATION, 2003, 2844 : 139 - 153
  • [43] Efficient specification-based component retrieval
    Penix J.
    Alexander P.
    Automated Software Engineering, 1999, 6 (2) : 139 - 170
  • [44] Agent based dynamic data storage and distribution in data warehouses
    Kolsi, Nader
    Abdellatif, Abdelaziz
    Ghedira, Khaled
    AGENT AND MULTI-AGENT SYSTEMS: TECHNOLOGIES AND APPLICATIONS, PROCEEDINGS, 2007, 4496 : 375 - +
  • [45] Flash Memory SSD based Data Management for Data Warehouses and Data Marts
    Rizvi, Sanam Shahla
    Chung, Tae-Sun
    ICCIT: 2009 FOURTH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCES AND CONVERGENCE INFORMATION TECHNOLOGY, VOLS 1 AND 2, 2009, : 858 - 860
  • [46] A Framework for Formal Specification Considering Review and Specification-Based Testing
    Nakatsugawa, Yasumasa
    Kurita, Taro
    Araki, Keijiro
    TENCON 2010: 2010 IEEE REGION 10 CONFERENCE, 2010, : 2444 - 2448
  • [47] Specification-Based Program Repair Using SAT
    Gopinath, Divya
    Malik, Muhammad Zubair
    Khurshid, Sarfraz
    TOOLS AND ALGORITHMS FOR THE CONSTRUCTION AND ANALYSIS OF SYSTEMS, 2011, 6605 : 173 - 188
  • [48] Specification-based program slicing and its applications
    Lee, WK
    Chung, IS
    Yoon, GS
    Kwon, YR
    JOURNAL OF SYSTEMS ARCHITECTURE, 2001, 47 (05) : 427 - 443
  • [49] Specification-based Testing for Software Product Lines
    Kahsai, Temesghen
    Roggenbach, Markus
    Schlingloff, Bernd-Holger
    SEFM 2008: SIXTH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND FORMAL METHODS, PROCEEDINGS, 2008, : 149 - +
  • [50] Specification-Based Autonomous Driving System Testing
    Zhou, Yuan
    Sun, Yang
    Tang, Yun
    Chen, Yuqi
    Sun, Jun
    Poskitt, Christopher M. M.
    Liu, Yang
    Yang, Zijiang
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2023, 49 (06) : 3391 - 3410