Answering ad hoc aggregate queries from data streams using prefix aggregate trees

被引:0
|
作者
Moonjung Cho
Jian Pei
Ke Wang
机构
[1] State University of New York at Buffalo,Department of Computer Science and Engineering
[2] Simon Fraser University,School of Computing Science
[3] 8888 University Drive,undefined
来源
关键词
Data warehousing; Data cube; Data stream; Online analytic processing (OLAP); Aggregate query;
D O I
暂无
中图分类号
学科分类号
摘要
In some business applications such as trading management in financial institutions, it is required to accurately answer ad hoc aggregate queries over data streams. Materializing and incrementally maintaining a full data cube or even its compression or approximation over a data stream is often computationally prohibitive. On the other hand, although previous studies proposed approximate methods for continuous aggregate queries, they cannot provide accurate answers. In this paper, we develop a novel prefix aggregate tree (PAT) structure for online warehousing data streams and answering ad hoc aggregate queries. Often, a data stream can be partitioned into the historical segment, which is stored in a traditional data warehouse, and the transient segment, which can be stored in a PAT to answer ad hoc aggregate queries. The size of a PAT is linear in the size of the transient segment, and only one scan of the data stream is needed to create and incrementally maintain a PAT. Although the query answering using PAT costs more than the case of a fully materialized data cube, the query answering time is still kept linear in the size of the transient segment. Our extensive experimental results on both synthetic and real data sets illustrate the efficiency and the scalability of our design.
引用
收藏
页码:301 / 329
页数:28
相关论文
共 50 条
  • [1] Answering ad hoc aggregate queries from data streams using prefix aggregate trees
    Cho, Moonjung
    Pei, Jian
    Wang, Ke
    KNOWLEDGE AND INFORMATION SYSTEMS, 2007, 12 (03) : 301 - 329
  • [2] Answering ad-hoc continuous aggregate queries over data streams using Dynamic Prefix Aggregate Tree
    Safaei, Ali Asghar
    Mosaferi, Mehdi
    Abdi, Fatemeh
    INTELLIGENT DATA ANALYSIS, 2016, 20 (06) : 1351 - 1384
  • [3] Answering Range-Aggregate Queries over Objects Generating Data Streams
    Gorawski, Marcin
    Malczok, Rafal
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PT II, PROCEEDINGS, 2010, 5982 : 436 - 439
  • [4] A Neural Database for Answering Aggregate Queries on Incomplete Relational Data
    Zeighami, Sepanta
    Seshadri, Raghav
    Shahabi, Cyrus
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (07) : 2790 - 2802
  • [5] Capturing Continuous Data and Answering Aggregate Queries in Probabilistic XML
    Abiteboul, Serge
    Chan, T. -H. Hubert
    Kharlamov, Evgeny
    Nutt, Werner
    Senellart, Pierre
    ACM TRANSACTIONS ON DATABASE SYSTEMS, 2011, 36 (04):
  • [6] μWheel: Aggregate Management for Streams and Queries
    Meldrurn, Max
    Carbone, Paris
    PROCEEDINGS OF THE 18TH ACM INTERNATIONAL CONFERENCE ON DISTRIBUTED AND EVENT-BASED SYSTEMS, DEBS 2024, 2024, : 54 - 65
  • [7] Answering joint queries from multiple aggregate OLAP databases
    Pourabbas, E
    Shoshani, A
    DATA WAREHOUSING AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2003, 2737 : 24 - 34
  • [8] Supporting aggregate queries over ad-hoc wireless sensor networks
    Madden, S
    Szewczyk, R
    Franklin, MJ
    Culler, D
    FOURTH IEEE WORKSHOP ON MOBILE COMPUTING SYSTEMS AND APPLICATIONS, PROCEEDINGS, 2002, : 49 - 58
  • [9] Estimating aggregate join queries over data streams using discrete cosine transform
    Jiang, Zhewei
    Luo, Cheng
    Hou, Wen-Chi
    Yan, Feng
    Zhu, Qiang
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2006, 4080 : 182 - 192
  • [10] Answering approximate range aggregate queries on OLAP data cubes with probabilistic guarantees
    Cuzzocrea, A
    Wang, W
    Matrangolo, U
    DATA WAREHOUSING AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2004, 3181 : 97 - 107