Sparse prefix sums: Constant-time range sum queries over sparse multidimensional data cubes

被引:1
|
作者
Shekelyan, Michael [1 ]
Dignos, Anton [1 ]
Gamper, Johann [1 ]
机构
[1] Free Univ Bozen Bolzano, Piazza Domenicani 3, I-39100 Bolzano, Italy
关键词
Query processing - Grid computing - Digital storage - Table lookup;
D O I
10.1016/j.is.2018.06.009
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Prefix sums are a powerful technique to answer range-sum queries over multi-dimensional arrays in O(1) time by looking up a constant number of values in an array of size O(N), where N is the number of cells in the multi-dimensional array. However, the technique suffers from O(N) update and storage costs. Relative prefix sums address the high update costs by partitioning the array into blocks, thereby breaking the dependency between cells. In this paper, we present sparse prefix sums that exploit data sparsity to reduce the high storage costs of relative prefix sums. By building upon relative prefix sums, sparse prefix sums achieve the same update complexity as relative prefix sums. The authors of relative prefix sums erroneously claimed that the update complexity is O(root N) for any number of dimensions. We show that this claim holds only for two dimensions, whereas the correct complexity for an arbitrary number of d dimensions is O(Nd-1/d). To reduce the storage costs, the sparse prefix sums technique exploits sparsity in the data and avoids to materialize prefix sums for empty rows and columns in the data grid; instead, look-up tables are used to preserve constant query time. Sparse prefix sums are the first approach to achieve O(1) query time with sub-linear storage costs for range-sum queries over sparse low-dimensional arrays. A thorough experimental evaluation shows that the approach works very well in practice, On the tested real-world data sets the storage costs are reduced by an order of magnitude with only a small overhead in query time, thus preserving microsecond-fast query answering. (C) 2018 Elsevier Ltd. All rights reserved.
引用
收藏
页码:136 / 147
页数:12
相关论文
共 20 条
  • [1] Range top/bottom k queries in OLAP sparse data cubes
    Luo, ZW
    Ling, TW
    Ang, CH
    Lee, SY
    Cui, B
    [J]. DATABASE AND EXPERT SYSTEMS APPLICATIONS, 2001, 2113 : 678 - 687
  • [2] Range sum queries in dynamic OLAP data cubes
    Li, HG
    Ling, TW
    Lee, SY
    Loh, ZX
    [J]. PROCEEDINGS OF THE THIRD INTERNATIONAL SYMPOSIUM ON COOPERATIVE DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2000, : 74 - 81
  • [3] Approximate range-sum queries over data cubes using cosine transform
    Hou, Wen-Chi
    Luo, Cheng
    Jiang, Zhewei
    Yan, Feng
    Zhu, Qiang
    [J]. DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2008, 5181 : 376 - +
  • [4] Constant-Time Approximation Algorithms for the Optimum Branching Problem on Sparse Graphs
    Kusumoto, Mitsuru
    Yoshida, Yuichi
    Ito, Hiro
    [J]. 2012 THIRD INTERNATIONAL CONFERENCE ON NETWORKING AND COMPUTING (ICNC 2012), 2012, : 407 - 413
  • [5] Efficient Range-Sum Queries along Dimensional Hierarchies in Data Cubes
    Lauer, Tobias
    Mai, Dominic
    Hagedorn, Philippe
    [J]. 2009 FIRST INTERNATIONAL CONFERENCE ON ADVANCES IN DATABASES, KNOWLEDGE, AND DATA APPLICATIONS, 2009, : 7 - +
  • [6] Secure multidimensional range queries over outsourced data
    Hore, Bijit
    Mehrotra, Sharad
    Canim, Mustafa
    Kantarcioglu, Murat
    [J]. VLDB JOURNAL, 2012, 21 (03): : 333 - 358
  • [7] Secure multidimensional range queries over outsourced data
    Bijit Hore
    Sharad Mehrotra
    Mustafa Canim
    Murat Kantarcioglu
    [J]. The VLDB Journal, 2012, 21 : 333 - 358
  • [8] Processing range top-k queries in a sparse data cube
    Hong, S
    Moon, B
    Lee, S
    [J]. IKE '04: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE ENGNINEERING, 2004, : 282 - 287
  • [9] A Fast Algorithm for Range Sum Queries over Data Stream
    Wang Kaojie
    Zheng Xuefeng
    Xu Hong
    [J]. 2009 ISECS INTERNATIONAL COLLOQUIUM ON COMPUTING, COMMUNICATION, CONTROL, AND MANAGEMENT, VOL III, 2009, : 86 - +
  • [10] What Graph Properties Are Constant-Time Testable?Dense Graphs, Sparse Graphs, and Complex Networks
    Hiro Ito
    [J]. The Review of Socionetwork Strategies, 2019, 13 : 101 - 121