Optimizing Query Execution for Variable-Aligned Length Compression of Bitmap Indices

被引:3
|
作者
Slechta, Ryan [1 ]
Sawin, Jason [1 ]
McCamish, Ben [2 ]
Chiu, David [3 ]
Canahuate, Guadalupe [4 ]
机构
[1] Univ St Thomas, St Paul, MN 55105 USA
[2] Washington State Univ, Pullman, WA 99164 USA
[3] Univ Puget Sound, Tacoma, WA 98416 USA
[4] Univ Iowa, Iowa City, IA 52242 USA
关键词
bitmap indices; bitmap compression; query execution;
D O I
10.1145/2628194.2628252
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Indexing is a fundamental mechanism for efficient data access. Recently, we proposed the Variable-Aligned Length (VAL) bitmap index encoding framework, which generalizes the commonly used word-aligned compression techniques. VAL presented a variable-aligned compression framework, which allows columns of a bitmap to be compressed using different encoding lengths. This flexibility creates a tunable compression that balances the trade-off between space and query processing time. The variable format of VAL presents several unique opportunities for query optimization. In this paper we explore multiple algorithms to optimize both point queries and range queries in VAL. In particular, we propose a dynamic encoding-length translation heuristic to process point queries. For range queries, we propose several column orderings based on the bitmap's metadata: largest segment length first (lsf), column size (size), and weighted size (ws). In our empirical study over both real and synthetic data sets, we show that our dynamic translation selection scheme produces query execution times only 3:5% below the optimal. We also found that the weighted size column ordering significantly and consistently out-performs other ordering techniques. Finally, we show that algorithms scale to data sets that are row-ordered.
引用
收藏
页码:217 / 226
页数:10
相关论文
共 6 条
  • [1] Optimizing bitmap indices with efficient compression
    Wu, Kesheng
    Otoo, Ekow J.
    Shoshani, Arie
    [J]. ACM TRANSACTIONS ON DATABASE SYSTEMS, 2006, 31 (01): : 1 - 38
  • [2] Fault-Tolerant Query Execution over Distributed Bitmap Indices
    Burdick, Sam
    Risner, Jahrme
    Chiu, David
    Sawin, Jason
    [J]. 2018 IEEE/ACM 5TH INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING APPLICATIONS AND TECHNOLOGIES (BDCAT), 2018, : 21 - 30
  • [3] Performance evaluation of word-aligned compression methods for bitmap indices
    Guzun, Gheorghi
    Canahuate, Guadalupe
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2016, 48 (02) : 277 - 304
  • [4] Variable Length Compression of Codeword Indices for Lossy Compression
    Baron, Dror
    Jacob, Theju
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2012, 19 (12) : 849 - 852
  • [5] Very efficient variable-length codes for the lossless compression of VQ indices
    Chin-Chen Chang
    Tzu-Chuen Lu
    Gwoboa Horng
    Ying-Hsuan Huang
    [J]. Multimedia Tools and Applications, 2016, 75 : 3537 - 3552
  • [6] Very efficient variable-length codes for the lossless compression of VQ indices
    Chang, Chin-Chen
    Lu, Tzu-Chuen
    Homg, Gwoboa
    Huang, Ying-Hsuan
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (06) : 3537 - 3552