COMBAT: A New Bitmap Index Coding Algorithm for Big Data

被引:1
|
作者
Yinjun Wu [1 ]
Zhen Chen [2 ]
Yuhao Wen [3 ]
Wenxun Zheng [1 ]
Junwei Cao [4 ]
机构
[1] Department of Automation and Tsinghua National Laboratory for Information Science and Technology (TNList), Tsinghua University
[2] iCenter of Tsinghua University
[3] Department of Computer Science, Duke University
[4] Research Institute of Information Technology and Tsinghua National Laboratory for Information Science and Technology (TNList), Tsinghua University
基金
中国国家自然科学基金;
关键词
bitmap index; big data; COMBAT; CONCISE; COMPAX; index encoding; performance evaluation;
D O I
暂无
中图分类号
TP391.41 [];
学科分类号
080203 ;
摘要
Bitmap indexing has been widely used in various applications due to its speed in bitwise operations.However,it can consume large amounts of memory.To solve this problem,various bitmap coding algorithms have been proposed.In this paper,we present COMbining Binary And Ternary encoding(COMBAT),a new bitmap index coding algorithm.Typical algorithms derived from Word Aligned Hybrid(WAH)are COMPressed Adaptive inde X(COMPAX)and Compressed"n"Composable Integer Set(CONCISE),which can combine either two or three continuous words after WAH encoding.COMBAT combines both mechanisms and results in more compact bitmap indexes.Moreover,querying time of COMBAT can be faster than that of COMPAX and CONCISE,since bitmap indexes are smaller and it would take less time to load them into memory.To prove the advantages of COMBAT,we extend a theoretical analysis model proposed by our group,which is composed of the analysis of various possible bitmap indexes.Some experimental results based on real data are also provided,which show COMBAT’s storage and speed superiority.Our results demonstrate the advantages of COMBAT and codeword statistics are provided to solidify the proof.
引用
收藏
页码:136 / 145
页数:10
相关论文
共 50 条
  • [1] COMBAT: A New Bitmap Index Coding Algorithm for Big Data
    Wu, Yinjun
    Chen, Zhen
    Wen, Yuhao
    Zheng, Wenxun
    Cao, Junwei
    [J]. TSINGHUA SCIENCE AND TECHNOLOGY, 2016, 21 (02) : 136 - 145
  • [2] A Survey of Bitmap Index Compression Algorithms for Big Data
    Chen, Zhen
    Wen, Yuhao
    Cao, Junwei
    Zheng, Wenxun
    Chang, Jiahui
    Wu, Yinjun
    Ma, Ge
    Hakmaoui, Mourad
    Peng, Guodong
    [J]. TSINGHUA SCIENCE AND TECHNOLOGY, 2015, 20 (01) : 100 - 115
  • [3] A Survey of Bitmap Index Compression Algorithms for Big Data
    Zhen Chen
    Yuhao Wen
    Junwei Cao
    Wenxun Zheng
    Jiahui Chang
    Yinjun Wu
    Ge Ma
    Mourad Hakmaoui
    Guodong Peng
    [J]. Tsinghua Science and Technology, 2015, 20 (01) : 100 - 115
  • [4] BAH: A Bitmap Index Compression Algorithm for Fast Data Retrieval
    Li, Chenxing
    Chen, Zhen
    Zheng, Wenxun
    Wu, Yinjun
    Cao, Junwei
    [J]. 2016 IEEE 41ST CONFERENCE ON LOCAL COMPUTER NETWORKS (LCN), 2016, : 697 - 705
  • [5] MASC: A Bitmap Index Encoding Algorithm for Fast Data Retrieval
    Wen, Yuhao
    Wang, Han
    Chen, Zhen
    Cao, Junwei
    Peng, Guodong
    Huang, Wen-Liang
    Hu, Ziwei
    Zhou, Jing
    Guo, Jinghong
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2016,
  • [6] A new bitmap index and a new data cube compression technology
    Xi, Jianqing
    Chen, Fuqiang
    Zhang, Pingjian
    [J]. COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2008, PT 2, PROCEEDINGS, 2008, 5073 : 1218 - 1228
  • [7] BreadZip: a Combination of Network Traffic Data and Bitmap Index Encoding Algorithm
    Ma, Ge
    Guo, Zhenhua
    Li, Xiu
    Chen, Zhen
    Cao, Junwei
    Jiang, Yixin
    Guo, Xiaobin
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2014, : 3235 - 3240
  • [8] SECOMPAX: A bitmap index compression algorithm
    Wen, Yuhao
    Chen, Zhen
    Ma, Ge
    Cao, Junwei
    Zheng, Wenxun
    Peng, Guodong
    Li, Shiwei
    Huang, Wen-Liang
    [J]. 2014 23RD INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND NETWORKS (ICCCN), 2014,
  • [9] CAMP: A New Bitmap Index for Data Retrieval in Traffic Archivaln
    Wu, Yinjun
    Chen, Zhen
    Cao, Junwei
    Li, Haoxun
    Li, Chenxing
    Wang, Yijie
    Zheng, Wenxun
    Chang, Jiahui
    Zhou, Jing
    Hu, Ziwei
    Guo, Jinghong
    [J]. IEEE COMMUNICATIONS LETTERS, 2016, 20 (06) : 1128 - 1131
  • [10] A bitmap index for multidimensional data cubes
    Lim, Y
    Kim, M
    [J]. DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2004, 3180 : 349 - 358