COMBAT: A New Bitmap Index Coding Algorithm for Big Data

被引:1
|
作者
Yinjun Wu [1 ]
Zhen Chen [2 ]
Yuhao Wen [3 ]
Wenxun Zheng [1 ]
Junwei Cao [4 ]
机构
[1] Department of Automation and Tsinghua National Laboratory for Information Science and Technology (TNList), Tsinghua University
[2] iCenter of Tsinghua University
[3] Department of Computer Science, Duke University
[4] Research Institute of Information Technology and Tsinghua National Laboratory for Information Science and Technology (TNList), Tsinghua University
基金
中国国家自然科学基金;
关键词
bitmap index; big data; COMBAT; CONCISE; COMPAX; index encoding; performance evaluation;
D O I
暂无
中图分类号
TP391.41 [];
学科分类号
080203 ;
摘要
Bitmap indexing has been widely used in various applications due to its speed in bitwise operations.However,it can consume large amounts of memory.To solve this problem,various bitmap coding algorithms have been proposed.In this paper,we present COMbining Binary And Ternary encoding(COMBAT),a new bitmap index coding algorithm.Typical algorithms derived from Word Aligned Hybrid(WAH)are COMPressed Adaptive inde X(COMPAX)and Compressed"n"Composable Integer Set(CONCISE),which can combine either two or three continuous words after WAH encoding.COMBAT combines both mechanisms and results in more compact bitmap indexes.Moreover,querying time of COMBAT can be faster than that of COMPAX and CONCISE,since bitmap indexes are smaller and it would take less time to load them into memory.To prove the advantages of COMBAT,we extend a theoretical analysis model proposed by our group,which is composed of the analysis of various possible bitmap indexes.Some experimental results based on real data are also provided,which show COMBAT’s storage and speed superiority.Our results demonstrate the advantages of COMBAT and codeword statistics are provided to solidify the proof.
引用
收藏
页码:136 / 145
页数:10
相关论文
共 50 条
  • [41] HVSM: A new sequential pattern mining algorithm using bitmap representation
    Song, S
    Hu, HP
    Jin, SY
    [J]. ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2005, 3584 : 455 - 463
  • [42] Noiseless coding of VQ index using index grouping algorithm
    Hsieh, CH
    Tsai, JC
    Lu, PC
    [J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 1996, 44 (12) : 1643 - 1648
  • [43] A New Rockburst Experiment Data Compression Storage Algorithm Based on Big Data Technology
    Zhang, Yu
    Wang, Yan-Ge
    Bai, Yan-Ping
    Li, Yong-Zhen
    Lv, Zhao-Yong
    Ding, Hong-Wei
    [J]. INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2019, 25 (03): : 561 - 572
  • [44] Effective Clustering Analysis Based on New Designed Clustering Validity Index and Revised K-means Algorithm for Big Data
    Zhu, Erzhou
    Wen, Peng
    Zhu, Binbin
    Liu, Feng
    Wang, Futian
    Li, Xuejun
    [J]. 2018 IEEE INT CONF ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, UBIQUITOUS COMPUTING & COMMUNICATIONS, BIG DATA & CLOUD COMPUTING, SOCIAL COMPUTING & NETWORKING, SUSTAINABLE COMPUTING & COMMUNICATIONS, 2018, : 96 - 102
  • [45] A New Particle Swarm Optimization Algorithm for Optimizing Big Data Clustering
    Hashemi S.E.
    Tavana M.
    Bakhshi M.
    [J]. SN Computer Science, 3 (4)
  • [46] New mixed adaptive detection algorithm for moving target with big data
    Zhang, De-Gan
    Zhou, Shan
    Chen, Jie
    Liu, Si
    [J]. JOURNAL OF VIBROENGINEERING, 2016, 18 (07) : 4705 - 4719
  • [47] β Algorithm: A New Probabilistic Process Learning Approach for Big Data in Healthcare
    Zayoud, Maha
    Kotb, Yehia
    Ionescu, Sorin
    [J]. IEEE ACCESS, 2019, 7 : 78842 - 78869
  • [48] A hybrid index for temporal big data
    Wang, Mei
    Xiao, Meng
    Peng, Sancheng
    Liu, Guohua
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2017, 72 : 264 - 272
  • [49] Combat COVID-19 with artificial intelligence and big data
    Lin, Leesa
    Hou, Zhiyuan
    [J]. JOURNAL OF TRAVEL MEDICINE, 2020, 27 (05)
  • [50] An index coding algorithm for image vector quantization
    Chen, PY
    Chen, RD
    [J]. IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2003, 49 (04) : 1513 - 1520