Compressed Bit vectors Based on Variable-to-Fixed Encodings

被引:2
|
作者
Jo, Seungbum [1 ]
Joannou, Stelios [2 ]
Okanohara, Daisuke [3 ]
Raman, Rajeev [2 ]
Satti, Srinivasa Rao [1 ]
机构
[1] Seoul Natl Univ, Dept Comp Sci & Engn, 1 Gwanak Ro, Seoul, South Korea
[2] Univ Leicester, Dept Informat, Univ Rd, Gb Leicester LE1 7RH, England
[3] Preferred Infrastruct, Chiyoda Ku, 1-6-1 Otemachi, Tokyo, Japan
来源
COMPUTER JOURNAL | 2017年 / 60卷 / 05期
关键词
bitvector; rank and select; variable-to-fixed encoding; entropy; DICTIONARIES;
D O I
10.1093/comjnl/bxw103
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
We consider practical implementations of compressed bitvectors, which support rank and select operations on a given bit-string, while storing the bit-string in compressed form. Our approach relies on variable-to-fixed encodings of the bit-string, an approach that has not yet been considered systematically for practical encodings of bitvectors. We show that this approach leads to fast practical implementations with low redundancy (i.e. the space used by the bitvector in addition to the compressed representation of the bit-string), and is a flexible and promising solution to the problem of supporting rank and select on moderately compressible bit-strings, such as those encountered in real-world applications.
引用
收藏
页码:761 / 775
页数:15
相关论文
共 50 条
  • [1] Compressed bit vectors based on variable-to-fixed encodings
    Jo, Seungbum
    Joannou, Stelios
    Okanohara, Daisuke
    Raman, Rajeev
    Satti, Srinivasa Rao
    2014 DATA COMPRESSION CONFERENCE (DCC 2014), 2014, : 409 - 409
  • [2] Improved Variable-to-Fixed Length Codes
    Klein, Shmuel T.
    Shapira, Dana
    STRING PROCESSING AND INFORMATION RETRIEVAL, PROCEEDINGS, 2008, 5280 : 39 - +
  • [3] Code compression using variable-to-fixed coding based on arithmetic coding
    Xie, Y
    Wolf, W
    Lekatsas, H
    DCC 2003: DATA COMPRESSION CONFERENCE, PROCEEDINGS, 2003, : 382 - 391
  • [4] Variable-to-fixed length codes for predictable sources
    Savari, SA
    DCC '98 - DATA COMPRESSION CONFERENCE, 1998, : 481 - 490
  • [5] Universal variable-to-fixed length source codes
    Visweswariah, K
    Kulkarni, SR
    Verdú, S
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2001, 47 (04) : 1461 - 1472
  • [6] Variable-to-fixed length codes and the conservation of entropy
    Savari, SA
    IEEE TRANSACTIONS ON INFORMATION THEORY, 1999, 45 (05) : 1612 - 1620
  • [7] A UNIVERSAL VARIABLE-TO-FIXED LENGTH SOURCE CODE BASED ON LAWRENCE ALGORITHM
    TJALKENS, TJ
    WILLEMS, FMJ
    IEEE TRANSACTIONS ON INFORMATION THEORY, 1992, 38 (02) : 247 - 253
  • [8] A Study on the Overflow Probability of Variable-to-Fixed Length Codes
    Kuzuoka, Shigeaki
    PROCEEDINGS OF 2020 INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY AND ITS APPLICATIONS (ISITA2020), 2020, : 26 - 30
  • [9] Variable-to-fixed length codes and plurally parsable dictionaries
    Savari, SA
    DCC '99 - DATA COMPRESSION CONFERENCE, PROCEEDINGS, 1999, : 453 - 462
  • [10] Selective Compression Technique Using Variable-to-Fixed Coding
    Jacob, Karen Thangam
    Kumar, K. S. Ganesh
    Manjurathi, B.
    2014 INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND SIGNAL PROCESSING (ICCSP), 2014,