Alphabet Partitioning for Compressed Rank/Select and Applications

被引:0
|
作者
Barbay, Jeremy [1 ]
Gagie, Travis [1 ]
Navarro, Gonzalo [1 ]
Nekrich, Yakov [2 ]
机构
[1] Univ Chile, Dept Comp Sci, Santiago, Chile
[2] Univ Bonn, Dept Comp Sci, Bonn, Germany
来源
关键词
SEQUENCES;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We present a data structure that stores a string s[1..n] over the alphabet [1..sigma] in nH(0)(s) o(n)(H-0(s)+1) bits, where H-0(s) is the zero-order entropy of s. This data structure supports the queries access and rank in time O (lg lg sigma), and the select query in constant time. This result improves on previously known data structures using nH(0)(s) + o(n lg sigma) bits, where on highly compressible instances the redundancy o(n lg sigma) cease to be negligible compared to the nH(0)(s) bits that encode the data. The technique is based on combining previous results through an ingenious partitioning of the alphabet, and practical enough to be implementable. It applies not only to strings, but also to several other compact data structures. For example, we achieve (i) faster search times and lower redundancy for the smallest existing full-text self-index; (ii) compressed permutations pi with times for pi() and pi(-1) improved to log-logarithmic; and (iii) the first compressed representation of dynamic collections of disjoint sets.
引用
收藏
页码:315 / +
页数:2
相关论文
共 50 条
  • [21] Did Evolution Select a Nonrandom "Alphabet" of Amino Acids?
    Philip, Gayle K.
    Freeland, Stephen J.
    ASTROBIOLOGY, 2011, 11 (03) : 235 - 240
  • [22] Compressed Encoding for Rank Modulation
    Gad, Eyal En
    Jiang, Anxiao
    Bruck, Jehoshua
    2011 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY PROCEEDINGS (ISIT), 2011, : 884 - 888
  • [23] Processing of Huffman compressed texts with a super-alphabet
    Fredriksson, K
    Tarhio, J
    STRING PROCESSING AND INFORMATION RETRIEVAL, PROCEEDINGS, 2003, 2857 : 108 - 121
  • [24] Rank and Select for Succinct Data Structures
    Farina, Antonio
    Ladra, Susana
    Pedreira, Oscar
    Places, Angeles S.
    ELECTRONIC NOTES IN THEORETICAL COMPUTER SCIENCE, 2009, 236 (0C) : 131 - 145
  • [25] Broadword implementation of rank/select queries
    Vigna, Sebastiano
    EXPERIMENTAL ALGORITHMS, PROCEEDINGS, 2008, 5038 : 154 - 168
  • [26] Rank and select: Another lesson learned
    Grabowski, Szymon
    Raniszewski, Marcin
    INFORMATION SYSTEMS, 2018, 73 : 25 - 34
  • [27] Compressed Sensing with Rank Deficient Dictionaries
    Hansen, T. L.
    Johansen, D. H.
    Jorgensen, P. B.
    Trillingsgaard, K. F.
    Arildsen, T.
    Fyhn, K.
    Larsen, T.
    2012 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2012, : 3594 - 3599
  • [28] Weighted Compressed Sensing and Rank Minimization
    Oymak, Samet
    Khajehnejad, M. Amin
    Hassibi, Babak
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 3736 - 3739
  • [29] Alphabet partitioning techniques for semiadaptive Huffman coding of large alphabets
    Chen, Dan
    Chiang, Yi-Jen
    Memon, Nasir
    Wu, Xiaolin
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2007, 55 (03) : 436 - 443
  • [30] Optimal lower bounds for rank and select indexes
    Golynski, Alexander
    THEORETICAL COMPUTER SCIENCE, 2007, 387 (03) : 348 - 359