Alphabet Partitioning for Compressed Rank/Select and Applications

Cited by: 0
Authors
Barbay, Jeremy [1]
Gagie, Travis [1]
Navarro, Gonzalo [1]
Nekrich, Yakov [2]
Affiliations
[1] Univ Chile, Dept Comp Sci, Santiago, Chile
[2] Univ Bonn, Dept Comp Sci, Bonn, Germany
Source
Keywords
SEQUENCES;
DOI
Not available
Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
We present a data structure that stores a string s[1..n] over the alphabet [1..sigma] in nH_0(s) + o(n)(H_0(s) + 1) bits, where H_0(s) is the zero-order entropy of s. This data structure supports the queries access and rank in time O(lg lg sigma), and the select query in constant time. This result improves on previously known data structures using nH_0(s) + o(n lg sigma) bits, where on highly compressible instances the redundancy o(n lg sigma) ceases to be negligible compared to the nH_0(s) bits that encode the data. The technique is based on combining previous results through an ingenious partitioning of the alphabet, and is practical enough to be implementable. It applies not only to strings, but also to several other compact data structures. For example, we achieve (i) faster search times and lower redundancy for the smallest existing full-text self-index; (ii) compressed permutations pi with times for pi() and pi^{-1}() improved to log-logarithmic; and (iii) the first compressed representation of dynamic collections of disjoint sets.
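To make the quantities in the abstract concrete, the following minimal Python sketch computes the zero-order entropy H_0(s) and spells out what the access, rank, and select queries return, using 1-based positions as in s[1..n]. It is a naive, uncompressed illustration of the query semantics only, not the data structure described in the paper; the function names are hypothetical and each query here takes linear time rather than the stated bounds.

```python
import math
from collections import Counter


def zero_order_entropy(s):
    """H_0(s) = sum over symbols a of (n_a / n) * lg(n / n_a), in bits per symbol."""
    n = len(s)
    counts = Counter(s)
    return sum((c / n) * math.log2(n / c) for c in counts.values())


def access(s, i):
    """Return the symbol s[i], with positions numbered from 1."""
    return s[i - 1]


def rank(s, a, i):
    """Return the number of occurrences of symbol a in the prefix s[1..i]."""
    return s[:i].count(a)


def select(s, a, j):
    """Return the 1-based position of the j-th occurrence of a, or None if absent."""
    seen = 0
    for pos, c in enumerate(s, start=1):
        if c == a:
            seen += 1
            if seen == j:
                return pos
    return None
```

For a highly compressible input, such as a long string that is almost entirely one symbol over a large alphabet, n * zero_order_entropy(s) is far smaller than n * lg sigma, which is the regime where an o(n lg sigma) redundancy term stops being negligible.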
Pages: 315 / +
Page count: 2