A "Learned" Approach to Quicken and Compress Rank/Select Dictionaries

Cited by: 0
Authors
Boffa, Antonio [1]
Ferragina, Paolo [1]
Vinciguerra, Giorgio [1]
Affiliation
[1] Univ Pisa, Dipartimento Informat, Pisa, Italy
Keywords
RANK;
DOI
Not available
Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
We address the well-known problem of designing, implementing and experimenting with compressed data structures that support rank and select queries over a dictionary of integers. This problem has been studied extensively since the end of the '80s, yielding many important theoretical and practical results. Following a recent line of research on so-called learned data structures, we first show that this problem has a surprising connection with the geometry of a set of points in the Cartesian plane suitably derived from the input integers. We then build upon some classical results in computational geometry to introduce the first "learned" scheme for implementing a compressed rank/select dictionary. We prove theoretical bounds on its time and space performance both in the worst case and in the case of input distributions with finite mean and variance. We corroborate these theoretical results with a large set of experiments over datasets originating from a variety of sources and applications (Web, DNA sequencing, information retrieval and natural language processing), and we show that a carefully engineered version of our approach provides new and interesting space-time trade-offs with respect to several well-established implementations of Elias-Fano, RRR-vector, and random-access vectors of Elias γ/δ-coded gaps.
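The geometric connection mentioned in the abstract can be illustrated with a minimal Python sketch. It assumes the points are (i, x_i), where x_i is the i-th smallest integer, and it fits them with a simple greedy "shrinking cone" heuristic rather than the classical computational-geometry algorithms the authors build upon. The names `build_segments`, `LearnedDict` and the parameter `eps` are hypothetical, and rank is answered here by a plain binary search over select, which is not necessarily how the paper does it.

```python
import math
from bisect import bisect_right


def _pick(lo, hi):
    # A single-point segment has an unconstrained slope; use 0 in that case.
    return 0.0 if math.isinf(lo) else (lo + hi) / 2


def build_segments(keys, eps):
    """Greedily cover the points (i, keys[i]) with line segments whose
    vertical error is about eps. Returns (start_index, start_key, slope)
    triples. This is a simple heuristic, not an optimal fit."""
    segments = []
    i0, x0 = 0, keys[0]
    lo, hi = -math.inf, math.inf  # feasible slope interval (the "cone")
    for i in range(1, len(keys)):
        dx = i - i0
        new_lo = max(lo, (keys[i] - x0 - eps) / dx)
        new_hi = min(hi, (keys[i] - x0 + eps) / dx)
        if new_lo > new_hi:  # cone is empty: close segment, start a new one
            segments.append((i0, x0, _pick(lo, hi)))
            i0, x0 = i, keys[i]
            lo, hi = -math.inf, math.inf
        else:
            lo, hi = new_lo, new_hi
    segments.append((i0, x0, _pick(lo, hi)))
    return segments


class LearnedDict:
    """Toy rank/select dictionary over a sorted list of distinct integers."""

    def __init__(self, keys, eps=4):
        self.n = len(keys)
        self.segments = build_segments(keys, eps)
        self.starts = [s[0] for s in self.segments]
        # Small per-key corrections make the approximation exact.
        self.corrections = [k - self._predict(i) for i, k in enumerate(keys)]

    def _predict(self, i):
        # Locate the segment covering position i and evaluate its line.
        i0, x0, slope = self.segments[bisect_right(self.starts, i) - 1]
        return x0 + math.floor(slope * (i - i0))

    def select(self, i):
        """Return the i-th smallest key (1-indexed)."""
        return self._predict(i - 1) + self.corrections[i - 1]

    def rank(self, x):
        """Return the number of keys <= x, by binary search over select."""
        lo, hi = 0, self.n
        while lo < hi:
            mid = (lo + hi) // 2
            if self.select(mid + 1) <= x:
                lo = mid + 1
            else:
                hi = mid
        return lo
```

In this sketch the space saving would come from storing a few segments plus corrections of small magnitude instead of the full integers; the corrections are kept in an ordinary Python list here, so no actual compression takes place.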
Pages: 46-59
Number of pages: 14