A "Learned" Approach to Quicken and Compress Rank/Select Dictionaries

被引:0
|
作者
Boffa, Antonio [1 ]
Ferragina, Paolo [1 ]
Vinciguerra, Giorgio [1 ]
机构
[1] Univ Pisa, Dipartimento Informat, Pisa, Italy
关键词
RANK;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We address the well-known problem of designing, implementing and experimenting compressed data structures for supporting rank and select queries over a dictionary of integers. This problem has been studied far and wide since the end of the `80s with tons of important theoretical and practical results. Following a recent line of research on the so-called learned data structures, we first show that this problem has a surprising connection with the geometry of a set of points in the Cartesian plane suitably derived from the input integers. We then build upon some classical results in computational geometry to introduce the first "learned" scheme for implementing a compressed rank/select dictionary. We prove theoretical bounds on its time and space performance both in the worst case and in the case of input distributions with finite mean and variance. We corroborate these theoretical results with a large set of experiments over datasets originating from a variety of sources and applications (Web, DNA sequencing, information retrieval and natural language processing), and we show that a carefully engineered version of our approach provides new interesting space-time trade-offs with respect to several well-established implementations of Elias-Fano, RRRvector, and random-access vectors of Elias gamma /delta-coded gaps.
引用
收藏
页码:46 / 59
页数:14
相关论文
共 50 条
  • [31] Broadword implementation of rank/select queries
    Vigna, Sebastiano
    EXPERIMENTAL ALGORITHMS, PROCEEDINGS, 2008, 5038 : 154 - 168
  • [32] Regression trees for survival data - An approach to select discontinuous split points by rank statistics
    Schlittgen, R
    BIOMETRICAL JOURNAL, 1999, 41 (08) : 943 - 954
  • [33] HYPERSPECTRAL IMAGE REPRESENTATION USING LEARNED MULTISCALE DICTIONARIES
    Wu, Qian
    Zhang, Rong
    Xu, Dawei
    2014 6TH WORKSHOP ON HYPERSPECTRAL IMAGE AND SIGNAL PROCESSING: EVOLUTION IN REMOTE SENSING (WHISPERS), 2014,
  • [34] EFFICIENT LEARNING OF DICTIONARIES WITH LOW-RANK ATOMS
    Ravishankar, Saiprasad
    Moore, Brian E.
    Nadakuditi, Raj Rao
    Fessler, Jeffrey A.
    2016 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP), 2016, : 222 - 226
  • [35] A novel approach to compress image set
    Shie, Shih-Chieh
    Lin, Shinfeng D.
    WSEAS Transactions on Computers, 2005, 4 (02): : 263 - 266
  • [36] Learned dictionaries for sparse image representation: Properties and results
    Skretting, Karl
    Engan, Kjersti
    WAVELETS AND SPARSITY XIV, 2011, 8138
  • [37] Sparse image coding using learned overcomplete dictionaries
    Murray, JF
    Kreutz-Delgado, K
    MACHINE LEARNING FOR SIGNAL PROCESSING XIV, 2004, : 579 - 588
  • [38] IMAGE COMPRESSION VIA MULTIPLE LEARNED GEOMETRIC DICTIONARIES
    Huang, Danlan
    Tao, Xiaoming
    Xu, Mai
    Gao, Shenghua
    Lu, Jianhua
    2016 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP), 2016, : 1373 - 1377
  • [39] Sparse Representations for Pattern Classification using Learned Dictionaries
    Thiagarajan, Jayaraman J.
    Ramamurthy, Karthikeyan N.
    Spanias, Andreas
    RESEARCH AND DEVELOPMENT IN INTELLIGENT SYSTEMS XXV, 2009, : 33 - 45
  • [40] Clustering-Based Denoising With Locally Learned Dictionaries
    Chatterjee, Priyam
    Milanfar, Peyman
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2009, 18 (07) : 1438 - 1451