On demand string sorting over unbounded alphabets

被引:3
|
作者
Kent, Carmel [2 ]
Lewenstein, Moshe [1 ]
Sheinwald, Dafna [2 ]
机构
[1] Bar Ilan Univ, Dept Comp Sci, Ramat Gan, Israel
[2] IBM Res Lab, Haifa, Israel
基金
以色列科学基金会;
关键词
String matching; Data structures; SUFFIX TREE CONSTRUCTION; LINEAR-TIME CONSTRUCTION; ARRAYS;
D O I
10.1016/j.tcs.2011.12.001
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
On-demand string sorting is the problem of preprocessing a set of strings to allow subsequent queries for finding the k lexicographically smallest strings (and afterward the next k etc.) This on-demand variant strongly resembles the search engine queries which give you the best k-ranked pages recurringly. We present a data structure that supports this in O(n) preprocessing time, where n is the number of strings, and answer queries in O(log n) time. There is also a cost of O(N) time amortized over all operations, where N is the total length of the strings. Our data structure is a heap of strings, which supports heapify and delete-mins. As it turns out, implementing a full heap with all operations is not that simple. For the sake of completeness, we propose a heap with full operations based on balanced indexing trees that supports the heap operations in optimal times. (C) 2011 Elsevier B.V. All rights reserved.
引用
收藏
页码:66 / 74
页数:9
相关论文
共 50 条
  • [21] Variable Automata over Infinite Alphabets
    Grumberg, Orna
    Kupferman, Orna
    Sheinvald, Sarai
    LANGUAGE AND AUTOMATA THEORY AND APPLICATIONS, 2010, 6031 : 561 - +
  • [22] LDPC codes over mixed alphabets
    Ng, K. S.
    Armand, M. A.
    ELECTRONICS LETTERS, 2006, 42 (22) : 1290 - 1292
  • [23] Reversals and transpositions over finite alphabets
    Radcliffe, AJ
    Scott, AD
    Wilmer, EL
    SIAM JOURNAL ON DISCRETE MATHEMATICS, 2005, 19 (01) : 224 - 244
  • [24] Induced Suffix Sorting for String Collections
    Louza, Felipe A.
    Gog, Simon
    Telles, Guilherme P.
    2016 DATA COMPRESSION CONFERENCE (DCC), 2016, : 43 - 52
  • [25] Communication-Efficient String Sorting
    Bingmann, Timo
    Sanders, Peter
    Schimek, Matthias
    2020 IEEE 34TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM IPDPS 2020, 2020, : 137 - 147
  • [26] PERIODICITY PROBLEM OF SUBSTITUTIONS OVER TERNARY ALPHABETS
    Tan, Bo
    Wen, Zhi-Ying
    RAIRO-THEORETICAL INFORMATICS AND APPLICATIONS, 2008, 42 (04): : 747 - 762
  • [27] Regular expressions for languages over infinite alphabets
    Kaminski, M
    Tan, T
    FUNDAMENTA INFORMATICAE, 2006, 69 (03) : 301 - 318
  • [28] Regular expressions for languages over infinite alphabets
    Kaminski, M
    Tan, T
    COMPUTING AND COMBINATORICS, PROCEEDINGS, 2004, 3106 : 171 - 178
  • [29] On Dejean's conjecture over large alphabets
    Carpi, Arturo
    THEORETICAL COMPUTER SCIENCE, 2007, 385 (1-3) : 137 - 151
  • [30] Diletter circular codes over finite alphabets
    Fimmel, Elena
    Michel, Christian J.
    Struengrnann, Lutz
    MATHEMATICAL BIOSCIENCES, 2017, 294 : 120 - 129