On demand string sorting over unbounded alphabets

Cited by: 3
Authors
Kent, Carmel [2]
Lewenstein, Moshe [1]
Sheinwald, Dafna [2]
Affiliations
[1] Bar Ilan Univ, Dept Comp Sci, Ramat Gan, Israel
[2] IBM Res Lab, Haifa, Israel
Funding
Israel Science Foundation
Keywords
String matching; Data structures; Suffix tree construction; Linear-time construction; Arrays
DOI
10.1016/j.tcs.2011.12.001
Chinese Library Classification
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
On-demand string sorting is the problem of preprocessing a set of strings to allow subsequent queries for finding the k lexicographically smallest strings (and afterward the next k, etc.). This on-demand variant strongly resembles search engine queries, which repeatedly return the next k best-ranked pages. We present a data structure that supports this in O(n) preprocessing time, where n is the number of strings, and answers queries in O(log n) time. There is also a cost of O(N) time amortized over all operations, where N is the total length of the strings. Our data structure is a heap of strings, which supports heapify and delete-min operations. As it turns out, implementing a full heap with all operations is not that simple. For the sake of completeness, we propose a heap with full operations based on balanced indexing trees that supports the heap operations in optimal times. (C) 2011 Elsevier B.V. All rights reserved.
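The query interface described in the abstract (preprocess once, then repeatedly ask for the next k smallest strings) can be illustrated with a minimal sketch. It uses Python's standard `heapq` with ordinary string comparisons; it is not the authors' data structure. The class name `OnDemandSorter` and method `next_k` are hypothetical names chosen for illustration. Because each comparison here may scan up to a full string, this sketch does not achieve the O(N) amortized character cost stated in the abstract; it only shows the heapify and delete-min usage pattern.

```python
import heapq
from typing import Iterable, List


class OnDemandSorter:
    """Illustrative binary heap of strings (hypothetical name, plain comparisons)."""

    def __init__(self, strings: Iterable[str]) -> None:
        # Preprocessing: heapify uses O(n) string comparisons.
        self._heap: List[str] = list(strings)
        heapq.heapify(self._heap)

    def next_k(self, k: int) -> List[str]:
        """Return the next k lexicographically smallest strings (fewer if exhausted)."""
        out: List[str] = []
        # Each delete-min uses O(log n) string comparisons.
        while self._heap and len(out) < k:
            out.append(heapq.heappop(self._heap))
        return out


sorter = OnDemandSorter(["pear", "apple", "fig", "banana", "cherry"])
first_batch = sorter.next_k(2)   # the two smallest strings
second_batch = sorter.next_k(2)  # the next two
last_batch = sorter.next_k(2)    # whatever remains
```

Only the strings actually requested are extracted in sorted order, which is the point of the on-demand variant: a caller that stops after the first batch never pays for sorting the rest.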
Pages: 66-74
Page count: 9
Related papers
50 in total
  • [1] On demand string sorting over unbounded alphabets
    Kent, Carmel
    Lewenstein, Moshe
    Sheinwald, Dafna
    COMBINATORIAL PATTERN MATCHING, PROCEEDINGS, 2007, 4580 : 16 - +
  • [2] Efficient Sorting Suffixes of Big Alphabets
    Nong, Ge
    Zhang, Sen
    2024 DATA COMPRESSION CONFERENCE, DCC, 2024, : 273 - 282
  • [3] HISTORY SENSITIVE STRING FOR MULTIPLE ALPHABETS
    CHUNG, TC
    CHO, JW
    INFORMATION PROCESSING LETTERS, 1987, 25 (03) : 183 - 188
  • [4] STRING-MATCHING ON ORDERED ALPHABETS
    CROCHEMORE, M
    THEORETICAL COMPUTER SCIENCE, 1992, 92 (01) : 33 - 47
  • [5] Universal lossless coding of sources with large and unbounded alphabets
    Yang, EH
    Jia, YW
    2000 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY, PROCEEDINGS, 2000, : 16 - 16
  • [6] DICTIONARY-MATCHING ON UNBOUNDED ALPHABETS - UNIFORM LENGTH DICTIONARIES
    BRESLAUER, D
    JOURNAL OF ALGORITHMS-COGNITION INFORMATICS AND LOGIC, 1995, 18 (02) : 278 - 295
  • [7] LANGUAGES OVER INFINITE ALPHABETS
    AUTEBERT, JM
    BEAUQUIER, J
    BOASSON, L
    DISCRETE APPLIED MATHEMATICS, 1980, 2 (01) : 1 - 20
  • [8] Shift-or string matching with super-alphabets
    Fredriksson, K
    INFORMATION PROCESSING LETTERS, 2003, 87 (04) : 201 - 204
  • [9] Engineering Parallel String Sorting
    Bingmann, Timo
    Eberle, Andreas
    Sanders, Peter
    ALGORITHMICA, 2017, 77 (01) : 235 - 286
  • [10] Optimal suffix sorting and LCP array construction for constant alphabets
    Louza, Felipe A.
    Gog, Simon
    Telles, Guilherme P.
    INFORMATION PROCESSING LETTERS, 2017, 118 : 30 - 34