Cache-oblivious index for approximate string matching

Cited by: 0
Authors
Hon, Wing-Kai [1]
Lam, Tak-Wah [2]
Shah, Rahul
Tam, Siu-Lung [2]
Vitter, Jeffrey Scott [3]
Affiliations
[1] Natl Tsing Hua Univ, Dept Comp Sci, Hsinchu, Taiwan
[2] Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[3] Purdue Univ, Dept Comp Sci, W Lafayette, IN USA
DOI
Not available
Chinese Library Classification (CLC) number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
This paper revisits the problem of indexing a text for approximate string matching. Specifically, given a text $T$ of length $n$ and a positive integer $k$, we want to construct an index of $T$ such that for any input pattern $P$, we can find all its $k$-error matches in $T$ efficiently. This problem is well-studied in the internal-memory setting. Here, we extend some of these recent results to external-memory solutions, which are also cache-oblivious. Our first index occupies $O((n \log^k n)/B)$ disk pages and finds all $k$-error matches with $O((|P| + occ)/B + \log^k n \log\log_B n)$ I/Os, where $B$ denotes the number of words in a disk page. To the best of our knowledge, this index is the first external-memory data structure that does not require $\Omega(|P| + occ + \mathrm{poly}(\log n))$ I/Os. The second index reduces the space to $O((n \log n)/B)$ disk pages, and the I/O complexity is $O((|P| + occ)/B + \log^{k(k+1)} n \log\log n)$.
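To make the notion of a k-error match concrete, the following is a minimal sketch of an index-free baseline, assuming that "k-error" means edit distance (insertions, deletions, substitutions) of at most k. It is the classic column-by-column dynamic program over the text, not the cache-oblivious index described in the abstract, and the function name `k_error_match_ends` is hypothetical.

```python
def k_error_match_ends(text, pattern, k):
    """Return every end position j (1-based, i.e. text[:j]) such that some
    substring of text ending at j is within edit distance k of pattern.

    Illustrative baseline only: it scans the whole text in O(n * |P|) time
    and does not use any index.
    """
    m = len(pattern)
    # col[i] = minimum edit distance between pattern[:i] and any substring
    # of the text ending at the current position. Row 0 is always 0 because
    # a match may start anywhere in the text.
    col = list(range(m + 1))
    ends = []
    if col[m] <= k:
        # Only possible when |P| <= k: the empty substring already matches.
        ends.append(0)
    for j, c in enumerate(text, start=1):
        prev_diag = col[0]  # value of D[i-1][j-1] for i = 1
        for i in range(1, m + 1):
            cur = col[i]  # D[i][j-1], needed as the next diagonal value
            cost = 0 if pattern[i - 1] == c else 1
            col[i] = min(prev_diag + cost,  # match or substitution
                         cur + 1,           # extra text character
                         col[i - 1] + 1)    # missing pattern character
            prev_diag = cur
        if col[m] <= k:
            ends.append(j)
    return ends
```

For example, `k_error_match_ends("abcdefg", "cxe", 1)` reports the single end position 5, corresponding to the substring `cde`, which differs from `cxe` by one substitution. The indexes in the abstract aim to answer the same query without scanning the whole text.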
Pages: 40 / +
Number of pages: 3
Related papers
50 in total
  • [41] Cache-Oblivious Streaming B-trees
    Bender, Michael A.
    Farach-Colton, Martin
    Fineman, Jeremy T.
    Fogel, Yonatan R.
    Kuszmaul, Bradley C.
    Nelson, Jelani
    SPAA'07: PROCEEDINGS OF THE NINETEENTH ANNUAL SYMPOSIUM ON PARALLELISM IN ALGORITHMS AND ARCHITECTURES, 2007, : 81 - +
  • [42] Cache-oblivious computation: Algorithms and experimental evaluation
    Ramachandran, Vijaya
    ICCTA 2007: INTERNATIONAL CONFERENCE ON COMPUTING: THEORY AND APPLICATIONS, PROCEEDINGS, 2007, : 20 - 25
  • [43] An Experimental Comparison of Cache-oblivious and Cache-conscious Programs
    Yotov, Kamen
    Roeder, Tom
    Pingali, Keshav
    Gunnels, John
    Gustavson, Fred
    SPAA'07: PROCEEDINGS OF THE NINETEENTH ANNUAL SYMPOSIUM ON PARALLELISM IN ALGORITHMS AND ARCHITECTURES, 2007, : 93 - +
  • [44] An Efficient Cache-oblivious Parallel Viterbi Algorithm
    Chowdhury, Rezaul
    Ganapathi, Pramod
    Pradhan, Vivek
    Tithi, Jesmin Jahan
    Xiao, Yunpeng
    EURO-PAR 2016: PARALLEL PROCESSING, 2016, 9833 : 574 - 587
  • [45] Exponential structures for efficient cache-oblivious algorithms
    Bender, MA
    Cole, R
    Raman, R
    AUTOMATA, LANGUAGES AND PROGRAMMING, 2002, 2380 : 195 - 207
  • [46] Processor-Aware Cache-Oblivious Algorithms*
    Tang, Yuan
    Gao, Weiguo
    50TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, 2021,
  • [47] Cache-oblivious High-performance Similarity Join
    Perdacher, Martin
    Plant, Claudia
    Boehm, Christian
    SIGMOD '19: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2019, : 87 - 104
  • [48] Brief Announcement: Cache-Oblivious Scheduling of Streaming Applications
    Agrawal, Kunal
    Fineman, Jeremy T.
    PROCEEDINGS OF THE 26TH ACM SYMPOSIUM ON PARALLELISM IN ALGORITHMS AND ARCHITECTURES (SPAA'14), 2014, : 79 - 81
  • [49] A locality-preserving cache-oblivious dynamic dictionary
    Bender, MA
    Duan, ZY
    Iacono, J
    Wu, J
    JOURNAL OF ALGORITHMS-COGNITION INFORMATICS AND LOGIC, 2004, 53 (02): : 115 - 136
  • [50] Improved Space Bounds for Cache-Oblivious Range Reporting
    Afshani, Peyman
    Zeh, Norbert
    PROCEEDINGS OF THE TWENTY-SECOND ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, 2011, : 1745 - 1758