Sparse LCS Common Substring Alignment

被引:0
|
作者
Landau, GM
Schieber, B
Ziv-Ukelson, M
机构
[1] IBM Corp, Thomas J Watson Res Ctr, Yorktown Hts, NY 10598 USA
[2] Univ Haifa, Dept Comp Sci, IL-31905 Haifa, Israel
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The "Common Substring Alignment". problem is defined as follows. The input consists of a set of strings S-1, S-2...S-c, with a common substring appearing at least once in each of them, and a target string T. The goal is to compute similarity of all strings S-i with T, without computing the part of the common substring over and over again. In this paper we consider the Common Substring Alignment problem for the LCS (Longest Common Subsequence) similarity metric. Our algorithm gains its efficiency by exploiting the sparsity inherent to the LCS problem. Let Y be the common substring, n be the size of the compared sequences, L-y be the length of the LCS of T and Y, denoted \LCS[T,Y]\, and L be max {\LCS[T,S-i]\}. Our algorithm consists of an O(nL(y)) time encoding stage that is executed once per common substring, and an O(L) time alignment stage that is executed once for each appearance of the common substring in each source string. The. additional running time depends only on the length of the parts of the strings that are not in any common substring.
引用
收藏
页码:225 / 236
页数:12
相关论文
共 50 条
  • [1] Sparse LCS common substring alignment
    Landau, GM
    Schieber, B
    Ziv-Ukelson, M
    INFORMATION PROCESSING LETTERS, 2003, 88 (06) : 259 - 270
  • [2] On the common substring alignment problem
    Landau, GM
    Ziv-Ukelson, M
    JOURNAL OF ALGORITHMS-COGNITION INFORMATICS AND LOGIC, 2001, 41 (02): : 338 - 359
  • [3] A substring-substring LCS data structure
    Sakai, Yoshifumi
    THEORETICAL COMPUTER SCIENCE, 2019, 753 : 16 - 34
  • [4] A data structure for substring-substring LCS length queries
    Sakai, Yoshifumi
    THEORETICAL COMPUTER SCIENCE, 2022, 911 : 41 - 54
  • [5] A Simple Algorithm for Solving for the Generalized Longest Common Subsequence (LCS) Problem with a Substring Exclusion Constraint
    Zhu, Daxin
    Wang, Xiaodong
    ALGORITHMS, 2013, 6 (03) : 485 - 493
  • [6] On the shared substring alignment problem
    Landau, GM
    Ziv-Ukelson, M
    PROCEEDINGS OF THE ELEVENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, 2000, : 804 - 814
  • [7] Substring alignment using suffix trees
    Kay, M
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2004, 2945 : 275 - 282
  • [8] The longest common substring problem
    Crochemore, Maxime
    Iliopoulos, Costas S.
    Langiu, Alessio
    Mignosi, Filippo
    MATHEMATICAL STRUCTURES IN COMPUTER SCIENCE, 2017, 27 (02) : 277 - 295
  • [9] kmacs: the k-mismatch average common substring approach to alignment-free sequence comparison
    Leimeister, Chris-Andre
    Morgenstern, Burkhard
    BIOINFORMATICS, 2014, 30 (14) : 2000 - 2008
  • [10] On the Longest Common Cartesian Substring Problem†
    Faro, Simone
    Lecroq, Thierry
    Park, Kunsoo
    Scafiti, Stefano
    COMPUTER JOURNAL, 2023, 66 (04): : 907 - 923