SAIL-APPROX: An efficient on-line algorithm for approximate pattern matching with wildcards and length constraints

Cited by: 8
Authors
He, Dan [1]
Wu, Xindong [1]
Zhu, Xingquan [2]
Affiliations
[1] Univ Vermont, Dept Comp Sci, Burlington, VT 05405 USA
[2] Florida Atlantic Univ, Dept Comp Sci & Engn, Boca Raton, FL 33431 USA
Funding
U.S. National Science Foundation
DOI
10.1109/BIBM.2007.48
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Finding patterns in biological sequences has a significant impact on many real-world applications, such as biological sequence analysis, text indexing, stream data mining, and sensor networking. The problem of Pattern Matching with Wildcards and Length Constraints is to find all occurrences of a pattern P in a text T, which can be a biological sequence, a text string, etc. The user can specify a range for the number of wildcards between every two consecutive letters in P, as well as length constraints on P. A further constraint is the one-off condition: every character in T can be used at most once when matching P. The on-line version of this problem is to report an occurrence of the given pattern that satisfies all constraints as soon as that occurrence appears in the portion of T read so far. An existing algorithm, SAIL, finds the optimal solution for the on-line version of this problem. However, SAIL handles only exact pattern matching. In this paper, we propose an efficient on-line algorithm for approximate pattern matching with wildcards and length constraints, which is a more general problem than exact matching. Our algorithm applies dynamic programming, and we prove that it is correct.
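The constraints described in the abstract can be made concrete with a small brute-force sketch. This is not SAIL or SAIL-APPROX, whose details are not given in this record; it only illustrates the problem statement. The function names, the representation of the gap ranges as `(lo, hi)` pairs, and in particular the reading of "approximate" as a budget of mismatched letters (`max_mismatches`) are assumptions made for illustration.

```python
from typing import List, Optional, Sequence, Set, Tuple


def occurrence_ending_at(
    text: str,
    pattern: str,
    gaps: Sequence[Tuple[int, int]],
    length_range: Tuple[int, int],
    end: int,
    max_mismatches: int = 0,
    used: Optional[Set[int]] = None,
) -> Optional[List[int]]:
    """Return the text positions of one occurrence whose last letter is at
    `end`, or None. Hypothetical helper: brute-force backtracking search."""
    used = used if used is not None else set()
    min_len, max_len = length_range

    def extend(i: int, pos: int, mismatches: int) -> Optional[List[int]]:
        # pattern[i] is aligned with text[pos]; walk backwards to pattern[0].
        if pos < 0 or pos in used:  # one-off condition: no position reused
            return None
        if text[pos] != pattern[i]:
            mismatches += 1  # assumed approximation measure: mismatch count
            if mismatches > max_mismatches:
                return None
        if i == 0:
            span = end - pos + 1  # overall length constraint on the occurrence
            return [pos] if min_len <= span <= max_len else None
        lo, hi = gaps[i - 1]  # wildcard range between pattern[i-1] and pattern[i]
        for gap in range(lo, hi + 1):
            rest = extend(i - 1, pos - gap - 1, mismatches)
            if rest is not None:
                return rest + [pos]
        return None

    return extend(len(pattern) - 1, end, 0)


def online_scan(
    text: str,
    pattern: str,
    gaps: Sequence[Tuple[int, int]],
    length_range: Tuple[int, int],
    max_mismatches: int = 0,
) -> List[List[int]]:
    """Report an occurrence as soon as its last letter arrives.
    Only text[: end + 1] is consulted at step `end`."""
    used: Set[int] = set()
    found: List[List[int]] = []
    for end in range(len(text)):
        occ = occurrence_ending_at(
            text, pattern, gaps, length_range, end, max_mismatches, used
        )
        if occ is not None:
            used.update(occ)  # mark positions so they are never matched again
            found.append(occ)
    return found
```

For example, `online_scan("ACTGAGGT", "AT", [(1, 2)], (3, 4))` returns `[[0, 2], [4, 7]]`: the pattern `A`, one or two wildcards, `T` is found first at positions 0 and 2, then at positions 4 and 7. The greedy choice made at each step is simply the first feasible alignment; it makes no claim to the optimality that the abstract attributes to SAIL.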
Pages: 151 / +
Page count: 3