Flexible and Efficient Algorithms for Abelian Matching in Genome Sequence

被引:0
|
作者
Faro, Simone [1 ]
Pavone, Arianna [2 ]
机构
[1] Univ Catania, Dipartimento Matemat & Informat, Viale Andrea Doria 6, I-95125 Catania, Italy
[2] Univ Messina, Dipartimento Sci Cognit, Via Concez 6, I-98122 Messina, Italy
关键词
Approximate string matching; Abelian matching jumbled matching; Experimental algorithms;
D O I
10.1007/978-3-030-17938-0_28
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Approximate matching in strings is a fundamental and challenging problem in computer science and in computational biology, and increasingly fast algorithms are highly demanded in many applications including text processing and dna sequence analysis. Recently efficient solutions to specific approximate matching problems on genomic sequences have been designed using a filtering technique, based on the general abelian matching problem, which firstly locates the set of all candidate matching positions and then perform an additional verification test on the collected positions. The abelian pattern matching problem consists in finding all substrings of a text which are permutations of a given pattern. In this paper we present a new class of algorithms based on a new efficient fingerprint computation approach, called Heap-Counting, which turns out to be fast, flexible and easy to be implemented. We prove that, when applied for searching short patterns on a dna sequence, our solutions have a linear worst case time complexity. In addition we present an experimental evaluation which shows that our newly presented algorithms are among the most efficient and flexible solutions in practice for the abelian matching problem in dna sequences.
引用
收藏
页码:307 / 318
页数:12
相关论文
共 50 条
  • [31] Efficient algorithms for Petersen's matching theorem
    Biedl, TC
    Bese, P
    Demaine, ED
    Lubiw, A
    PROCEEDINGS OF THE TENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, 1999, : 130 - 139
  • [32] Efficient algorithms for approximate string matching with swaps
    Lee, JS
    Kim, DK
    Park, K
    Cho, Y
    COMBINATORIAL PATTERN MATCHING, PROCEEDINGS, 1997, 1264 : 28 - 39
  • [33] Efficient algorithms for Petersen's matching theorem
    Biedl, TC
    Bose, P
    Demaine, ED
    Lubiw, A
    JOURNAL OF ALGORITHMS-COGNITION INFORMATICS AND LOGIC, 2001, 38 (01): : 110 - 134
  • [34] Efficient rule-matching algorithms on snort
    Department of Computer, School of Electronic Information Engineering, Dalian University of Technology, Dalian 116023, China
    Jisuanji Gongcheng, 2006, 18 (155-156+213):
  • [35] EFFICIENT ALGORITHMS FOR FINDING MAXIMAL MATCHING IN GRAPHS
    GALIL, Z
    LECTURE NOTES IN COMPUTER SCIENCE, 1983, 159 : 90 - 113
  • [36] Efficient parallel hardware algorithms for string matching
    Park, JH
    George, KM
    MICROPROCESSORS AND MICROSYSTEMS, 1999, 23 (03) : 155 - 168
  • [37] Efficient Algorithms for Image Template and Dictionary Matching
    Sung-Hyuk Cha
    Journal of Mathematical Imaging and Vision, 2000, 12 : 81 - 90
  • [38] EFFICIENT RANDOMIZED PATTERN-MATCHING ALGORITHMS
    KARP, RM
    RABIN, MO
    IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 1987, 31 (02) : 249 - 260
  • [39] Efficient algorithms for image template and dictionary matching
    Cha, SH
    JOURNAL OF MATHEMATICAL IMAGING AND VISION, 2000, 12 (01) : 81 - 90
  • [40] Efficient Algorithms for Flexible Sweep Coverage in Crowdsensing
    Huang, Peihuang
    Zhu, Wenxing
    Liao, Kewen
    Sellis, Timos
    Yu, Zhiyong
    Guo, Longkun
    IEEE ACCESS, 2018, 6 : 50055 - 50065