Efficient algorithms for biological stems search

被引:2
|
作者
Mi, Tian [1 ]
Rajasekaran, Sanguthevar [1 ]
机构
[1] Univ Connecticut, Dept Comp Sci & Engn, Storrs, CT 06269 USA
来源
BMC BIOINFORMATICS | 2013年 / 14卷
关键词
FINDING MOTIFS;
D O I
10.1186/1471-2105-14-161
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Motifs are significant patterns in DNA, RNA, and protein sequences, which play an important role in biological processes and functions, like identification of open reading frames, RNA transcription, protein binding, etc. Several versions of the motif search problem have been studied in the literature. One such version is called the Planted Motif Search (PMS) or (l, d)-motif Search. PMS is known to be NP complete. The time complexities of most of the planted motif search algorithms depend exponentially on the alphabet size. Recently a new version of the motif search problem has been introduced by Kuksa and Pavlovic. We call this version as the Motif Stems Search (MSS) problem. A motif stem is an l-mer (for some relevant value of l) with some wildcard characters and hence corresponds to a set of l-mers (without wildcards), some of which are (l, d)-motifs. Kuksa and Pavlovic have presented an efficient algorithm to find motif stems for inputs from large alphabets. Ideally, the number of stems output should be as small as possible since the stems form a superset of the motifs. Results: In this paper we propose an efficient algorithm for MSS and evaluate it on both synthetic and real data. This evaluation reveals that our algorithm is much faster than Kuksa and Pavlovic's algorithm. Conclusions: Our MSS algorithm outperforms the algorithm of Kuksa and Pavlovic in terms of the run time as well as the number of stems output. Specifically, the stems output by our algorithm form a proper (and much smaller) subset of the stems output by Kuksa and Pavlovic's algorithm.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Efficient algorithms for biological stems search
    Tian Mi
    Sanguthevar Rajasekaran
    BMC Bioinformatics, 14
  • [2] Efficient algorithms for similarity search
    Rajasekaran, S
    Hu, Y
    Luo, J
    Nick, H
    Pardalos, PM
    Sahni, S
    Shaw, G
    JOURNAL OF COMBINATORIAL OPTIMIZATION, 2001, 5 (01) : 125 - 132
  • [3] Efficient Algorithms for Similarity Search
    S. Rajasekaran
    Y. Hu
    J. Luo
    H. Nick
    P.M. Pardalos
    S. Sahni
    G. Shaw
    Journal of Combinatorial Optimization, 2001, 5 : 125 - 132
  • [4] Space-efficient search algorithms
    Korf, RE
    ACM COMPUTING SURVEYS, 1995, 27 (03) : 337 - 339
  • [5] Efficient algorithms for local alignment search
    Rajasekaran, S
    Nick, H
    Pardalos, PM
    Sahni, S
    Shaw, G
    JOURNAL OF COMBINATORIAL OPTIMIZATION, 2001, 5 (01) : 117 - 124
  • [6] Evolving Efficient List Search Algorithms
    Wolfson, Kfir
    Sipper, Moshe
    ARTIFICIAL EVOLUTION, 2010, 5975 : 158 - 169
  • [7] Efficient algorithms for degenerate primer search
    Balla, Sudha
    Rajasekaran, Sanguthevar
    Mandoiu, Ion I.
    INTERNATIONAL JOURNAL OF FOUNDATIONS OF COMPUTER SCIENCE, 2007, 18 (04) : 899 - 910
  • [8] DESIGN EFFICIENT LOCAL SEARCH ALGORITHMS
    GU, J
    LECTURE NOTES IN ARTIFICIAL INTELLIGENCE, 1992, 604 : 651 - 654
  • [9] Efficient Algorithms for Local Alignment Search
    S. Rajasekaran
    H. Nick
    P.M. Pardalos
    S. Sahni
    G. Shaw
    Journal of Combinatorial Optimization, 2001, 5 : 117 - 124
  • [10] Rapid and efficient search of biological motion
    de Lussanet, M. H. E.
    Naumowicz, A.
    Lappe, M.
    PERCEPTION, 2007, 36 : 74 - 74