Efficient algorithms for biological stems search

被引:2
|
作者
Mi, Tian [1 ]
Rajasekaran, Sanguthevar [1 ]
机构
[1] Univ Connecticut, Dept Comp Sci & Engn, Storrs, CT 06269 USA
来源
BMC BIOINFORMATICS | 2013年 / 14卷
关键词
FINDING MOTIFS;
D O I
10.1186/1471-2105-14-161
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Motifs are significant patterns in DNA, RNA, and protein sequences, which play an important role in biological processes and functions, like identification of open reading frames, RNA transcription, protein binding, etc. Several versions of the motif search problem have been studied in the literature. One such version is called the Planted Motif Search (PMS) or (l, d)-motif Search. PMS is known to be NP complete. The time complexities of most of the planted motif search algorithms depend exponentially on the alphabet size. Recently a new version of the motif search problem has been introduced by Kuksa and Pavlovic. We call this version as the Motif Stems Search (MSS) problem. A motif stem is an l-mer (for some relevant value of l) with some wildcard characters and hence corresponds to a set of l-mers (without wildcards), some of which are (l, d)-motifs. Kuksa and Pavlovic have presented an efficient algorithm to find motif stems for inputs from large alphabets. Ideally, the number of stems output should be as small as possible since the stems form a superset of the motifs. Results: In this paper we propose an efficient algorithm for MSS and evaluate it on both synthetic and real data. This evaluation reveals that our algorithm is much faster than Kuksa and Pavlovic's algorithm. Conclusions: Our MSS algorithm outperforms the algorithm of Kuksa and Pavlovic in terms of the run time as well as the number of stems output. Specifically, the stems output by our algorithm form a proper (and much smaller) subset of the stems output by Kuksa and Pavlovic's algorithm.
引用
收藏
页数:9
相关论文
共 50 条
  • [31] Adaptive algorithms for cache-efficient trio search
    Acharya, A
    Zhu, HC
    Shen, K
    ALGORITHM ENGINEERING AND EXPERIMENTATION, 1999, 1619 : 296 - 311
  • [32] EFFICIENT ALGORITHMS TO GLOBALLY BALANCE A BINARY SEARCH TREE
    CHANG, H
    IYENGAR, SS
    COMMUNICATIONS OF THE ACM, 1984, 27 (07) : 695 - 702
  • [33] Approximating the ε-efficient set of an MOP with stochastic search algorithms
    Schuetze, Oliver
    Coello Coello, Carlos A.
    Talbi, El-Ghazali
    MICAI 2007: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2007, 4827 : 128 - +
  • [34] Biological-inspired Algorithms for Dynamic Search in Web Graphs
    Kouzas, Georgios
    Anagnostopoulos, Ioannis
    2011 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS (ISCC), 2011,
  • [35] Simple and Efficient Pattern Matching Algorithms for Biological Sequences
    Neamatollahi, Peyman
    Hadi, Montassir
    Naghibzadeh, Mahmoud
    IEEE ACCESS, 2020, 8 (08): : 23838 - 23846
  • [36] Efficient algorithms for motion and deformation recovery with biological applications
    Mukherjee, L
    Xu, JH
    Pliss, A
    Berezney, R
    2004 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN & CYBERNETICS, VOLS 1-7, 2004, : 3110 - 3115
  • [37] Identification of efficient algorithms for web search through implementation of learning-to-rank algorithms
    Nikhil Dhake
    Shital Raut
    Ashwini Rahangdale
    Sādhanā, 2019, 44
  • [38] Identification of efficient algorithms for web search through implementation of learning-to-rank algorithms
    Dhake, Nikhil
    Raut, Shital
    Rahangdale, Ashwini
    SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2019, 44 (04): : 1 - 12
  • [39] EFFICIENT SEARCH STRATEGIES IN ITERATIVE ALGORITHMS FOR VARIANCE COMPONENT ESTIMATION
    SIMIANER, H
    JOURNAL OF ANIMAL BREEDING AND GENETICS-ZEITSCHRIFT FUR TIERZUCHTUNG UND ZUCHTUNGSBIOLOGIE, 1988, 105 (06): : 468 - 483
  • [40] Space-Efficient Parallel Algorithms for Combinatorial Search Problems
    Pietracaprina, Andrea
    Pucci, Geppino
    Silvestri, Francesco
    Vandin, Fabio
    MATHEMATICAL FOUNDATIONS OF COMPUTER SCIENCE 2013, 2013, 8087 : 717 - 728