Efficient algorithms for biological stems search

被引:2
|
作者
Mi, Tian [1 ]
Rajasekaran, Sanguthevar [1 ]
机构
[1] Univ Connecticut, Dept Comp Sci & Engn, Storrs, CT 06269 USA
来源
BMC BIOINFORMATICS | 2013年 / 14卷
关键词
FINDING MOTIFS;
D O I
10.1186/1471-2105-14-161
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Motifs are significant patterns in DNA, RNA, and protein sequences, which play an important role in biological processes and functions, like identification of open reading frames, RNA transcription, protein binding, etc. Several versions of the motif search problem have been studied in the literature. One such version is called the Planted Motif Search (PMS) or (l, d)-motif Search. PMS is known to be NP complete. The time complexities of most of the planted motif search algorithms depend exponentially on the alphabet size. Recently a new version of the motif search problem has been introduced by Kuksa and Pavlovic. We call this version as the Motif Stems Search (MSS) problem. A motif stem is an l-mer (for some relevant value of l) with some wildcard characters and hence corresponds to a set of l-mers (without wildcards), some of which are (l, d)-motifs. Kuksa and Pavlovic have presented an efficient algorithm to find motif stems for inputs from large alphabets. Ideally, the number of stems output should be as small as possible since the stems form a superset of the motifs. Results: In this paper we propose an efficient algorithm for MSS and evaluate it on both synthetic and real data. This evaluation reveals that our algorithm is much faster than Kuksa and Pavlovic's algorithm. Conclusions: Our MSS algorithm outperforms the algorithm of Kuksa and Pavlovic in terms of the run time as well as the number of stems output. Specifically, the stems output by our algorithm form a proper (and much smaller) subset of the stems output by Kuksa and Pavlovic's algorithm.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] Efficient local search algorithms for the linear ordering problem
    Sakuraba, Celso S.
    Yagiura, Mutsunori
    INTERNATIONAL TRANSACTIONS IN OPERATIONAL RESEARCH, 2010, 17 (06) : 711 - 737
  • [22] Index Based Efficient Algorithms For Closest Community Search
    Akbas, Esra
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 701 - 710
  • [23] Space Efficient Algorithms for Breadth-Depth Search
    Chakraborty, Sankardeep
    Mukherjee, Anish
    Satti, Srinivasa Rao
    FUNDAMENTALS OF COMPUTATION THEORY, FCT 2019, 2019, 11651 : 201 - 212
  • [24] Efficient sequential and parallel algorithms for planted motif search
    Nicolae, Marius
    Rajasekaran, Sanguthevar
    BMC BIOINFORMATICS, 2014, 15
  • [25] Space and time efficient algorithms for Planted Motif Search
    Davila, Jaime
    Balla, Sudha
    Rajasekaran, Sanguthevar
    COMPUTATIONAL SCIENCE - ICCS 2006, PT 2, PROCEEDINGS, 2006, 3992 : 822 - 829
  • [26] Implementation of efficient quantum search algorithms on NISQ computers
    Kun Zhang
    Pooja Rao
    Kwangmin Yu
    Hyunkyung Lim
    Vladimir Korepin
    Quantum Information Processing, 2021, 20
  • [27] ALGORITHMS FOR CONSTRUCTING SEARCH TREES FOR EFFICIENT QUERYING.
    Ng, Peter A.
    Paul, Jean F.
    Proceedings of the Texas Conference on Computing Systems, 1979, : 11 - 7
  • [28] Efficient Local Search Algorithms for GTSP: A Comparative Study
    Ben Nejma, Ibtissem
    Mhalla, Hedi
    INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER AND ENERGY TECHNOLOGIES (ICECET 2021), 2021, : 2247 - 2252
  • [29] Efficient search algorithms implementation for SAR image analysis
    Belean, Bogdan
    Carata, Serban Vasile
    Ghenescu, Marian
    2018 CONFERENCE GRID, CLOUD & HIGH PERFORMANCE COMPUTING IN SCIENCE (ROLCG), 2018,
  • [30] Implementation of efficient quantum search algorithms on NISQ computers
    Zhang, Kun
    Rao, Pooja
    Yu, Kwangmin
    Lim, Hyunkyung
    Korepin, Vladimir
    QUANTUM INFORMATION PROCESSING, 2021, 20 (07)