Algorithms for extracting motifs from biological weighted sequences

Cited by: 1
Authors
Iliopoulos, C. [1]
Perdikuri, K. [2, 3]
Theodoridis, E. [2, 3]
Tsakalidis, A. [2, 3]
Tsichlas, K. [1]
Affiliations
[1] Kings Coll London, London WC2R 2LS, England
[2] Univ Patras, Comp Engn & Informat Dept, GR-26500 Patras, Greece
[3] Res Acad Comp Technol Inst RACTI, 61 Riga Feraiou Str, GR-26221 Patras 26221, Greece
Keywords
Motif extraction; Biological weighted sequences
DOI
10.1016/j.jda.2006.03.018
Chinese Library Classification
O29 [Applied Mathematics]
Discipline classification code
070104
Abstract
In this paper we present three algorithms for the Motif Identification Problem in Biological Weighted Sequences. The first algorithm extracts repeated motifs from a biological weighted sequence. The motifs correspond to repetitive words that are approximately equal under the Hamming distance and occur with probability >= 1/k, where k is a small constant. The second algorithm extracts common motifs from a set of N >= 2 weighted sequences. In this case, the motifs consist of words that must occur with probability >= 1/k in 1 <= q < N distinct sequences of the set. The third algorithm extracts maximal pairs from a biological weighted sequence. A pair in a sequence is the occurrence of the same word twice. In addition, the algorithms presented in this paper improve on previous work on these problems. (C) 2006 Elsevier B.V. All rights reserved.
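The definitions used in the abstract can be illustrated with a small brute-force sketch. This is not one of the paper's three algorithms; it only shows what "a word occurs with probability >= 1/k" could mean in practice. It assumes that a weighted sequence is a list of per-position symbol distributions and that the probability of a word at a position is the product of the per-symbol probabilities (that is, positions are treated as independent). The names `occurrence_probability`, `valid_words`, and `hamming` are hypothetical helpers introduced here for illustration.

```python
from typing import Dict, List

# Assumed representation: one {symbol: probability} map per position.
WeightedSeq = List[Dict[str, float]]


def occurrence_probability(seq: WeightedSeq, word: str, start: int) -> float:
    """Probability that `word` occurs at `start`, assuming independent positions."""
    if start < 0 or start + len(word) > len(seq):
        return 0.0
    p = 1.0
    for offset, symbol in enumerate(word):
        p *= seq[start + offset].get(symbol, 0.0)
    return p


def valid_words(seq: WeightedSeq, length: int, k: int) -> Dict[str, List[int]]:
    """Map each word of the given length to the start positions where it
    occurs with probability >= 1/k (naive enumeration with pruning)."""
    threshold = 1.0 / k
    result: Dict[str, List[int]] = {}
    for start in range(len(seq) - length + 1):
        partial = [("", 1.0)]  # (prefix, probability so far)
        for offset in range(length):
            extended = []
            for prefix, p in partial:
                for symbol, weight in seq[start + offset].items():
                    # Prune: probabilities only shrink as the word grows.
                    if p * weight >= threshold:
                        extended.append((prefix + symbol, p * weight))
            partial = extended
        for word, _ in partial:
            result.setdefault(word, []).append(start)
    return result


def hamming(a: str, b: str) -> int:
    """Number of mismatching positions between two equal-length words."""
    if len(a) != len(b):
        raise ValueError("words must have equal length")
    return sum(x != y for x, y in zip(a, b))


# Toy weighted sequence: positions 1 and 4 are uncertain between C and G.
seq = [
    {"A": 1.0}, {"C": 0.5, "G": 0.5}, {"T": 1.0},
    {"A": 1.0}, {"C": 0.5, "G": 0.5}, {"T": 1.0},
]
words = valid_words(seq, length=3, k=4)
```

With this toy input, a word that appears at two start positions is a candidate repeated motif, and two such words at Hamming distance 1 would count as approximately equal under a one-mismatch tolerance. The enumeration is exponential in the worst case and is meant only to make the definitions concrete.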
Pages: 229-242
Page count: 14
Related papers
50 in total
  • [21] Networks of Motifs from Sequences of Symbols
    Sinatra, Roberta
    Condorelli, Daniele
    Latora, Vito
    PHYSICAL REVIEW LETTERS, 2010, 105 (17)
  • [22] Discovering Motifs in Biological Sequences Using the Micron Automata Processor
    Roy, Indranil
    Aluru, Srinivas
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2016, 13 (01) : 99 - 111
  • [23] Identifying discriminative classification-based motifs in biological sequences
    Vens, Celine
    Rosso, Marie-Noelle
    Danchin, Etienne G. J.
    BIOINFORMATICS, 2011, 27 (09) : 1231 - 1238
  • [24] An Algorithm to Find All Identical Motifs in Multiple Biological Sequences
    Bindal, Ashish Kishor
    Sabarinathan, R.
    Sridhar, J.
    Sherlin, D.
    Sekar, K.
    PATTERN RECOGNITION IN BIOINFORMATICS, 2010, 6282 : 137 - +
  • [25] Efficient mining gapped sequential patterns for motifs in biological sequences
    Liao, Vance Chiang-Chi
    Chen, Ming-Syan
    BMC SYSTEMS BIOLOGY, 2013, 7
  • [26] Weighted and unweighted selection algorithms for k sorted sequences
    Hayashi, T
    Nakano, K
    Olariu, S
    ALGORITHMS AND COMPUTATION, PROCEEDINGS, 1997, 1350 : 52 - 61
  • [27] EXTRACTING REGULARITIES FROM SOUND SEQUENCES
    Winkler, Istvan
    Schroger, Erich
    PSYCHOPHYSIOLOGY, 2017, 54 : S4 - S4
  • [28] Extracting grammars from RNA sequences
    Andrejkova, Gabriela
    Lengenova, Helena
    Mati, Michal
    ADAPTIVE AND NATURAL COMPUTING ALGORITHMS, PT 1, 2007, 4431 : 404 - +
  • [29] Faster Algorithms for Sampling and Counting Biological Sequences
    Boucher, Christina
    STRING PROCESSING AND INFORMATION RETRIEVAL, PROCEEDINGS, 2009, 5721 : 243 - 253
  • [30] Efficient parallel algorithms for processing biological sequences
    Rajasekaran, S.
    Ammar, R.
    Shin, D.
    Zhang, G.
    INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2006, 26 (03) : 119 - 125