Algorithms for extracting motifs from biological weighted sequences

Cited by: 1
Authors
Iliopoulos, C. [1]
Perdikuri, K. [2, 3]
Theodoridis, E. [2, 3]
Tsakalidis, A. [2, 3]
Tsichlas, K. [1]
Affiliations
[1] Kings Coll London, London WC2R 2LS, England
[2] Univ Patras, Comp Engn & Informat Dept, GR-26500 Patras, Greece
[3] Res Acad Comp Technol Inst RACTI, 61 Riga Feraiou Str, GR-26221 Patras 26221, Greece
Keywords
Motif extraction; Biological weighted sequences;
DOI
10.1016/j.jda.2006.03.018
Chinese Library Classification number
O29 [Applied Mathematics];
Discipline classification code
070104;
Abstract
In this paper we present three algorithms for the Motif Identification Problem in Biological Weighted Sequences. The first algorithm extracts repeated motifs from a biological weighted sequence. The motifs correspond to repetitive words which are approximately equal, under a Hamming distance, with probability of occurrence >= 1/k, where k is a small constant. The second algorithm extracts common motifs from a set of N >= 2 weighted sequences. In this case, the motifs consist of words that must occur with probability >= 1/k, in 1 <= q < N distinct sequences of the set. The third algorithm extracts maximal pairs from a biological weighted sequence. A pair in a sequence is the occurrence of the same word twice. In addition, the algorithms presented in this paper improve previous work on these problems. (C) 2006 Elsevier B.V. All rights reserved.
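To make the threshold condition in the abstract concrete, the following is a minimal brute-force sketch in Python, not the algorithms of the paper. It assumes a weighted sequence is a list of per-position symbol distributions and that the probability of a word at a given start is the product of its per-position probabilities. The names `occurrence_probability`, `valid_occurrences`, `repeated_words`, and the `example` sequence are hypothetical, and only exact repeats are covered (no Hamming-distance matching).

```python
from fractions import Fraction

def occurrence_probability(wseq, word, start):
    """Product of the per-position probabilities of `word` placed at `start`."""
    if start < 0 or start + len(word) > len(wseq):
        return Fraction(0)
    prob = Fraction(1)
    for offset, symbol in enumerate(word):
        prob *= wseq[start + offset].get(symbol, Fraction(0))
    return prob

def valid_occurrences(wseq, length, k):
    """Map each word of `length` to the starts where it occurs with probability >= 1/k."""
    threshold = Fraction(1, k)
    found = {}

    def extend(start, prefix, prob):
        if len(prefix) == length:
            found.setdefault(prefix, []).append(start)
            return
        # Probabilities never exceed 1, so a prefix below the threshold
        # cannot recover; it is safe to prune it here.
        for symbol, p in wseq[start + len(prefix)].items():
            if prob * p >= threshold:
                extend(start, prefix + symbol, prob * p)

    for start in range(len(wseq) - length + 1):
        extend(start, "", Fraction(1))
    return found

def repeated_words(wseq, length, k):
    """Words of `length` with at least two valid occurrences (exact repeats only)."""
    occurrences = valid_occurrences(wseq, length, k)
    return {word: starts for word, starts in occurrences.items() if len(starts) >= 2}

# Hypothetical weighted sequence: one symbol distribution per position.
example = [
    {"A": Fraction(1)},
    {"C": Fraction(1, 2), "G": Fraction(1, 2)},
    {"A": Fraction(1)},
    {"C": Fraction(1)},
    {"A": Fraction(1)},
    {"C": Fraction(3, 4), "T": Fraction(1, 4)},
]

print(repeated_words(example, 2, 2))
```

With k = 2, the word "AT" at position 4 has probability 1/4 and is rejected, while "AC" and "CA" each pass the threshold at more than one position. This enumeration is exponential in the worst case and serves only to illustrate the definitions.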
Pages: 229 - 242
Page count: 14
Related papers
50 in total
  • [1] Extracting biological knowledge from DNA sequences
    DelaVega, FM
    Thieffry, D
    ColladoVides, J
    PACIFIC SYMPOSIUM ON BIOCOMPUTING '97, 1996, : 6 - 7
  • [2] Extracting glycan motifs using a biochemically-weighted kernel
    Jiang, Hao
    Aoki-Kinoshita, Kiyoko F.
    Ching, Wai-Ki
    BIOINFORMATION, 2011, 7 (08) : 405 - 412
  • [3] Extracting Information from Weighted Contact Networks via Genetic Algorithms
    Rutkowski, Emilia
    Houghten, Sheridan
    Brown, Joseph Alexander
    2020 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY (CIBCB), 2020, : 228 - 235
  • [4] DNA Motifs Detection Algorithms in Long Sequences
    Voina, Alin G.
    Pop, Petre G.
    Vaida, Mircea F.
    IEEE 12TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS & BIOENGINEERING, 2012, : 169 - 174
  • [5] Algorithms for searching RNA motifs in genomic sequences
    Liu, JP
    Ma, B
    Zhang, KZ
    PROCEEDINGS OF THE 8TH JOINT CONFERENCE ON INFORMATION SCIENCES, VOLS 1-3, 2005, : 1303 - 1306
  • [6] ARCS-Motif: discovering correlated motifs from unaligned biological sequences
    Zhang, Shijie
    Su, Wei
    Yang, Jiong
    BIOINFORMATICS, 2009, 25 (02) : 183 - 189
  • [7] Detecting motifs from sequences
    Hu, YJ
    Sandmeyer, S
    Kibler, D
    MACHINE LEARNING, PROCEEDINGS, 1999, : 181 - 190
  • [8] Suffix tree characterization of maximal motifs in biological sequences
    Federico, Maria
    Pisanti, Nadia
    THEORETICAL COMPUTER SCIENCE, 2009, 410 (43) : 4391 - 4401
  • [9] Computing distribution of scale independent motifs in biological sequences
    Almeida, Jonas S.
    Vinga, Susana
    ALGORITHMS FOR MOLECULAR BIOLOGY, 2006, 1 (1)