Locating tandem repeats in weighted sequences in proteins

Authors
Hui Zhang
Qing Guo
Costas S Iliopoulos
Affiliations
[1] Zhejiang University of Technology, College of Computer Science and Technology
[2] Zhejiang University, College of Computer Science and Engineering (corresponding author)
[3] King's College London, Strand, Department of Computer Science
Keywords
Equivalence Class; Tandem Repeat; Independent Component Analysis; Weighted Sequence; Nonnegative Matrix Factorization;
DOI
Not available
Abstract
A weighted biological sequence is a string in which several characters may appear at each position, each with its own probability of occurrence. We address the problem of locating all tandem repeats in a weighted sequence. A repeated substring is called a tandem repeat if its occurrences are directly adjacent to one another. By introducing the idea of equivalence classes in weighted sequences, we identify the tandem repeats of every possible length using an iterative partitioning technique. We also present an algorithm for recording the tandem repeats and prove that the problem can be solved in O(n^2) time.
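To make the problem statement concrete, here is a minimal brute-force sketch in Python. It is not the equivalence-class partitioning algorithm described in the abstract and it does not achieve the O(n^2) bound. It also rests on assumptions the abstract does not state: a weighted sequence is modelled as a list of dictionaries mapping characters to probabilities, a substring "occurs" at a position when the product of its character probabilities is at least a cutoff (the hypothetical parameter `threshold`), and a tandem repeat is reported when the same unit occurs at two directly adjacent positions. The names `occurrences` and `tandem_repeats` are illustrative only.

```python
from typing import Dict, List, Tuple

# Assumed model: one dict per position, mapping character -> probability.
WeightedSeq = List[Dict[str, float]]


def occurrences(seq: WeightedSeq, start: int, length: int,
                threshold: float) -> Dict[str, float]:
    """Return every string of the given length that occurs at `start`
    with probability >= threshold, mapped to that probability."""
    partial = {"": 1.0}
    for pos in range(start, start + length):
        extended = {}
        for prefix, p in partial.items():
            for ch, w in seq[pos].items():
                q = p * w
                # Probabilities never grow when a prefix is extended,
                # so a prefix below the cutoff can be dropped safely.
                if q >= threshold:
                    extended[prefix + ch] = q
        partial = extended
        if not partial:
            break
    return partial


def tandem_repeats(seq: WeightedSeq, threshold: float) -> List[Tuple[int, str]]:
    """Return (start, unit) pairs where `unit` occurs at `start` and again
    immediately after it, each copy with probability >= threshold."""
    n = len(seq)
    found = []
    for length in range(1, n // 2 + 1):
        for start in range(0, n - 2 * length + 1):
            first = occurrences(seq, start, length, threshold)
            if not first:
                continue
            second = occurrences(seq, start + length, length, threshold)
            for unit in first:
                if unit in second:
                    found.append((start, unit))
    return found


if __name__ == "__main__":
    # Position 1 is uncertain: C or G, each with probability 0.5.
    example = [{"A": 1.0}, {"C": 0.5, "G": 0.5}, {"A": 1.0}, {"C": 1.0}]
    print(tandem_repeats(example, 0.25))
```

With the example sequence and a cutoff of 0.25, the sketch reports the unit "AC" at position 0, because "AC" occurs at positions 0 and 2 with probabilities 0.5 and 1.0. Raising the cutoff to 0.75 removes it, since the first copy no longer qualifies.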
Related papers
  • [1] Locating tandem repeats in weighted sequences in proteins
    Zhang, Hui
    Guo, Qing
    Iliopoulos, Costas S.
    BMC BIOINFORMATICS, 2013, 14
  • [2] Locating Tandem Repeats in Weighted Biological Sequences
    Zhang, Hui
    Guo, Qing
    Iliopoulos, Costas S.
    EMERGING INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, 2012, 304 : 118 - +
  • [3] Loose and Strict Repeats in Weighted Sequences of Proteins
    Zhang, Hui
    Guo, Qing
    Fan, Jing
    Iliopoulos, Costas S.
    PROTEIN AND PEPTIDE LETTERS, 2010, 17 (09) : 1136 - 1142
  • [4] Editorial for special issue "Proteins with tandem repeats: sequences, structures and functions"
    Kajava, Andrey V.
    Tosatto, Silvio C. E.
    JOURNAL OF STRUCTURAL BIOLOGY, 2018, 201 (02) : 86 - 87
  • [5] Impact of tandem repeats on the scaling of nucleotide sequences
    Nagarajan, Radhakrishnan
    Upreti, Meenaksh
    INTERNATIONAL JOURNAL OF BIFURCATION AND CHAOS, 2006, 16 (10) : 3103 - 3108
  • [6] Detection and visualization of tandem repeats in DNA sequences
    Buchner, M
    Janjarasjitt, S
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2003, 51 (09) : 2280 - 2287
  • [7] STRING: finding tandem repeats in DNA sequences
    Parisi, V
    De Fonzo, V
    Aluffi-Pentini, F
    BIOINFORMATICS, 2003, 19 (14) : 1733 - 1738
  • [8] Finding approximate tandem repeats in genomic sequences
    Wexler, Y
    Yakhini, Z
    Kashi, Y
    Geiger, D
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2005, 12 (07) : 928 - 942
  • [9] Optimal computation of all tandem repeats in a weighted sequence
    Barton, Carl
    Iliopoulos, Costas S.
    Pissis, Solon P.
    ALGORITHMS FOR MOLECULAR BIOLOGY, 2014, 9