Locating tandem repeats in weighted sequences in proteins

被引:0
|
作者
Hui Zhang
Qing Guo
Costas S Iliopoulos
机构
[1] Zhejiang University of Technology,College of Computer Science and Technology
[2] Zhejiang University,Corresponding author. College of Computer Science and Engineering
[3] King's College London Strand,Department of Computer Science
来源
关键词
Equivalence Class; Tandem Repeat; Independent Component Analysis; Weighted Sequence; Nonnegative Matrix Factorization;
D O I
暂无
中图分类号
学科分类号
摘要
A weighted biological sequence is a string in which a set of characters may appear at each position with respective probabilities of occurrence. We attempt to locate all the tandem repeats in a weighted sequence. A repeated substring is called a tandem repeat if each occurrence of the substring is directly adjacent to each other. By introducing the idea of equivalence classes in weighted sequences, we identify the tandem repeats of every possible length using an iterative partitioning technique. We also present the algorithm for recording the tandem repeats, and prove that the problem can be solved in O(n2) time.
引用
收藏
相关论文
共 50 条
  • [21] Finding the region of pseudo-periodic tandem repeats in biological sequences
    Xiaowen Liu
    Lusheng Wang
    Algorithms for Molecular Biology, 1
  • [22] Expansion of tandem repeats and oligomer clustering in coding and noncoding DNA sequences
    Buldyrev, SV
    Dokholyan, NV
    Havlin, S
    Stanley, HE
    Stanley, RHR
    PHYSICA A, 1999, 273 (1-2): : 19 - 32
  • [23] PORCINE (CT)(N) SEQUENCES - STRUCTURE AND ASSOCIATION WITH DISPERSED AND TANDEM REPEATS
    WILKE, K
    JUNG, M
    CHEN, YZ
    GELDERMANN, H
    GENOMICS, 1994, 21 (01) : 63 - 70
  • [24] Detection of Tandem Repeats in DNA Sequences Based on Parametric Spectral Estimation
    Zhou, Hongxia
    Du, Liping
    Yan, Hong
    IEEE TRANSACTIONS ON INFORMATION TECHNOLOGY IN BIOMEDICINE, 2009, 13 (05): : 747 - 755
  • [25] Power law distribution of dimeric tandem repeats in DNA sequences.
    Dokholyan, N
    Buldyrev, S
    Havlin, S
    Stanley, HE
    PHYSICS OF COMPLEX SYSTEMS, 1997, 134 : 742 - 742
  • [26] A STRP-ed definition of Structured Tandem Repeats in Proteins
    Monzon, Alexander Miguel
    Arrias, Paula Nazarena
    Elofsson, Arne
    Mier, Pablo
    Andrade-Navarro, Miguel A.
    Bevilacqua, Martina
    Clementel, Damiano
    Bateman, Alex
    Hirsh, Layla
    Fornasari, Maria Silvina
    Parisi, Gustavo
    Piovesan, Damiano
    Kajava, Andrey V.
    Tosatto, Silvio C. E.
    JOURNAL OF STRUCTURAL BIOLOGY, 2023, 215 (04)
  • [27] Finding the region of pseudo-periodic tandem repeats in biological sequences
    Liu, Xiaowen
    Wang, Lusheng
    ALGORITHMS FOR MOLECULAR BIOLOGY, 2006, 1 (1)
  • [28] Autoregressive models for spectral analysis of short tandem repeats in DNA sequences
    Hongxia Zhou
    Hong Yan
    2006 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-6, PROCEEDINGS, 2006, : 1286 - +
  • [29] Repetitive sequences in the crocodilian mitochondrial control region: Poly-A sequences and heteroplasmic tandem repeats
    Ray, DA
    Densmore, LD
    MOLECULAR BIOLOGY AND EVOLUTION, 2003, 20 (06) : 1006 - 1013