Locating tandem repeats in weighted sequences in proteins

Authors
Hui Zhang
Qing Guo
Costas S Iliopoulos
Affiliations
[1] Zhejiang University of Technology, College of Computer Science and Technology
[2] Zhejiang University, College of Computer Science and Engineering (corresponding author)
[3] King's College London, Strand, Department of Computer Science
Keywords
Equivalence Class; Tandem Repeat; Independent Component Analysis; Weighted Sequence; Nonnegative Matrix Factorization;
DOI
Not available
Abstract
A weighted biological sequence is a string in which a set of characters may appear at each position, each with its own probability of occurrence. We attempt to locate all the tandem repeats in a weighted sequence. A repeated substring is called a tandem repeat if its occurrences are directly adjacent to one another. By introducing the idea of equivalence classes in weighted sequences, we identify the tandem repeats of every possible length using an iterative partitioning technique. We also present the algorithm for recording the tandem repeats, and prove that the problem can be solved in O(n^2) time.
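To make the definitions in the abstract concrete, the following is a minimal brute-force sketch in Python. It is not the equivalence-class partitioning algorithm of the paper and does not achieve the O(n^2) bound. It rests on several assumptions that the abstract does not state: a weighted sequence is modelled as a list of dictionaries mapping each character to its probability, a substring "occurs" at a position when the product of its character probabilities is at least a user-chosen `threshold`, and a tandem repeat is reported when two adjacent copies each meet that threshold. The names `factors_at` and `tandem_repeats` are hypothetical.

```python
from typing import Dict, List, Tuple

# Assumed representation: one dict per position, character -> probability.
WeightedSeq = List[Dict[str, float]]


def factors_at(seq: WeightedSeq, start: int, length: int,
               threshold: float) -> List[Tuple[str, float]]:
    """Return every (word, probability) of the given length that starts at
    `start` and whose probability product is at least `threshold`."""
    candidates = [("", 1.0)]
    for pos in range(start, start + length):
        # Extend each surviving prefix by one character and prune early:
        # a prefix below the threshold can never recover.
        candidates = [
            (word + ch, prob * weight)
            for word, prob in candidates
            for ch, weight in seq[pos].items()
            if prob * weight >= threshold
        ]
    return candidates


def tandem_repeats(seq: WeightedSeq,
                   threshold: float) -> List[Tuple[int, int, str]]:
    """Return (start, period, word) for every word that occurs at `start`
    and again, directly adjacent, at `start + period`."""
    n = len(seq)
    found = []
    for period in range(1, n // 2 + 1):
        for start in range(0, n - 2 * period + 1):
            for word, _ in factors_at(seq, start, period, threshold):
                # Probability of the same word in the adjacent window.
                second = 1.0
                for offset, ch in enumerate(word):
                    second *= seq[start + period + offset].get(ch, 0.0)
                if second >= threshold:
                    found.append((start, period, word))
    return found


if __name__ == "__main__":
    # Position 1 is uncertain: C or G, each with probability 0.5.
    example = [{"A": 1.0}, {"C": 0.5, "G": 0.5}, {"A": 1.0}, {"C": 1.0}]
    print(tandem_repeats(example, threshold=0.25))
```

In this toy input the word AC occurs at position 0 with probability 0.5 and at position 2 with probability 1.0, so it is reported as a tandem repeat of period 2 when the threshold is 0.25, and it is no longer reported once the threshold exceeds 0.5.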
Related papers
  • [11] Tandem repeats finder: a program to analyze DNA sequences
    Benson, G
    NUCLEIC ACIDS RESEARCH, 1999, 27 (02) : 573 - 580
  • [12] Tandem repeats in proteins: From sequence to structure
    Kajava, Andrey V.
    JOURNAL OF STRUCTURAL BIOLOGY, 2012, 179 (03) : 279 - 288
  • [13] TANDEM SEQUENCE REPEATS IN TRANSMEMBRANE CHANNEL PROTEINS
    WISTOW, GJ
    PISANO, MM
    CHEPELINSKY, AB
    TRENDS IN BIOCHEMICAL SCIENCES, 1991, 16 (05) : 170 - 171
  • [14] A MAPREDUCE FRAMEWORK FOR DETECTION OF TANDEM REPEATS IN DNA SEQUENCES
    Vandanababu, T.
    Bhukya, Raju
    Veeraiah, D.
    Paul, P. Victer
    INTERNATIONAL JOURNAL OF FUTURE GENERATION COMMUNICATION AND NETWORKING, 2019, 12 (01) : 13 - 24
  • [15] Variable Tandem Repeats Accelerate Evolution of Coding and Regulatory Sequences
    Gemayel, Rita
    Vinces, Marcelo D.
    Legendre, Matthieu
    Verstrepen, Kevin J.
    ANNUAL REVIEW OF GENETICS, VOL 44, 2010, 44 : 445 - 477
  • [16] Search for Highly Divergent Tandem Repeats in Amino Acid Sequences
    Rudenko, Valentina
    Korotkov, Eugene
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2021, 22 (13)
  • [17] Statistical approaches to detecting and analyzing tandem repeats in genomic sequences
    Anisimova, Maria
    Pecerska, Julija
    Schaper, Elke
    FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY, 2015, 3
  • [18] Non-Globular Structures of Tandem Repeats in Proteins
    Matsushima, Norio
    Tanaka, Takanori
    Kretsinger, Robert H.
    PROTEIN AND PEPTIDE LETTERS, 2009, 16 (11) : 1297 - 1322
  • [19] Tandem repeats in proteins: prediction algorithms and biological role
    Pellegrini, Marco
    FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY, 2015, 3
  • [20] MGWT based Algorithm for Tandem Repeats Detection in DNA Sequences
    Garg, Pardeep
    Sharma, SunilDatt
    PROCEEDINGS OF 2019 5TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMPUTING AND CONTROL (ISPCC 2K19), 2019 : 196 - 199