Exhaustive whole-genome tandem repeats search

被引:27
|
作者
Krishnan, A [1 ]
Tang, F [1 ]
机构
[1] Bioinformat Inst, Singapore 138671, Singapore
关键词
D O I
10.1093/bioinformatics/bth311
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Approximate tandem repeats (ATR) occur frequently in the genomes of organisms, and are a source of polymorphisms observed in individuals, and thus are of interest to those studying genetic disorders. Though extensive work has been done in order to identify ATRs, there are inherent limitations with the current approaches in terms of the number of pattern sizes that can be searched or the size of the input length. Results: This paper describes (1) a new algorithm which exhaustively finds all variable-length ATRs in a genomic sequence and (2) a precise description of, and an algorithm to significantly reduce, redundancy in the output. Our ATR definition is parameterized by a mismatch ratio p which allows for more mismatches in longer tandem repeats (and fewer in shorter). Furthermore, our algorithm is embarrassingly parallel and thus can attain near-linear speed-up on Beowulf clusters. We present results of our algorithm applied to sequences of widely differing lengths (from genes to chromosomes).
引用
收藏
页码:2702 / 2710
页数:9
相关论文
共 50 条
  • [21] Comparison of methods of genotyping short tandem repeats (STRs) from whole genome sequences
    Oketch, John W.
    Wain, Louise V.
    Hollox, Edward J.
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2022, 30 (SUPPL 1) : 500 - 501
  • [22] Distribution of tandem repeats in human genome
    Fridman, M.
    Kulakovskiy, I.
    Lvovs, D.
    Oparina, N.
    Makeev, V.
    FEBS JOURNAL, 2013, 280 : 20 - 20
  • [23] Tandem repeats in the rodent genome and their mapping
    Ostromyshenskii D.I.
    Kuznetsova I.S.
    Komissarov A.S.
    Kartavtseva I.V.
    Podgornaya O.I.
    Cell and Tissue Biology, 2015, 9 (3) : 217 - 225
  • [24] The whole-genome survey of Acer griseum, its polymorphic simple sequence repeats development and application
    Zhou, Xiao-Jun
    Tian, Yu-Wei
    Li, Rui-Han
    BIOCELL, 2023, 47 (08) : 1907 - 1913
  • [25] Whole-genome analyses of whole-brain data: working within an expanded search space
    Medland, Sarah E.
    Jahanshad, Neda
    Neale, Benjamin M.
    Thompson, Paul M.
    NATURE NEUROSCIENCE, 2014, 17 (06) : 791 - 800
  • [26] Whole-genome analyses of whole-brain data: working within an expanded search space
    Sarah E Medland
    Neda Jahanshad
    Benjamin M Neale
    Paul M Thompson
    Nature Neuroscience, 2014, 17 : 791 - 800
  • [27] Whole-genome experimental identification of insertion/deletion polymorphisms of interspersed repeats by a new general approach
    Mamedov, IZ
    Arzumanyan, ES
    Amosova, AL
    Lebedev, YB
    Sverdlov, ED
    NUCLEIC ACIDS RESEARCH, 2005, 33 (02) : e16
  • [28] High-depth whole-genome sequencing identifies structure variants, copy number variants and short tandem repeats associated with Parkinson's disease
    Wang, Chaodong
    Liu, Hankui
    Li, Xu-Ying
    Ma, Jinghong
    Gu, Zhuqin
    Feng, Xiuli
    Xie, Shu
    Tang, Bei-Sha
    Chen, Shengdi
    Wang, Wei
    Wang, Jian
    Zhang, Jianguo
    Chan, Piu
    NPJ PARKINSONS DISEASE, 2024, 10 (01)
  • [29] Interpreting Whole-Genome Sequencing
    Grody, Wayne W.
    Vilain, Eric
    Nelson, Stanley F.
    JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2014, 312 (03): : 296 - 296
  • [30] Whole-genome sequencing in pharmacogeneticson
    Urban, Thomas J.
    PHARMACOGENOMICS, 2013, 14 (04) : 345 - 348