Decomposing mosaic tandem repeats accurately from long reads

Cited by: 4
Authors
Masutani, Bansho [1]
Kawahara, Riki [1]
Morishita, Shinichi [1]
Affiliations
[1] Univ Tokyo, Grad Sch Frontier Sci, Dept Computat Biol & Med Sci, Chiba 2778562, Japan
Keywords
EXPANSION; DNA; SEQUENCES; EVOLUTION; GLOBIN
DOI
10.1093/bioinformatics/btad185
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Motivation: Over the past 30 years, extended tandem repeats (TRs) have been correlated with approximately 60 diseases with high odds ratios, and most known TRs consist of single repeat units. In the last few years, however, long-read sequencing has revealed mosaic TRs, composed of different units, that are associated with several brain disorders. Mosaic TRs are sequence configurations that are difficult to characterize and are usually confirmed by manual inspection. Widely used tools are not designed to solve the mosaic TR problem and often fail to decompose mosaic TRs properly.
Results: We propose an efficient algorithm that can decompose mosaic TRs in the input string with high sensitivity. Using synthetic benchmark data, we demonstrate that our program, named uTR, outperforms TRF and RepeatMasker in prediction accuracy, especially when mosaic TRs are more complex, and that uTR is faster than TRF and RepeatMasker in most cases.
Pages: 6