RF: A method for filtering short reads with tandem repeats for genome mapping

被引:7
|
作者
Misawa, Kazuharu [1 ]
机构
[1] RIKEN, Res Program Computat Sci, Res & Dev Grp Next Generat Integrated Living Matt, Fus Data & Anal Res & Dev Team, Yokohama, Kanagawa 2300045, Japan
关键词
Tandem repeats; Human genome; Mapping; Next-generation sequencing; REPETITIVE DNA; ALIGNMENT; PARAMETERS; ELEMENTS;
D O I
10.1016/j.ygeno.2013.03.002
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Next-generation sequencing platforms generate short (50-150 bp) reads that can be mapped onto the reference genome. Repetitive sequences in the genome, because of the presence of similar or identical sequences, cause mapping errors in the case of the short reads. By filtering short reads with repeats, mapping will be improved. I developed RF. RF is a new method that filters short reads with tandem repeats. A scoring scheme was developed that assigned higher scores to regions with tandem repeats and lower scores to regions without tandem repeats. In this study, IF was applied to filter out short reads with repeats, before short reads were mapped onto the same genomic contig by using a short read-mapping program. The result suggests RF improved the proportion of correctly mapped short reads on filtering the repeats. RF is a useful tool for reducing mapping errors of short reads onto reference genomes. (C) 2013 Elsevier Inc. All rights reserved.
引用
收藏
页码:35 / 37
页数:3
相关论文
共 50 条
  • [21] Decomposing mosaic tandem repeats accurately from long reads
    Masutani, Bansho
    Kawahara, Riki
    Morishita, Shinichi
    BIOINFORMATICS, 2023, 39 (04)
  • [22] A reference haplotype panel for genome-wide imputation of short tandem repeats
    Shubham Saini
    Ileena Mitra
    Nima Mousavi
    Stephanie Feupe Fotsing
    Melissa Gymrek
    Nature Communications, 9
  • [23] A reference haplotype panel for genome-wide imputation of short tandem repeats
    Saini, Shubham
    Mitra, Ileena
    Mousavi, Nima
    Fotsing, Stephanie Feupe
    Gymrek, Melissa
    NATURE COMMUNICATIONS, 2018, 9
  • [24] CoLoRMap: Correcting Long Reads by Mapping short reads
    Haghshenas, Ehsan
    Hach, Faraz
    Sahinalp, S. Cenk
    Chauve, Cedric
    BIOINFORMATICS, 2016, 32 (17) : 545 - 551
  • [25] Short tandem repeats of human genome are intrinsically unstable in cultured cells in vivo
    Liu, Yuzhe
    Li, Jinhuan
    Wu, Qiang
    GENE, 2023, 877
  • [26] Probably Correct: Rescuing Repeats with Short and Long Reads
    Cechova, Monika
    GENES, 2021, 12 (01) : 1 - 13
  • [27] Distribution of tandem repeats in human genome
    Fridman, M.
    Kulakovskiy, I.
    Lvovs, D.
    Oparina, N.
    Makeev, V.
    FEBS JOURNAL, 2013, 280 : 20 - 20
  • [28] An enrichment method for mapping ambiguous reads to the reference genome for NGS analysis
    Liu, Yuan
    Ma, Yongchao
    Salsman, Evan
    Manthey, Frank A.
    Elias, Elias M.
    Li, Xuehui
    Yan, Changhui
    JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2019, 17 (06)
  • [29] G-MAPSEQ - A NEW METHOD FOR MAPPING READS TO A REFERENCE GENOME
    Wojciechowski, Pawel
    Frohmberg, Wojciech
    Kierzynka, Michal
    Zurkowski, Piotr
    Blazewicz, Jacek
    FOUNDATIONS OF COMPUTING AND DECISION SCIENCES, 2016, 41 (02) : 123 - 142
  • [30] STR PRIMER EXTENSION - AN EFFICIENT METHOD FOR ISOLATING SHORT TANDEM REPEATS
    YANG, H
    UDAR, N
    DANDEKAR, S
    LIANG, T
    UHRHAMMER, N
    SAMARA, GJ
    CHIPLUNKAR, S
    CHEN, X
    HUO, Y
    PATEL, N
    DORIAN, A
    GATTI, RA
    AMERICAN JOURNAL OF HUMAN GENETICS, 1993, 53 (03) : 1111 - 1111