Review of alignment and SNP calling algorithms for next-generation sequencing data

被引:0
|
作者
M. Mielczarek
J. Szyda
机构
[1] Wroclaw University of Environmental and Life Sciences,Biostatistics Group, Department of Genetics
来源
关键词
Alignment; Genotype calling; NGS; SNP calling; Review; Software;
D O I
暂无
中图分类号
学科分类号
摘要
Application of the massive parallel sequencing technology has become one of the most important issues in life sciences. Therefore, it was crucial to develop bioinformatics tools for next-generation sequencing (NGS) data processing. Currently, two of the most significant tasks include alignment to a reference genome and detection of single nucleotide polymorphisms (SNPs). In many types of genomic analyses, great numbers of reads need to be mapped to the reference genome; therefore, selection of the aligner is an essential step in NGS pipelines. Two main algorithms—suffix tries and hash tables—have been introduced for this purpose. Suffix array-based aligners are memory-efficient and work faster than hash-based aligners, but they are less accurate. In contrast, hash table algorithms tend to be slower, but more sensitive. SNP and genotype callers may also be divided into two main different approaches: heuristic and probabilistic methods. A variety of software has been subsequently developed over the past several years. In this paper, we briefly review the current development of NGS data processing algorithms and present the available software.
引用
收藏
页码:71 / 79
页数:8
相关论文
共 50 条
  • [41] Identification of indels in next-generation sequencing data
    Ratan, Aakrosh
    Olson, Thomas L.
    Loughran, Thomas P., Jr.
    Miller, Webb
    [J]. BMC BIOINFORMATICS, 2015, 16
  • [42] Visualizing next-generation sequencing data with JBrowse
    Westesson, Oscar
    Skinner, Mitchell
    Holmes, Ian
    [J]. BRIEFINGS IN BIOINFORMATICS, 2013, 14 (02) : 172 - 177
  • [43] Focus on next-generation sequencing data analysis
    Rusk N.
    [J]. Nature Methods, 2009, 6 (Suppl 11) : S1 - S1
  • [44] Next-generation sequencing: adjusting to data overload
    Baker, Monya
    [J]. NATURE METHODS, 2010, 7 (07) : 495 - 499
  • [45] Next-generation sequencing: adjusting to data overload
    Monya Baker
    [J]. Nature Methods, 2010, 7 : 495 - 499
  • [46] Next-generation sequencing and the evolution of data sharing
    de Macena Sobreira, Nara Lygia
    Hamosh, Ada
    [J]. AMERICAN JOURNAL OF MEDICAL GENETICS PART A, 2021, 185 (09) : 2633 - 2635
  • [47] Applications and data analysis of next-generation sequencing
    Vogl, Ina
    Benet-Pages, Anna
    Eck, Sebastian H.
    Kuhn, Marius
    Vosberg, Sebastian
    Greif, Philipp A.
    Metzeler, Klaus H.
    Biskup, Saskia
    Mueller-Reible, Clemens
    Klein, Hanns-Georg
    [J]. LABORATORIUMSMEDIZIN-JOURNAL OF LABORATORY MEDICINE, 2013, 37 (06): : 305 - 315
  • [48] Identification of indels in next-generation sequencing data
    Aakrosh Ratan
    Thomas L Olson
    Thomas P Loughran
    Webb Miller
    [J]. BMC Bioinformatics, 16
  • [49] Pathway analysis with next-generation sequencing data
    Zhao, Jinying
    Zhu, Yun
    Boerwinkle, Eric
    Xiong, Momiao
    [J]. EUROPEAN JOURNAL OF HUMAN GENETICS, 2015, 23 (04) : 507 - 515
  • [50] Genotyping microsatellites in next-generation sequencing data
    Harriet Dashnow
    Susan Tan
    Debjani Das
    Simon Easteal
    Alicia Oshlack
    [J]. BMC Bioinformatics, 16