DNA Sequencing Technologies: Sequencing Data Protocols and Bioinformatics Tools

被引:9
|
作者
Wong, Ka-Chun [1 ]
Zhang, Jiao [1 ]
Yan, Shankai [1 ]
Li, Xiangtao [2 ]
Lin, Qiuzhen [3 ]
Kwong, Sam [1 ]
Liang, Cheng [4 ]
机构
[1] City Univ Hong Kong, Dept Comp Sci, Kowloon Tong, Tat Chee Ave, Hong Kong, Peoples R China
[2] Northeast Normal Univ, Sch Comp Sci, Changchun, Jilin, Peoples R China
[3] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen, Peoples R China
[4] Shandong Normal Univ, Sch Informat Sci & Engn, Jinan, Shandong, Peoples R China
关键词
DNA sequencing; third-generation sequencing (TGS); history; technology; data protocols; bioinformatics; computational biology; tools; software; SOMATIC POINT MUTATIONS; SHORT-READ ALIGNMENT; QUALITY SCORES; SNP DETECTION; COPY NUMBER; STRUCTURAL VARIATION; INSERTION-DELETION; GENOME; VARIANTS; FRAMEWORK;
D O I
10.1145/3340286
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The recent advances in DNA sequencing technology, from first-generation sequencing (FGS) to third-generation sequencing (TGS), have constantly transformed the genome research landscape. Its data throughput is unprecedented and severalfold as compared with past technologies. DNA sequencing technologies generate sequencing data that are big, sparse, and heterogeneous. This results in the rapid development of various data protocols and bioinformatics tools for handling sequencing data. In this review, a historical snapshot of DNA sequencing is taken with an emphasis on data manipulation and tools. The technological history of DNA sequencing is described and reviewed in thorough detail. To manipulate the sequencing data generated, different data protocols are introduced and reviewed. In particular, data compression methods are highlighted and discussed to provide readers a practical perspective in the real-world setting. A large variety of bioinformatics tools are also reviewed to help readers extract the most from their sequencing data in different aspects, such as sequencing quality control, genomic visualization, single-nucleotide variant calling, INDEL calling, structural variation calling, and integrative analysis. Toward the end of the article, we critically discuss the existing DNA sequencing technologies for their pitfalls and potential solutions.
引用
收藏
页数:30
相关论文
共 50 条
  • [21] The Source of the Data Flood: Sequencing Technologies
    Magi, Alberto
    Pisanti, Nadia
    Tattini, Lorenzo
    ERCIM NEWS, 2016, (104): : 25 - 26
  • [22] DNA methylation data by sequencing: experimental approaches and recommendations for tools and pipelines for data analysis
    Rauluseviciute, Ieva
    Drablos, Finn
    Rye, Morten Beck
    CLINICAL EPIGENETICS, 2019, 11 (01)
  • [23] DNA methylation data by sequencing: experimental approaches and recommendations for tools and pipelines for data analysis
    Ieva Rauluseviciute
    Finn Drabløs
    Morten Beck Rye
    Clinical Epigenetics, 2019, 11
  • [24] Confirming variants discovered by Next Generation Sequencing (NGS) with Sanger sequencing using innovative bioinformatics tools
    Schreiber, E.
    Berosik, S.
    Wenz, M.
    Chang, S.
    Jackson, S.
    Zhai, J.
    Schneider, S.
    Brzoska, P.
    EUROPEAN JOURNAL OF CANCER, 2016, 61 : S19 - S19
  • [25] Sequencing technologies and genome sequencing
    Chandra Shekhar Pareek
    Rafal Smoczynski
    Andrzej Tretyn
    Journal of Applied Genetics, 2011, 52 : 413 - 435
  • [26] Sequencing technologies and genome sequencing
    Pareek, Chandra Shekhar
    Smoczynski, Rafal
    Tretyn, Andrzej
    JOURNAL OF APPLIED GENETICS, 2011, 52 (04) : 413 - 435
  • [27] DNA sequencing trumps standard screening tools
    Linda Koch
    Nature Reviews Genetics, 2014, 15 : 288 - 288
  • [28] Quest for technologies to cut DNA sequencing costs
    Chapman, CR
    GENETIC ENGINEERING NEWS, 2006, 26 (12): : 9 - 11
  • [29] Bioinformatics of nanopore sequencing
    Makalowski, Wojciech
    Shabardina, Victoria
    JOURNAL OF HUMAN GENETICS, 2020, 65 (01) : 61 - 67
  • [30] New DNA-sequencing technologies advancing
    Gibbs, RA
    GENETIC ENGINEERING NEWS, 2006, 26 (12): : 4 - 5