Next-Generation Sequencing Data Analysis on Pool-Seq and Low-Coverage Retinoblastoma Data

被引:1
|
作者
Ozdemir Ozdogan, Gulistan [1 ]
Kaya, Hilal [1 ]
机构
[1] Ankara Yildirim Beyazit Univ, Dept Comp Engn, TR-06010 Ankara, Turkey
关键词
Low-coverage sequencing; NGS data analysis; Pool-seq; Retinoblastoma; READ ALIGNMENT; VARIANTS;
D O I
10.1007/s12539-020-00374-8
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Next-generation sequencing (NGS) is related to massively parallel or deep deoxyribonucleic acid (DNA) sequencing technology which has revolutionized genomic researches in recent years. Although the cost of generating NGS data was decreased compared to the one at the time of emerging this technology, its cost might still be somewhat a problem. Hence, new strategies as pool-seq and low-coverage NGS data have been developed to overcome the cost problem. Despite decreasing cost, it is important to elucidate whether they are efficient in NGS studies. We applied a bioinformatics pipeline on pool-seq and low-coverage retinoblastoma data retrieved from only tumor data. Retinoblastoma is an eye malignancy in childhood that is initiated by RB1 mutation or MYCN amplification and can lead to the loss of vision of eye(s), and even sometimes life. We applied our pipeline on both retinoblastoma disease data and two other particular data to testify the validity and also for comparison purposes in the aspect of performance. High-confidence variant calls from Genome in a Bottle Consortium were used for fulfilling these purposes. We observed that our pipeline successfully called higher number of variants than a standard pipeline for all these three different data. Besides, the recall and F-score values were quite better in our pipeline as being noteworthy. We further presented our results on disease data in the aspects of the variants, variant types and disease-related genes. This study provides a guideline for performing NGS data analysis pipeline on pool-seq and low-coverage sequencing data in conjunction. To get more conclusive outcomes of these two strategies, we recommend using cancer data having higher mutation rates and larger pools.
引用
收藏
页码:302 / 310
页数:9
相关论文
共 50 条
  • [1] Next-Generation Sequencing Data Analysis on Pool-Seq and Low-Coverage Retinoblastoma Data
    Gülistan Özdemir Özdoğan
    Hilal Kaya
    [J]. Interdisciplinary Sciences: Computational Life Sciences, 2020, 12 : 302 - 310
  • [2] Estimating optimal window size for analysis of low-coverage next-generation sequence data
    Gusnanto, Arief
    Taylor, Charles C.
    Nafisah, Ibrahim
    Wood, Henry M.
    Rabbitts, Pamela
    Berri, Stefano
    [J]. BIOINFORMATICS, 2014, 30 (13) : 1823 - 1829
  • [3] Detection of chromosomal abnormalities in spontaneous miscarriage by low-coverage next-generation sequencing
    Li, Fen-Xia
    Xie, Mei-Juan
    Qu, Shou-Fang
    He, Dan
    Wu, Long
    Liang, Zhi-Kun
    Wu, Ying-Song
    Yang, Fang
    Yang, Xue-Xi
    [J]. MOLECULAR MEDICINE REPORTS, 2020, 22 (02) : 1269 - 1276
  • [4] Robust Linear Trend Test for Low-Coverage Next-Generation Sequence Data Controlling for Covariates
    Lee, Jung Yeon
    Kim, Myeong-Kyu
    Kim, Wonkuk
    [J]. MATHEMATICS, 2020, 8 (02)
  • [6] Pathway analysis with next-generation sequencing data
    Jinying Zhao
    Yun Zhu
    Eric Boerwinkle
    Momiao Xiong
    [J]. European Journal of Human Genetics, 2015, 23 : 507 - 515
  • [7] Focus on next-generation sequencing data analysis
    Rusk N.
    [J]. Nature Methods, 2009, 6 (Suppl 11) : S1 - S1
  • [8] Applications and data analysis of next-generation sequencing
    Vogl, Ina
    Benet-Pages, Anna
    Eck, Sebastian H.
    Kuhn, Marius
    Vosberg, Sebastian
    Greif, Philipp A.
    Metzeler, Klaus H.
    Biskup, Saskia
    Mueller-Reible, Clemens
    Klein, Hanns-Georg
    [J]. LABORATORIUMSMEDIZIN-JOURNAL OF LABORATORY MEDICINE, 2013, 37 (06): : 305 - 315
  • [9] Pathway analysis with next-generation sequencing data
    Zhao, Jinying
    Zhu, Yun
    Boerwinkle, Eric
    Xiong, Momiao
    [J]. EUROPEAN JOURNAL OF HUMAN GENETICS, 2015, 23 (04) : 507 - 515
  • [10] Novel Methods to Optimize Genotypic Imputation for Low-Coverage, Next-Generation Sequence Data in Crop Plants
    Swarts, Kelly
    Li, Huihui
    Navarro, J. Alberto Romero
    An, Dong
    Romay, Maria Cinta
    Hearne, Sarah
    Acharya, Charlotte
    Glaubitz, Jeffrey C.
    Mitchell, Sharon
    Elshire, Robert J.
    Buckler, Edward S.
    Bradbury, Peter J.
    [J]. PLANT GENOME, 2014, 7 (03):