SNP genotyping and parameter estimation in polyploids using low-coverage sequencing data

被引:52
|
作者
Blischak, Paul D. [1 ]
Kubatko, Laura S. [1 ,2 ]
Wolfe, Andrea D. [1 ]
机构
[1] Ohio State Univ, Dept Evolut Ecol & Organismal Biol, Columbus, OH 43210 USA
[2] Ohio State Univ, Dept Stat, Columbus, OH 43210 USA
基金
美国国家科学基金会;
关键词
POPULATION GENETIC-STRUCTURE; MAXIMUM-LIKELIHOOD; GENOME; DISCOVERY; FRAMEWORK; LOCI; DIFFERENTIATION; FREQUENCY; DOMINANT;
D O I
10.1093/bioinformatics/btx587
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Genotyping and parameter estimation using high throughput sequencing data are everyday tasks for population geneticists, but methods developed for diploids are typically not applicable to polyploid taxa. This is due to their duplicated chromosomes, as well as the complex patterns of allelic exchange that often accompany whole genome duplication (WGD) events. For WGDs within a single lineage (autopolyploids), inbreeding can result from mixed mating and/or double reduction. For WGDs that involve hybridization (allopolyploids), alleles are typically inherited through independently segregating subgenomes. Results: We present two new models for estimating genotypes and population genetic parameters from genotype likelihoods for auto-and allopolyploids. We then use simulations to compare these models to existing approaches at varying depths of sequencing coverage and ploidy levels. These simulations show that our models typically have lower levels of estimation error for genotype and parameter estimates, especially when sequencing coverage is low. Finally, we also apply these models to two empirical datasets from the literature. Overall, we show that the use of genotype likelihoods to model non-standard inheritance patterns is a promising approach for conducting population genomic inferences in polyploids.
引用
收藏
页码:407 / 415
页数:9
相关论文
共 50 条
  • [1] SNP detection and genotyping from low-coverage sequencing data on multiple diploid samples
    Le, Si Quang
    Durbin, Richard
    GENOME RESEARCH, 2011, 21 (06) : 952 - 960
  • [2] Comparing a few SNP calling algorithms using low-coverage sequencing data
    Xiaoqing Yu
    Shuying Sun
    BMC Bioinformatics, 14
  • [3] Comparing a few SNP calling algorithms using low-coverage sequencing data
    Yu, Xiaoqing
    Sun, Shuying
    BMC BIOINFORMATICS, 2013, 14
  • [4] Reveel: large-scale population genotyping using low-coverage sequencing data
    Huang, Lin
    Wang, Bo
    Chen, Ruitang
    Bercovici, Sivan
    Batzoglou, Serafim
    BIOINFORMATICS, 2016, 32 (11) : 1686 - 1696
  • [5] Kinship Estimation Based on Extremely Low-Coverage Sequencing Data
    Dou, Jinzhuang
    Chothani, Sonia
    Sim, Xueling
    Hughes, Jason D.
    Reilly, Dermot F.
    Tai, E. Shyong
    Liu, Jianjun
    Wang, Chaolong
    GENETIC EPIDEMIOLOGY, 2016, 40 (07) : 619 - 620
  • [6] CONGA: Copy number variation genotyping in ancient genomes and low-coverage sequencing data
    Soylev, Arda
    Cokoglu, Sevim Seda
    Koptekin, Dilek
    Alkan, Can
    Somel, Mehmet
    PLOS COMPUTATIONAL BIOLOGY, 2022, 18 (12)
  • [7] Comparison of Genotype Imputation for SNP Array and Low-Coverage Whole-Genome Sequencing Data
    Deng, Tianyu
    Zhang, Pengfei
    Garrick, Dorian
    Gao, Huijiang
    Wang, Lixian
    Zhao, Fuping
    FRONTIERS IN GENETICS, 2022, 12
  • [8] NanoSNP: a progressive and haplotype-aware SNP caller on low-coverage nanopore sequencing data
    Huang, Neng
    Xu, Minghua
    Nie, Fan
    Ni, Peng
    Xiao, Chuan-Le
    Luo, Feng
    Wang, Jianxin
    BIOINFORMATICS, 2023, 39 (01)
  • [9] Improved computations for relationship inference using low-coverage sequencing data
    Mostad, Petter
    Tillmar, Andreas
    Kling, Daniel
    BMC BIOINFORMATICS, 2023, 24 (01)
  • [10] Improved computations for relationship inference using low-coverage sequencing data
    Petter Mostad
    Andreas Tillmar
    Daniel Kling
    BMC Bioinformatics, 24