Genealogical lineage sorting leads to significant, but incorrect Bayesian multilocus inference of population structure

被引:29
|
作者
Orozco-terWengel, Pablo [1 ]
Corander, Jukka [2 ]
Schloetterer, Christian [1 ]
机构
[1] Vetmeduni Vienna, Inst Populat Genet, A-1210 Vienna, Austria
[2] Univ Helsinki, Dept Math & Stat, FIN-00014 Helsinki, Finland
基金
奥地利科学基金会;
关键词
confidence of inference; Drosophila melanogaster; microsatellites; population structure; DROSOPHILA-MELANOGASTER POPULATIONS; GENOME-WIDE ASSOCIATION; MICROSATELLITE DATA; PHYLOGENETIC TREES; MITOCHONDRIAL-DNA; GENETIC-STRUCTURE; F-ST; SELECTION; FREQUENCY; AFRICAN;
D O I
10.1111/j.1365-294X.2010.04990.x
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Over the past decades, the use of molecular markers has revolutionized biology and led to the foundation of a new research discipline-phylogeography. Of particular interest has been the inference of population structure and biogeography. While initial studies focused on mtDNA as a molecular marker, it has become apparent that selection and genealogical lineage sorting could lead to erroneous inferences. As it is not clear to what extent these forces affect a given marker, it has become common practice to use the combined evidence from a set of molecular markers as an attempt to recover the signals that approximate the true underlying demography. Typically, the number of markers used is determined by either budget constraints or by statistical power required to recognize significant population differentiation. Using microsatellite markers from Drosophila and humans, we show that even large numbers of loci (> 50) can frequently result in statistically well-supported, but incorrect inference of population structure using the software baps. Most importantly, genomic features, such as chromosomal location, variability of the markers, or recombination rate, cannot explain this observation. Instead, it can be attributed to sampling variation among loci with different realizations of the stochastic lineage sorting. This phenomenon is particularly pronounced for low levels of population differentiation. Our results have important implications for ongoing studies of population differentiation, as we unambiguously demonstrate that statistical significance of population structure inferred from a random set of genetic markers cannot necessarily be taken as evidence for a reliable demographic inference.
引用
收藏
页码:1108 / 1121
页数:14
相关论文
共 22 条
  • [1] Inference of population structure using multilocus genotype data
    Pritchard, JK
    Stephens, M
    Donnelly, P
    [J]. GENETICS, 2000, 155 (02) : 945 - 959
  • [2] Structurama: Bayesian Inference of Population Structure
    Huelsenbeck, John P.
    Andolfatto, Peter
    Huelsenbeck, Edna T.
    [J]. EVOLUTIONARY BIOINFORMATICS, 2011, 7 : 55 - 59
  • [3] A non-parametric approach to population structure inference using multilocus genotypes
    Liu N.
    Zhao H.
    [J]. Human Genomics, 2 (6) : 353 - 364
  • [4] The impact of ancestral population size and incomplete lineage sorting on Bayesian estimation of species divergence times
    Angelis, Konstantinos
    Dos Reis, Mario
    [J]. CURRENT ZOOLOGY, 2015, 61 (05) : 874 - 885
  • [5] Spectrum: joint bayesian inference of population structure and recombination events
    Sohn, Kyung-Ah
    Xing, Eric P.
    [J]. BIOINFORMATICS, 2007, 23 (13) : I479 - I489
  • [6] Inference of hazel grouse population structure using multilocus data: a landscape genetic approach
    J Sahlsten
    H Thörngren
    J Höglund
    [J]. Heredity, 2008, 101 : 475 - 482
  • [7] Inference of hazel grouse population structure using multilocus data: a landscape genetic approach
    Sahlsten, J.
    Thorngren, H.
    Hoglund, J.
    [J]. HEREDITY, 2008, 101 (06) : 475 - 482
  • [8] Inference of population structure using multilocus genotype data: dominant markers and null alleles
    Falush, Daniel
    Stephens, Matthew
    Pritchard, Jonathan K.
    [J]. MOLECULAR ECOLOGY NOTES, 2007, 7 (04): : 574 - 578
  • [9] Bayesian statistical treatment of the fluorescence of AFLP bands leads to accurate genetic structure inference
    Gaggiotti, Oscar E.
    [J]. MOLECULAR ECOLOGY, 2010, 19 (21) : 4586 - 4588
  • [10] Inference of population structure using multilocus genotype data: Linked loci and correlated allele frequencies
    Falush, D
    Stephens, M
    Pritchard, JK
    [J]. GENETICS, 2003, 164 (04) : 1567 - 1587