Evaluating power to detect recurrent selective sweeps under increasingly realistic evolutionary null models

被引:7
|
作者
Soni, Vivak [1 ]
Johri, Parul [1 ,2 ,3 ]
Jensen, Jeffrey D. [1 ]
机构
[1] Arizona State Univ, Sch Life Sci, Tempe, AZ 85287 USA
[2] Univ N Carolina, Dept Biol, Chapel Hill, NC USA
[3] Univ N Carolina, Dept Genet, Chapel Hill, NC USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
selective sweeps; genome scans; genetic hitchhiking; background selection; distribution of fitness effects; demography; DROSOPHILA-MELANOGASTER; POSITIVE SELECTION; BACKGROUND SELECTION; MUTATION-RATE; GENETIC HITCHHIKING; LINKAGE DISEQUILIBRIUM; RECOMBINATION RATES; DEMOGRAPHIC HISTORY; NATURAL-SELECTION; GENOMIC SCANS;
D O I
10.1093/evolut/qpad120
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
Outlier-based genomic scans have proven a popular approach for identifying loci that have potentially experienced recent positive selection. However, it has previously been shown that an evolutionarily appropriate baseline model that incorporates nonequilibrium population histories, purifying and background selection, and variation in mutation and recombination rates is necessary to reduce often extreme false-positive rates when performing genomic scans. Here we evaluate the power to detect recurrent selective sweeps using common SFS-based and haplotype-based methods under these increasingly realistic models. We find that while these appropriate evolutionary baselines are essential to reduce false-positive rates, the power to accurately detect recurrent selective sweeps is generally low across much of the biologically relevant parameter space. The detection of selective sweeps from population genomic data often relies on the premise that the beneficial mutations in question have fixed very near the sampling time. As it has been previously shown that the power to detect a selective sweep is strongly dependent on the time since fixation as well as the strength of selection, it is naturally the case that strong, recent sweeps leave the strongest signatures. However, the biological reality is that beneficial mutations enter populations at a rate, one that partially determines the mean wait time between sweep events and hence their age distribution. An important question thus remains about the power to detect recurrent selective sweeps when they are modeled by a realistic mutation rate and as part of a realistic distribution of fitness effects, as opposed to a single, recent, isolated event on a purely neutral background as is more commonly modeled. Here we use forward-in-time simulations to study the performance of commonly used sweep statistics, within the context of more realistic evolutionary baseline models incorporating purifying and background selection, population size change, and mutation and recombination rate heterogeneity. Results demonstrate the important interplay of these processes, necessitating caution when interpreting selection scans; specifically, false-positive rates are in excess of true-positive across much of the evaluated parameter space, and selective sweeps are often undetectable unless the strength of selection is exceptionally strong.
引用
收藏
页码:2113 / 2127
页数:15
相关论文
共 8 条
  • [1] Evaluating the performance of selection scans to detect selective sweeps in domestic dogs
    Schlamp, Florencia
    van der Made, Julian
    Stambler, Rebecca
    Chesebrough, Lewis
    Boyko, Adam R.
    Messer, Philipp W.
    MOLECULAR ECOLOGY, 2016, 25 (01) : 342 - 356
  • [2] Allele frequency distribution under recurrent selective sweeps
    Kim, Y
    GENETICS, 2006, 172 (03) : 1967 - 1978
  • [3] Patterns of Neutral Diversity Under General Models of Selective Sweeps
    Coop, Graham
    Ralph, Peter
    GENETICS, 2012, 192 (01) : 205 - U438
  • [4] Soft selective sweeps: Addressing new definitions, evaluating competing models, and interpreting empirical outliers
    Johri, Parul
    Stephan, Wolfgang
    Jensen, Jeffrey D.
    PLOS GENETICS, 2022, 18 (02):
  • [5] Estimating synonymous and nonsynonymous substitution rates under realistic evolutionary models
    Yang, ZH
    Nielsen, R
    MOLECULAR BIOLOGY AND EVOLUTION, 2000, 17 (01) : 32 - 43
  • [6] Evaluating the Power of Latent Growth Curve Models to Detect Individual Differences in Change
    Hertzog, Christopher
    von Oertzen, Timo
    Ghisletta, Paolo
    Lindenberger, Ulman
    STRUCTURAL EQUATION MODELING-A MULTIDISCIPLINARY JOURNAL, 2008, 15 (04) : 541 - 563
  • [7] Evaluating Latent Space Robustness and Uncertainty of EEG-ML Models under Realistic Distribution Shifts
    Wagh, Neeraj
    Wei, Jionghao
    Rawal, Samarth
    Berry, Brent
    Varatharajah, Yogatheesan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [8] Comparative Power of Family-Based Association Strategies to Detect Disease-Causing Variants Under Two-Locus Models
    Babron, Marie-Claude
    Guilloud-Bataille, Michel
    Sahbatou, Mourad
    Demenais, Florence
    Genin, Emmanuelle
    Dizier, Marie-Helene
    GENETIC EPIDEMIOLOGY, 2012, 36 (08) : 848 - 855