Estimating linkage disequilibrium and selection from allele frequency trajectories

被引:4
|
作者
Li, Yunxiao [1 ]
Barton, John P. [1 ,2 ,3 ]
机构
[1] Univ Calif Riverside, Dept Phys & Astron, Riverside, CA 92521 USA
[2] Univ Pittsburgh, Sch Med, Dept Computat & Syst Biol, Pittsburgh, PA 15260 USA
[3] Univ Pittsburgh, Sch Med, Dept Computat & Syst Biol, 830 Murdoch Bldg,3420 Forbes Ave, Pittsburgh, PA 15260 USA
关键词
statistical inference; selection coefficients; genetic linkage; short-read data; allele frequency time series; covariance estimation; FITNESS MODEL; EVOLUTION; ADAPTATION; MUTATIONS; RECONSTRUCTION; CONSEQUENCES; CONSTRAINTS; ESCAPE;
D O I
10.1093/genetics/iyac189
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Genetic sequences collected over time provide an exciting opportunity to study natural selection. In such studies, it is important to account for linkage disequilibrium to accurately measure selection and to distinguish between selection and other effects that can cause changes in allele frequencies, such as genetic hitchhiking or clonal interference. However, most high-throughput sequencing methods cannot directly measure linkage due to short-read lengths. Here we develop a simple method to estimate linkage disequilibrium from time-series allele frequencies. This reconstructed linkage information can then be combined with other inference methods to infer the fitness effects of individual mutations. Simulations show that our approach reliably outperforms inference that ignores linkage disequilibrium and, with sufficient sampling, performs similarly to inference using the true linkage information. We also introduce two regularization methods derived from random matrix theory that help to preserve its performance under limited sampling effects. Overall, our method enables the use of linkage-aware inference methods even for data sets where only allele frequency time series are available.
引用
收藏
页数:12
相关论文
共 50 条