A practical guide to environmental association analysis in landscape genomics

被引:513
|
作者
Rellstab, Christian [1 ]
Gugerli, Felix [1 ]
Eckert, Andrew J. [2 ]
Hancock, Angela M. [3 ,4 ]
Holderegger, Rolf [1 ,5 ]
机构
[1] WSL Swiss Fed Res Inst, CH-8903 Birmensdorf, Switzerland
[2] Virginia Commonwealth Univ, Dept Biol, Richmond, VA 23284 USA
[3] Max F Perutz Labs, Fac Mol Biol, A-1090 Vienna, Austria
[4] Univ Vienna, A-1090 Vienna, Austria
[5] ETH, Inst Integrat Biol, CH-8092 Zurich, Switzerland
基金
瑞士国家科学基金会;
关键词
adaptive genetic variation; ecological association; environmental correlation analysis; genetic-environment association; genotype-environment correlation; local adaptation; natural selection; neutral genetic structure; population genomics; ADAPTIVE GENETIC-VARIATION; PINE PINUS-TAEDA; LOCAL ADAPTATION; NEXT-GENERATION; SPATIAL-ANALYSIS; ARABIDOPSIS-THALIANA; POPULATION-STRUCTURE; ECOLOGICAL GENOMICS; DETECTING SELECTION; ALLELE FREQUENCIES;
D O I
10.1111/mec.13322
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Landscape genomics is an emerging research field that aims to identify the environmental factors that shape adaptive genetic variation and the gene variants that drive local adaptation. Its development has been facilitated by next-generation sequencing, which allows for screening thousands to millions of single nucleotide polymorphisms in many individuals and populations at reasonable costs. In parallel, data sets describing environmental factors have greatly improved and increasingly become publicly accessible. Accordingly, numerous analytical methods for environmental association studies have been developed. Environmental association analysis identifies genetic variants associated with particular environmental factors and has the potential to uncover adaptive patterns that are not discovered by traditional tests for the detection of outlier loci based on population genetic differentiation. We review methods for conducting environmental association analysis including categorical tests, logistic regressions, matrix correlations, general linear models and mixed effects models. We discuss the advantages and disadvantages of different approaches, provide a list of dedicated software packages and their specific properties, and stress the importance of incorporating neutral genetic structure in the analysis. We also touch on additional important aspects such as sampling design, environmental data preparation, pooled and reduced-representation sequencing, candidate-gene approaches, linearity of allele-environment associations and the combination of environmental association analyses with traditional outlier detection tests. We conclude by summarizing expected future directions in the field, such as the extension of statistical approaches, environmental association analysis for ecological gene annotation, and the need for replication and post hoc validation studies.
引用
收藏
页码:4348 / 4370
页数:23
相关论文
共 50 条