GADMA: Genetic algorithm for inferring demographic history of multiple populations from allele frequency spectrum data

被引:40
|
作者
Noskova, Ekaterina [1 ]
Ulyantsev, Vladimir [1 ]
Koepfli, Klaus-Peter [1 ,2 ]
O'Brien, Stephen J. [1 ,3 ]
Dobrynin, Pavel [1 ,2 ]
机构
[1] ITMO Univ, Comp Technol Lab, 49 Kronverkskiy Pr, St Petersburg 197101, Russia
[2] Natl Zool Pk, Ctr Species Survival, Smithsonian Conservat Biol Inst, 3001 Connecticut Ave NW, Washington, DC USA
[3] Nova Southeastern Univ, Guy Harvey Oceanog Ctr, 8000 North Ocean Dr, Ft Lauderdale, FL 33004 USA
来源
GIGASCIENCE | 2020年 / 9卷 / 03期
关键词
demographic inference; population genetics; genetic algorithm; allele frequency spectrum; genomics; INFERENCE; SIZE; DIVERSITY; SEQUENCE; MODELS; TIME;
D O I
10.1093/gigascience/giaa005
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: The demographic history of any population is imprinted in the genomes of the individuals that make up the population. One of the most popular and convenient representations of genetic information is the allele frequency spectrum (AFS), the distribution of allele frequencies in populations. The joint AFS is commonly used to reconstruct the demographic history of multiple populations, and several methods based on diffusion approximation (e.g.,partial derivative a partial derivative i) and ordinary differential equations (e.g., moments) have been developed and applied for demographic inference. These methods provide an opportunity to simulate AFS under a variety of researcher-specified demographic models and to estimate the best model and associated parameters using likelihood-based local optimizations. However, there are no known algorithms to perform global searches of demographic models with a given AFS. Results: Here, we introduce a new method that implements a global search using a genetic algorithm for the automatic and unsupervised inference of demographic history from joint AFS data. Our method is implemented in the software GADMA (Genetic Algorithm for Demographic Model Analysis, https://github.com/ctlab/GADMA). Conclusions: We demonstrate the performance of GADMA by applying it to sequence data from humans and non-model organisms and show that it is able to automatically infer a demographic model close to or even better than the one that was previously obtained manually. Moreover, GADMA is able to infer multiple demographic models at different local optima close to the global one, providing a larger set of possible scenarios to further explore demographic history.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Inferring the Joint Demographic History of Multiple Populations from Multidimensional SNP Frequency Data
    Gutenkunst, Ryan N.
    Hernandez, Ryan D.
    Williamson, Scott H.
    Bustamante, Carlos D.
    [J]. PLOS GENETICS, 2009, 5 (10):
  • [2] Efficiently Inferring the Demographic History of Many Populations With Allele Count Data
    Kamm, Jack
    Terhorst, Jonathan
    Durbin, Richard
    Song, Yun S.
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2020, 115 (531) : 1472 - 1487
  • [3] Inferring the Joint Demographic History of Multiple Populations: Beyond the Diffusion Approximation
    Jouganous, Julien
    Long, Will
    Ragsdale, Aaron P.
    Gravel, Simon
    [J]. GENETICS, 2017, 206 (03) : 1549 - 1567
  • [4] GADMA2: more efficient and flexible demographic inference from genetic data
    Noskova, Ekaterina
    Abramov, Nikita
    Iliutkin, Stanislav
    Sidorin, Anton
    Dobrynin, Pavel
    Ulyantsev, Vladimir I.
    [J]. GIGASCIENCE, 2023, 12
  • [5] Inferring patterns of dispersal from allele frequency data
    Raybould, AF
    Clarke, RT
    Bond, JM
    Welters, RE
    Gliddon, CJ
    [J]. DISPERSAL ECOLOGY, 2002, : 89 - 110
  • [6] PopHist: inferring population history from the spectrum of allele frequencies
    Wooding, S
    [J]. BIOINFORMATICS, 2003, 19 (04) : 539 - 540
  • [7] Inferring Demographic History from a Spectrum of Shared Haplotype Lengths
    Harris, Kelley
    Nielsen, Rasmus
    [J]. PLOS GENETICS, 2013, 9 (06):
  • [8] The allele frequency spectrum in genome-wide human variation data reveals signals of differential demographic history in three large world populations
    Marth, GT
    Czabarka, E
    Murvai, J
    Sherry, ST
    [J]. GENETICS, 2004, 166 (01) : 351 - 372
  • [9] Inferring the Demographic History of Inbred Species from Genome-Wide SNP Frequency Data
    Blischak, Paul D.
    Barker, Michael S.
    Gutenkunst, Ryan N.
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 2020, 37 (07) : 2124 - 2136
  • [10] Inferring population structure and demographic history using Y-STR data from worldwide populations
    Xu, Hongyang
    Wang, Chuan-Chao
    Shrestha, Rukesh
    Wang, Ling-Xiang
    Zhang, Manfei
    He, Yungang
    Kidd, Judith R.
    Kidd, Kenneth K.
    Jin, Li
    Li, Hui
    [J]. MOLECULAR GENETICS AND GENOMICS, 2015, 290 (01) : 141 - 150