Robust Demographic Inference from Genomic and SNP Data

被引:982
|
作者
Excoffier, Laurent [1 ,2 ]
Dupanloup, Isabelle [1 ,2 ]
Huerta-Sanchez, Emilia [3 ]
Sousa, Vitor C. [1 ,2 ]
Foll, Matthieu [1 ,2 ,4 ]
机构
[1] CMPG, Inst Ecol & Evolut, Bern, Switzerland
[2] Swiss Inst Bioinformat, Lausanne, Switzerland
[3] Univ Calif Berkeley, Ctr Theoret Evolutionary Genom, Dept Integrat Biol, Berkeley, CA 94720 USA
[4] Ecole Polytech Fed Lausanne, Sch Life Sci, Lausanne, Switzerland
来源
PLOS GENETICS | 2013年 / 9卷 / 10期
基金
瑞士国家科学基金会;
关键词
APPROXIMATE BAYESIAN COMPUTATION; MAXIMUM-LIKELIHOOD-ESTIMATION; ALLELE FREQUENCY-SPECTRUM; CHAIN MONTE-CARLO; UNSAMPLED POPULATIONS; REVEALS ADAPTATION; MIGRATION RATES; MODEL SELECTION; PARAMETERS; HISTORY;
D O I
10.1371/journal.pgen.1003905
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
We introduce a flexible and robust simulation-based framework to infer demographic parameters from the site frequency spectrum (SFS) computed on large genomic datasets. We show that our composite-likelihood approach allows one to study evolutionary models of arbitrary complexity, which cannot be tackled by other current likelihood-based methods. For simple scenarios, our approach compares favorably in terms of accuracy and speed with partial derivative a partial derivative i, the current reference in the field, while showing better convergence properties for complex models. We first apply our methodology to non-coding genomic SNP data from four human populations. To infer their demographic history, we compare neutral evolutionary models of increasing complexity, including unsampled populations. We further show the versatility of our framework by extending it to the inference of demographic parameters from SNP chips with known ascertainment, such as that recently released by Affymetrix to study human origins. Whereas previous ways of handling ascertained SNPs were either restricted to a single population or only allowed the inference of divergence time between a pair of populations, our framework can correctly infer parameters of more complex models including the divergence of several populations, bottlenecks and migration. We apply this approach to the reconstruction of African demography using two distinct ascertained human SNP panels studied under two evolutionary models. The two SNP panels lead to globally very similar estimates and confidence intervals, and suggest an ancient divergence (>110 Ky) between Yoruba and San populations. Our methodology appears well suited to the study of complex scenarios from large genomic data sets.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] Inferring the Demographic History of Inbred Species from Genome-Wide SNP Frequency Data
    Blischak, Paul D.
    Barker, Michael S.
    Gutenkunst, Ryan N.
    MOLECULAR BIOLOGY AND EVOLUTION, 2020, 37 (07) : 2124 - 2136
  • [42] Supervised enzyme network inference from the integration of genomic data and chemical information
    Yamanishi, Y
    Vert, JP
    Kanehisa, M
    BIOINFORMATICS, 2005, 21 : I468 - I477
  • [43] Addressing the Threats of Inference Attacks on Traits and Genotypes from Individual Genomic Data
    He, Zaobo
    Li, Yingshu
    Li, Ji
    Yu, Jiguo
    Gao, Hong
    Wang, Jinbao
    BIOINFORMATICS RESEARCH AND APPLICATIONS (ISBRA 2017), 2017, 10330 : 223 - 233
  • [44] A Robust Method for the Interpretation of Genomic Data
    Hind, Jade
    Hussain, Abir
    Al-jumeily, Dhiya
    Abdulaimma, Basma
    Montanez, Casimiro Aday Curbelo
    Lisboa, Paulo
    2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 3385 - 3390
  • [45] GADMA2: more efficient and flexible demographic inference from genetic data
    Noskova, Ekaterina
    Abramov, Nikita
    Iliutkin, Stanislav
    Sidorin, Anton
    Dobrynin, Pavel
    Ulyantsev, Vladimir I.
    GIGASCIENCE, 2023, 12
  • [46] GADMA2: more efficient and flexible demographic inference from genetic data
    Noskova, Ekaterina
    Abramov, Nikita
    Iliutkin, Stanislav
    Sidorin, Anton
    Dobrynin, Pavel
    Ulyantsev, Vladimir, I
    GIGASCIENCE, 2023, 12
  • [47] Inference of Gorilla Demographic and Selective History from Whole-Genome Sequence Data
    McManus, Kimberly F.
    Kelley, Joanna L.
    Song, Shiya
    Veeramah, Krishna R.
    Woerner, August E.
    Stevison, Laurie S.
    Ryder, Oliver A.
    Kidd, Jeffrey M.
    Wall, Jeffrey D.
    Bustamante, Carlos D.
    Hammer, Michael F.
    MOLECULAR BIOLOGY AND EVOLUTION, 2015, 32 (03) : 600 - 612
  • [48] Author Correction: Stairway Plot 2: demographic history inference with folded SNP frequency spectra
    Xiaoming Liu
    Yun-Xin Fu
    Genome Biology, 21
  • [49] Demographic inference
    Marchi, Nina
    Schlichta, Flavia
    Excoffi, Laurent
    CURRENT BIOLOGY, 2021, 31 (06) : R276 - R279
  • [50] Robust empirical likelihood inference for longitudinal data
    Qin, Guoyou
    Bai, Yang
    Zhu, Zhongyi
    STATISTICS & PROBABILITY LETTERS, 2009, 79 (20) : 2101 - 2108