Robust Demographic Inference from Genomic and SNP Data

被引:982
|
作者
Excoffier, Laurent [1 ,2 ]
Dupanloup, Isabelle [1 ,2 ]
Huerta-Sanchez, Emilia [3 ]
Sousa, Vitor C. [1 ,2 ]
Foll, Matthieu [1 ,2 ,4 ]
机构
[1] CMPG, Inst Ecol & Evolut, Bern, Switzerland
[2] Swiss Inst Bioinformat, Lausanne, Switzerland
[3] Univ Calif Berkeley, Ctr Theoret Evolutionary Genom, Dept Integrat Biol, Berkeley, CA 94720 USA
[4] Ecole Polytech Fed Lausanne, Sch Life Sci, Lausanne, Switzerland
来源
PLOS GENETICS | 2013年 / 9卷 / 10期
基金
瑞士国家科学基金会;
关键词
APPROXIMATE BAYESIAN COMPUTATION; MAXIMUM-LIKELIHOOD-ESTIMATION; ALLELE FREQUENCY-SPECTRUM; CHAIN MONTE-CARLO; UNSAMPLED POPULATIONS; REVEALS ADAPTATION; MIGRATION RATES; MODEL SELECTION; PARAMETERS; HISTORY;
D O I
10.1371/journal.pgen.1003905
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
We introduce a flexible and robust simulation-based framework to infer demographic parameters from the site frequency spectrum (SFS) computed on large genomic datasets. We show that our composite-likelihood approach allows one to study evolutionary models of arbitrary complexity, which cannot be tackled by other current likelihood-based methods. For simple scenarios, our approach compares favorably in terms of accuracy and speed with partial derivative a partial derivative i, the current reference in the field, while showing better convergence properties for complex models. We first apply our methodology to non-coding genomic SNP data from four human populations. To infer their demographic history, we compare neutral evolutionary models of increasing complexity, including unsampled populations. We further show the versatility of our framework by extending it to the inference of demographic parameters from SNP chips with known ascertainment, such as that recently released by Affymetrix to study human origins. Whereas previous ways of handling ascertained SNPs were either restricted to a single population or only allowed the inference of divergence time between a pair of populations, our framework can correctly infer parameters of more complex models including the divergence of several populations, bottlenecks and migration. We apply this approach to the reconstruction of African demography using two distinct ascertained human SNP panels studied under two evolutionary models. The two SNP panels lead to globally very similar estimates and confidence intervals, and suggest an ancient divergence (>110 Ky) between Yoruba and San populations. Our methodology appears well suited to the study of complex scenarios from large genomic data sets.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] An empirical approach to demographic inference with genomic data
    Ralph, Peter L.
    THEORETICAL POPULATION BIOLOGY, 2019, 127 : 91 - 101
  • [2] Joint inference of adaptive and demographic history from temporal population genomic data
    Pavinato, Vitor A. C.
    De Mita, Stephane
    Marin, Jean-Michel
    de Navascues, Miguel
    PEER COMMUNITY JOURNAL, 2022, 2
  • [3] Robust inference of population size histories from genomic sequencing data
    Upadhya, Gautam
    Steinrucken, Matthias
    PLOS COMPUTATIONAL BIOLOGY, 2022, 18 (09)
  • [4] Evolutionary inference from genomic data
    Goldstein, DB
    Harvey, PH
    BIOESSAYS, 1999, 21 (02) : 148 - 156
  • [5] Nonparametric inference of population demography from SNP data
    Kawai, Yosuke
    Sato, Yukuto
    Yamaguchi, Yumi
    Nariai, Naoki
    Sugimoto, Sachiyo
    Mimori, Takahiro
    Kojima, Kaname
    Nagasaki, Masao
    GENES & GENETIC SYSTEMS, 2014, 89 (06) : 314 - 314
  • [6] Phenotype Inference from Text and Genomic Data
    Brbic, Maria
    Piskorec, Matija
    Vidulin, Vedrana
    Krisko, Anita
    Smuc, Tomislav
    Supek, Fran
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2017, PT III, 2017, 10536 : 373 - 377
  • [7] Repetitive genomic regions and the inference of demographic history
    Patil, Ajinkya Bharatraj
    Vijay, Nagarjun
    HEREDITY, 2021, 127 (02) : 151 - 166
  • [8] Repetitive genomic regions and the inference of demographic history
    Ajinkya Bharatraj Patil
    Nagarjun Vijay
    Heredity, 2021, 127 : 151 - 166
  • [9] Demographic Information Inference from Passively Collected Data
    Zhang, Yiran
    Ban, Xuegang
    JOURNAL OF TRANSPORTATION ENGINEERING PART A-SYSTEMS, 2025, 151 (03)
  • [10] Grouping preprocess for haplotype inference from SNP and CNV data
    Shindo, Hiroyuki
    Chigira, Hiroshi
    Nagaoka, Tomoyo
    Kamatani, Naoyuki
    Inoue, Masato
    INTERNATIONAL WORKSHOP ON STATISTICAL-MECHANICAL INFORMATICS 2009 (IW-SMI 2009), 2009, 197