Probabilistic models of genetic variation in structured populations applied to global human studies

被引:21
|
作者
Hao, Wei [1 ]
Song, Minsun [1 ]
Storey, John D. [1 ,2 ]
机构
[1] Princeton Univ, Lewis Sigler Inst Integrat Genom, Princeton, NJ 08544 USA
[2] Princeton Univ, Ctr Stat & Machine Learning, Princeton, NJ 08544 USA
关键词
TRANSCRIPTION FACTOR; SYNTHETIC MAPS; EXPRESSION; CANDIDATE; INFERENCE; CANCER; FOXP1;
D O I
10.1093/bioinformatics/btv641
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Modern population genetics studies typically involve genome-wide genotyping of individuals from a diverse network of ancestries. An important problem is how to formulate and estimate probabilistic models of observed genotypes that account for complex population structure. The most prominent work on this problem has focused on estimating a model of admixture proportions of ancestral populations for each individual. Here, we instead focus on modeling variation of the genotypes without requiring a higher-level admixture interpretation. Results: We formulate two general probabilistic models, and we propose computationally efficient algorithms to estimate them. First, we show how principal component analysis can be utilized to estimate a general model that includes the well-known Pritchard-Stephens-Donnelly admixture model as a special case. Noting some drawbacks of this approach, we introduce a new 'logistic factor analysis' framework that seeks to directly model the logit transformation of probabilities underlying observed genotypes in terms of latent variables that capture population structure. We demonstrate these advances on data from the Human Genome Diversity Panel and 1000 Genomes Project, where we are able to identify SNPs that are highly differentiated with respect to structure while making minimal modeling assumptions.
引用
收藏
页码:713 / 721
页数:9
相关论文
共 50 条
  • [21] Genetic predisposition for essential hypertension, based on studies of genetic polymorphisms in modern global human populations: The perspective of evolutionary biology
    Bicho, Manuel
    REVISTA PORTUGUESA DE CARDIOLOGIA, 2018, 37 (06) : 509 - 510
  • [22] MAINTENANCE OF POLYGENIC VARIATION IN SPATIALLY STRUCTURED POPULATIONS - ROLES FOR LOCAL MATING AND GENETIC REDUNDANCY
    GOLDSTEIN, DB
    HOLSINGER, KE
    EVOLUTION, 1992, 46 (02) : 412 - 429
  • [23] Integration of Global Resources for Human Genetic Variation and Disease
    Schofield, Paul N.
    Hancock, John M.
    HUMAN MUTATION, 2012, 33 (05) : 813 - 816
  • [24] Study of genetic variation in three human populations in Piedmont (Italy)
    Selvaggi, A.
    Santovito, A.
    Cervella, P.
    DelPero, M.
    Borghese, F.
    Sella, G.
    JOURNAL OF BIOLOGICAL RESEARCH-BOLLETTINO DELLA SOCIETA ITALIANA DI BIOLOGIA SPERIMENTALE, 2009, 82 (01): : 76 - 79
  • [25] Genetic variation at twentythree microsatellite loci in sixteen human populations
    Deka, R
    Shriver, MD
    Yu, LM
    Heidreich, EM
    Jin, L
    Zhong, YX
    McGarvey, ST
    Agarwal, SS
    Bunker, CH
    Miki, T
    Hundrieser, J
    Yin, SJ
    Raskin, S
    Barrantes, R
    Ferrell, RE
    Chakraborty, R
    JOURNAL OF GENETICS, 1999, 78 (02) : 99 - 121
  • [27] Integrating common and rare genetic variation in diverse human populations
    Altshuler, David M.
    Gibbs, Richard A.
    Peltonen, Leena
    Dermitzakis, Emmanouil
    Schaffner, Stephen F.
    Yu, Fuli
    Bonnen, Penelope E.
    de Bakker, Paul I. W.
    Deloukas, Panos
    Gabriel, Stacey B.
    Gwilliam, Rhian
    Hunt, Sarah
    Inouye, Michael
    Jia, Xiaoming
    Palotie, Aarno
    Parkin, Melissa
    Whittaker, Pamela
    Chang, Kyle
    Hawes, Alicia
    Lewis, Lora R.
    Ren, Yanru
    Wheeler, David
    Muzny, Donna Marie
    Barnes, Chris
    Darvishi, Katayoon
    Hurles, Matthew
    Korn, Joshua M.
    Kristiansson, Kati
    Lee, Charles
    McCarroll, Steven A.
    Nemesh, James
    Keinan, Alon
    Montgomery, Stephen B.
    Pollack, Samuela
    Price, Alkes L.
    Soranzo, Nicole
    Gonzaga-Jauregui, Claudia
    Anttila, Verneri
    Brodeur, Wendy
    Daly, Mark J.
    Leslie, Stephen
    McVean, Gil
    Moutsianas, Loukas
    Nguyen, Huy
    Zhang, Qingrun
    Ghori, Mohammed J. R.
    McGinnis, Ralph
    McLaren, William
    Takeuchi, Fumihiko
    Grossman, Sharon R.
    NATURE, 2010, 467 (7311) : 52 - 58
  • [28] Genetic variation at twentythree microsatellite loci in sixteen human populations
    Ranjan Deka
    Mark D. Shriver
    Ling Mei Yu
    Elisa Mueller Heidreich
    Li Jin
    Yixi Zhong
    Stephen T. Mcgarvey
    Shyam Swarup Agarwal
    Clareann H. Bunker
    Tetsuro Miki
    Joachim Hundrieser
    Shih-Jiun Yin
    Salmo Raskin
    Ramiro Barrantes
    Robert E. Ferrell
    Ranajit Chakraborty
    Journal of Genetics, 1999, 78 : 99 - 121
  • [29] Genetic variation at fifteen microsatellite loci in human populations of India
    Kashyap, VK
    Sarkar, N
    Sahoo, S
    Sarkar, BN
    Trivedi, R
    CURRENT SCIENCE, 2003, 85 (04): : 464 - 473
  • [30] MECHANISMS OF MAINTENANCE OF GENETIC-VARIATION IN HUMAN-POPULATIONS
    ROTHHAMMER, F
    ARCHIVOS DE BIOLOGIA Y MEDICINA EXPERIMENTALES, 1985, 18 (02): : R94 - R95