Estimating variance components in population scale family trees

被引:8
|
作者
Shor, Tal [1 ,2 ]
Kalka, Iris [3 ,4 ]
Geiger, Dan [1 ]
Erlich, Yaniv [2 ,5 ,6 ]
Weissbrod, Omer [1 ,7 ]
机构
[1] Technion Israel Inst Technol, Comp Sci Dept, Haifa, Israel
[2] MyHeritage Ltd, Or Yehuda, Israel
[3] Weizmann Inst Sci, Dept Comp Sci & Appl Math, Rehovot, Israel
[4] Weizmann Inst Sci, Dept Mol Cell Biol, Rehovot, Israel
[5] New York Genome Ctr, New York, NY USA
[6] Columbia Univ, Fu Sch Engn, Dept Comp Sci, New York, NY USA
[7] Harvard TH Chan Sch Publ Hlth, Dept Epidemiol, Boston, MA 02115 USA
来源
PLOS GENETICS | 2019年 / 15卷 / 05期
关键词
AVERAGE-INFORMATION REML; EFFICIENT ALGORITHM; GENOMIC SELECTION; FULL PEDIGREE; SINGLE-STEP; HERITABILITY; PREDICTION; GENETICS; DISEASES; TRAITS;
D O I
10.1371/journal.pgen.1008124
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
The rapid digitization of genealogical and medical records enables the assembly of extremely large pedigree records spanning millions of individuals and trillions of pairs of relatives. Such pedigrees provide the opportunity to investigate the sociological and epidemiological history of human populations in scales much larger than previously possible. Linear mixed models (LMMs) are routinely used to analyze extremely large animal and plant pedigrees for the purposes of selective breeding. However, LMMs have not been previously applied to analyze population-scale human family trees. Here, we present Sparse Cholesky factorIzation LMM (Sci-LMM), a modeling framework for studying population-scale family trees that combines techniques from the animal and plant breeding literature and from human genetics literature. The proposed framework can construct a matrix of relationships between trillions of pairs of individuals and fit the corresponding LMM in several hours. We demonstrate the capabilities of Sci-LMM via simulation studies and by estimating the heritability of longevity and of reproductive fitness (quantified via number of children) in a large pedigree spanning millions of individuals and over five centuries of human history. Sci-LMM provides a unified framework for investigating the epidemiological history of human populations via genealogical records.
引用
收藏
页数:22
相关论文
共 50 条