SEMIPARAMETRIC TRANSFORMATION MODELS WITH MULTILEVEL RANDOM EFFECTS FOR CORRELATED DISEASE ONSET IN FAMILIES

被引:1
|
作者
Liang, Baosheng [1 ]
Wang, Yuanjia [2 ]
Zeng, Donglin [3 ]
机构
[1] Peking Univ, Dept Nat Sci Med, Hlth Sci Ctr, Beijing 100871, Peoples R China
[2] Columbia Univ, Dept Biostat, New York, NY 10027 USA
[3] Univ North Carolina Chapel Hill, Dept Biostat, Chapel Hill, NC 27599 USA
基金
中国国家自然科学基金;
关键词
Alzheimer's disease; family data; multilevel random effects; nonparametric maximum-likelihood estimation; semiparametric efficiency; CARIBBEAN HISPANICS; AFRICAN-AMERICANS; FRAILTY-MODEL; BREAST; SUSCEPTIBILITY; BRCA1;
D O I
10.5705/ss.202017.0326
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Large cohort studies are often used to investigate the impact of genetic variants or other risk factors on the age at onset (AAO) of a chronic disorder. These studies collect family history data, including the AAO of a disease in family members, in order to provide additional information and to improve the efficiency of estimating associations. Statistical analyses of these data are challenging owing to missing genotypes in family members and the heterogeneous dependence attributed to both their shared genetic background and shared environmental factors (e.g., lifestyle). Therefore, we propose a class of semiparametric transformation models with multilevel random effects to address these challenges. The proposed models include both the proportional-hazards model and the proportional-odds model as special cases. The multilevel random effects contain individual-specific random effects, including the kinship correlation structure dependent on the family pedigree, and a shared random effect to account for any unobserved exposure to the environment. We use a nonparametric maximum-likelihood approach for our inferences and propose an expectation-maximization algorithm for the computation in the presence of missing genotypes among family members. The obtained estimators are shown to be consistent, asymptotically normal, and semiparametrically efficient. Simulation studies demonstrate that the proposed method performs well with finite sample sizes. Finally, we apply the proposed method to examine genetic risks in an Alzheimer's disease study.
引用
收藏
页码:1851 / 1871
页数:21
相关论文
共 50 条