Rare variant association tests for ancestry-matched case-control data based on conditional logistic regression

被引:4
|
作者
Cheng, Shanshan [1 ]
Lyu, Jingjing [2 ]
Shi, Xian [2 ]
Wang, Kai [2 ]
Wang, Zengmiao [3 ]
Deng, Minghua [4 ,5 ]
Sun, Baoluo [6 ]
Wang, Chaolong [1 ]
机构
[1] Huazhong Univ Sci & Technol, Minist Educ, Key Lab Environm & Hlth, Sch Publ Hlth,Tongji Med Coll, 13 Hangkong Rd, Wuhan 430030, Peoples R China
[2] Huazhong Univ Sci & Technol, Tongji Med Coll, Sch Publ Hlth, Wuhan, Peoples R China
[3] Beijing Normal Univ, State Key Lab Remote Sensing Sci, Ctr Global Change & Publ Hlth, Coll Global Change & Earth Syst Sci, Beijing, Peoples R China
[4] Peking Univ, Ctr Quantitat Biol, Sch Math Sci, Beijing, Peoples R China
[5] Peking Univ, Ctr Stat Sci, Beijing, Peoples R China
[6] Natl Univ Singapore, Dept Stat & Data Sci, 6 Sci Dr 2, Singapore 117546, Singapore
关键词
rare variant association tests; common controls; population stratification; matched analysis; conditional logistic regression; GENOME-WIDE ASSOCIATION; SEQUENCING IDENTIFIES RARE; POPULATION STRATIFICATION; COMMON VARIANTS; MODEL; DISEASES; DESIGNS; SAMPLES; RISK;
D O I
10.1093/bib/bbab572
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
With the increasing volume of human sequencing data available, analysis incorporating external controls becomes a popular and cost-effective approach to boost statistical power in disease association studies. To prevent spurious association due to population stratification, it is important to match the ancestry backgrounds of cases and controls. However, rare variant association tests based on a standard logistic regression model are conservative when all ancestry-matched strata have the same case-control ratio and might become anti-conservative when case-control ratio varies across strata. Under the conditional logistic regression (CLR) model, we propose a weighted burden test (CLR-Burden), a variance component test (CLR-SKAT) and a hybrid test (CLR-MiST). We show that the CLR model coupled with ancestry matching is a general approach to control for population stratification, regardless of the spatial distribution of disease risks. Through extensive simulation studies, we demonstrate that the CLR-based tests robustly control type 1 errors under different matching schemes and are more powerful than the standard Burden, SKAT and MiST tests. Furthermore, because CLR-based tests allow for different case-control ratios across strata, a full-matching scheme can be employed to efficiently utilize all available cases and controls to accelerate the discovery of disease associated genes.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Gene-based Rare Variant Association Tests for Ancestry-matched Case-control Data
    Wang, Chaolong
    Sun, Baoluo
    Cheng, Shanshan
    Wang, Zengmiao
    Deng, Minghua
    Chen, Han
    [J]. GENETIC EPIDEMIOLOGY, 2019, 43 (07) : 914 - 915
  • [2] Unconditional or Conditional Logistic Regression Model for Age-Matched Case-Control Data?
    Kuo, Chia-Ling
    Duan, Yinghui
    Grady, James
    [J]. FRONTIERS IN PUBLIC HEALTH, 2018, 6
  • [3] Conditional or unconditional logistic regression for frequency matched case-control design?
    Wan, Fei
    [J]. STATISTICS IN MEDICINE, 2022, 41 (06) : 1023 - 1041
  • [4] EXACT CONDITIONAL LOGISTIC-REGRESSION FOR MATCHED CASE-CONTROL STUDIES
    MEHTA, C
    [J]. AMERICAN JOURNAL OF EPIDEMIOLOGY, 1993, 138 (08) : 619 - 619
  • [5] Firth logistic regression for rare variant association tests
    Zhang, Qunyuan
    [J]. FRONTIERS IN GENETICS, 2014, 5
  • [6] CONDITIONAL LOGISTIC ANALYSES OF MATCHED CASE-CONTROL STUDIES
    FLANDERS, WD
    [J]. AMERICAN JOURNAL OF EPIDEMIOLOGY, 1986, 123 (04) : 756 - 757
  • [7] GOODNESS-OF-FIT TESTS FOR THE LOGISTIC-REGRESSION MODEL FOR MATCHED CASE-CONTROL STUDIES
    HOSMER, DW
    LEMESHOW, S
    [J]. BIOMETRICAL JOURNAL, 1985, 27 (05) : 511 - 520
  • [8] Insulin Resistance Associated With Differentiated Thyroid Carcinoma: Penalized Conditional Logistic Regression Analysis of a Matched Case-Control Study Data
    Heidari, Zahra
    Abdani, Mahdi
    Mansournia, Mohammad Ali
    [J]. INTERNATIONAL JOURNAL OF ENDOCRINOLOGY AND METABOLISM, 2018, 16 (01)
  • [9] Assessing the fit of the logistic regression model to individual matched sets of case-control data
    Bedrick, EJ
    Hill, JR
    [J]. BIOMETRICS, 1996, 52 (01) : 1 - 9
  • [10] Conditional likelihood methods for haplotype-based association analysis using matched case-control data
    Chen, Jinbo
    Rodriguez, Carmen
    [J]. BIOMETRICS, 2007, 63 (04) : 1099 - 1107