Precisely modeling zero-inflated count phenotype for rare variants

Cited by: 1
Authors
Fan, Qiao [1]
Sun, Shuming [2]
Li, Yi-Ju [3]
Affiliations
[1] Natl Univ Singapore, Ctr Quantitat Med, Duke NUS Med Sch, Singapore, Singapore
[2] Duke Univ, Sch Med, Duke Mol Physiol Inst, Durham, NC 27710 USA
[3] Duke Univ, Sch Med, Dept Biostat & Bioinformat, DUMC Box 104775, Durham, NC 27710 USA
Funding
National Institutes of Health (USA)
Keywords
burden test; kernel test; rare variant; zero-inflated count; Poisson regression; association
DOI
10.1002/gepi.22438
Chinese Library Classification (CLC) number
Q3 [Genetics]
Subject classification codes
071007; 090102
Abstract
Count data with excess zeros are increasingly common in genetic association studies; one example is the count of neuritic plaques in brain pathology for Alzheimer's disease. Here, we developed gene-based association tests that model such data as a mixture of two distributions: a Binomial distribution that contributes the structural zeros, and a Poisson distribution that generates the counts. We derived the score statistics for the rare-variant parameters in the zero-inflated Poisson (ZIP) regression model, and then constructed burden (ZIP-b) and kernel (ZIP-k) association tests. We also evaluated omnibus tests that combine ZIP-b and ZIP-k. Using simulated sequence data, we illustrated the potential power gain of the proposed method, for both burden and kernel tests, over a two-stage method that analyzes the binary and the non-zero continuous data separately. As expected, the ZIP burden test outperformed the kernel test in all scenarios except the one in which variants had genetic effects in mixed directions. We further demonstrated the method by applying it to the neuritic plaque data in the ROSMAP cohort. We expect the proposed test to be useful in practice, either as a more powerful alternative to the two-stage method or as a complement to it.
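The two-component mixture described in the abstract can be illustrated with a minimal sketch of the zero-inflated Poisson probability mass function and log-likelihood. This is not the authors' score test or their regression model: the names `zip_pmf`, `zip_loglik`, `pi` (probability of a structural zero) and `lam` (Poisson mean) are assumptions introduced here for illustration, and covariates and genotypes are left out entirely.

```python
import math


def zip_pmf(y: int, pi: float, lam: float) -> float:
    """P(Y = y) under a zero-inflated Poisson mixture.

    pi  : assumed probability that an observation is a structural zero
    lam : assumed mean of the Poisson component (must be > 0)
    """
    if not 0.0 <= pi < 1.0:
        raise ValueError("pi must lie in [0, 1)")
    if lam <= 0.0:
        raise ValueError("lam must be positive")
    if y < 0:
        return 0.0
    # Poisson probability, computed on the log scale for numerical stability.
    poisson = math.exp(-lam + y * math.log(lam) - math.lgamma(y + 1))
    if y == 0:
        # A zero arises either structurally or from the Poisson component.
        return pi + (1.0 - pi) * poisson
    # Positive counts can only come from the Poisson component.
    return (1.0 - pi) * poisson


def zip_loglik(counts, pi: float, lam: float) -> float:
    """Log-likelihood of independent counts under the same (pi, lam)."""
    return sum(math.log(zip_pmf(y, pi, lam)) for y in counts)


if __name__ == "__main__":
    # Hypothetical counts with many zeros, for illustration only.
    example_counts = [0, 0, 0, 0, 3, 1, 0, 2, 0, 5]
    print(zip_loglik(example_counts, pi=0.4, lam=2.0))
```

In a regression setting such as the one the abstract describes, `pi` and `lam` would each depend on covariates through link functions; the sketch keeps both constant to show only the mixture structure.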
Pages: 73-86
Page count: 14