SEMIPARAMETRIC MAXIMUM LIKELIHOOD ESTIMATION WITH TWO-PHASE STRATIFIED CASE-CONTROL SAMPLING

被引:0
|
作者
Cao, Yaqi [1 ]
Chen, Lu [1 ]
Yang, Ying [2 ]
Chen, Jinbo [1 ]
机构
[1] Univ Penn, Perelman Sch Med, Dept Biostat Epidemiol & Informat, Philadelphia, PA 19104 USA
[2] Tsinghua Univ, Dept Math Sci, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Logistic regression model; profile likelihood; semiparametric maximum likelihood; stratified case-control study; two-phase sampling; GENE-ENVIRONMENT INDEPENDENCE; BREAST-CANCER RISK; LOGISTIC-REGRESSION; MISSING DATA; INFERENCE; MODELS; INFORMATION; EXPOSURE; DESIGN;
D O I
10.5705/ss.202021.0214
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We develop statistical inference methods for fitting logistic regression models to data arising from the two-phase stratified case-control sampling design, where a subset of covariates are available only for a portion of cases and controls, who are selected based on the case-control status and fully collected covariates. In addition, we characterize the distribution of incomplete covariates, conditional on fully observed ones. Here, we include all subjects in the analysis in order to achieve consistency in the parameter estimation and optimal statistical efficiency. We de-velop a semiparametric maximum likelihood approach under the rare disease as-sumption, where the parameter estimates are obtained using a novel reparametrized profile likelihood technique. We study the large-sample distribution theory for the proposed estimator, and use simulations to demonstrate that it performs well in finite samples and improves on the statistical efficiency of existing approaches. We apply the proposed method to analyze a stratified case-control study of breast cancer nested within the Breast Cancer Detection and Demonstration Project, where one breast cancer risk predictor, namely, percent mammographic density, was measured only for a subset of the women in the study.
引用
收藏
页码:2233 / 2256
页数:24
相关论文
共 50 条
  • [1] Consistency of semiparametric maximum likelihood estimators for two-phase sampling
    van der Vaart, A
    Wellner, JA
    CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2001, 29 (02): : 269 - 288
  • [2] Variance estimation for two-phase stratified sampling
    Binder, DA
    Babyak, C
    Brodeur, M
    Hidiroglou, M
    Jocelyn, W
    CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2000, 28 (04): : 751 - 764
  • [3] Replication variance estimation for two-phase stratified sampling
    Kim, JK
    Navarro, A
    Fuller, WA
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2006, 101 (473) : 312 - 320
  • [4] WEIGHTED LIKELIHOOD ESTIMATION UNDER TWO-PHASE SAMPLING
    Saegusa, Takumi
    Wellner, Jon A.
    ANNALS OF STATISTICS, 2013, 41 (01): : 269 - 295
  • [5] Weighted likelihood for semiparametric models and two-phase stratified samples, with application to cox regression
    Breslow, Norman E.
    Wellner, Jon A.
    SCANDINAVIAN JOURNAL OF STATISTICS, 2007, 34 (01) : 86 - 102
  • [6] Calibration weighted estimation of semiparametric transformation models for two-phase sampling
    Fong, Youyi
    Gilbert, Peter
    STATISTICS IN MEDICINE, 2015, 34 (10) : 1695 - 1707
  • [7] Maximum likelihood estimation for Cox's regression model under nested case-control sampling
    Scheike, TH
    Juul, A
    BIOSTATISTICS, 2004, 5 (02) : 193 - 206
  • [8] Estimating Additive Interaction Effect in Stratified Two-Phase Case-Control Design
    Ni, Ai
    Satagopan, Jaya M.
    HUMAN HEREDITY, 2019, 84 (02) : 90 - 108
  • [9] Maximum likelihood estimation of logistic regression parameters under two-phase, outcome-dependent sampling
    Breslow, NE
    Holubkov, R
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1997, 59 (02): : 447 - 461
  • [10] Comparison of semiparametric maximum likelihood estimation and two-stage semiparametric estimation in copula models
    Lawless, Jerald F.
    Yilmaz, Yildiz E.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2011, 55 (07) : 2446 - 2455