Logistic Regression With Incomplete Covariate Data in Complex Survey Sampling Application of Reweighted Estimating Equations

被引:23
|
作者
Moore, Charity G. [4 ]
Lipsitz, Stuart R. [3 ]
Addy, Cheryl L. [2 ]
Hussey, James R. [2 ]
Fitzmaurice, Garrett [3 ]
Natarajan, Sundar [1 ]
机构
[1] NYU, Sch Med, Div Gen Internal Med, New York, NY 10010 USA
[2] Univ S Carolina, Norman J Arnold Sch Publ Hlth, Dept Epidemiol & Biostat, Columbia, SC 29208 USA
[3] Brigham & Womens Hosp, Div Gen Internal Med, Boston, MA 02115 USA
[4] Univ Pittsburgh, Dept Med, Pittsburgh, PA USA
基金
美国国家卫生研究院;
关键词
SEMIPARAMETRIC REGRESSION; REPEATED OUTCOMES; MODELS;
D O I
10.1097/EDE.0b013e318196cd65
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Weighted survey data with missing data for some covariates presents a substantial challenge for analysis. We addressed this problem by using a reweighting technique in a logistic regression model to estimate parameters. Each survey weight was adjusted by the inverse of the probability that the possibly missing covariate was observed. The reweighted estimating equations procedure was compared with a complete case analysis (after discarding any subjects with missing data) in a simulation study to assess bias reduction. The method was also applied to data obtained from a national health survey (National Health and Nutritional Examination Survey or NHANES). Adjusting the sampling weights by the inverse probability of being completely observed appears to be effective in accounting for missing data and reducing the bias of the complete case estimate of die regression coefficients.
引用
收藏
页码:382 / 390
页数:9
相关论文
共 50 条
  • [41] Optimal estimating functions in incomplete data and length biased sampling data problems
    Qin, Jing
    Zhang, Biao
    CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2011, 39 (03): : 510 - 518
  • [42] THE BIAS OF ESTIMATING EQUATIONS WITH APPLICATION TO THE ERROR RATE OF LOGISTIC DISCRIMINATION
    ONEILL, TJ
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1994, 89 (428) : 1492 - 1498
  • [43] A regression model for pooled data in a two-stage survey under informative sampling with application for detecting and estimating the presence of transgenic corn
    Montesinos-Lopez, Osval A.
    Eskridge, Kent
    Montesinos-Lopez, Abelardo
    Crossa, Jose
    Cortes-Cruz, Moises
    Wang, Dong
    SEED SCIENCE RESEARCH, 2016, 26 (02) : 182 - 197
  • [44] Local logistic regression: An application to Army penetration data
    Nottingham, QJ
    Birch, JB
    Bodt, BA
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2000, 66 (01) : 35 - 50
  • [45] Quantifying and estimating ecological network diversity based on incomplete sampling data
    Chiu, Chun-Huo
    Chao, Anne
    Vogel, Sebastian
    Kriegel, Peter
    Thorn, Simon
    PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2023, 378 (1881)
  • [46] Estimating Bias and Variances in Bootstrap Logistic Regression for Umaru and Impact Data
    Fitrianto, Anwar
    Cing, Ng Mei
    INTERNATIONAL CONFERENCE ON QUANTITATIVE SCIENCES AND ITS APPLICATIONS (ICOQSIA 2014), 2014, 1635 : 742 - 747
  • [47] Estimating the class prior for positive and unlabelled data via logistic regression
    Lazecka, Malgorzata
    Mielniczuk, Jan
    Teisseyre, Pawel
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2021, 15 (04) : 1039 - 1068
  • [48] Augmented two-step estimating equations with nuisance functionals and complex survey data
    Zhao, Puying
    Wu, Changbao
    ECONOMETRICS JOURNAL, 2024, 27 (01): : 37 - 61
  • [49] Estimating the class prior for positive and unlabelled data via logistic regression
    Małgorzata Łazęcka
    Jan Mielniczuk
    Paweł Teisseyre
    Advances in Data Analysis and Classification, 2021, 15 : 1039 - 1068
  • [50] Accounting for informatively missing data in logistic regression by means of reassessment sampling
    Lin, Ji
    Lyles, Robert H.
    STATISTICS IN MEDICINE, 2015, 34 (11) : 1925 - 1939