Doubly robust kernel density estimation when group membership is missing at random

被引:0
|
作者
Zhang, Chenguang [1 ]
He, Hua [2 ]
Li, Jian [1 ]
Tang, Wan [1 ]
机构
[1] Tulane Univ, Sch Publ Hlth & Trop Med, Dept Biostat & Data Sci, New Orleans, LA 70118 USA
[2] Tulane Univ, Sch Publ Hlth & Trop Med, Dept Epidemiol, New Orleans, LA USA
关键词
Kernel smoothing; Missing at random; Doubly robust; VERIFICATION; SUBJECT;
D O I
10.1016/j.jspi.2019.09.010
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
When there are subjects with subpopulation memberships missing, the kernel density estimates of the subpopulations based on the subjects with verified memberships may not be valid unless the missingness of the memberships satisfies the stringent missing completely at random assumption. Under more plausible missing at random (MAR) assumptions, the inverse probability weighting (IPW) kernel smoothing method can be employed if the missing mechanism is well understood. On the other hand, if the memberships can be well predicted by a prediction model, one may replace the memberships (either all or only the missing ones), with the probabilities for the memberships to deal with the missing values. By integrating the information of the missing mechanism and the prediction model, we develop a doubly robust kernel density estimate which is valid as long as either of the models is correctly specified. Asymptotic properties of the new estimates are derived and the asymptotic orders of the mean integrated square errors (MISE) are compared with existing methods. We prove that the doubly robust estimate can be more efficient than the IPW estimate when the model for the missing mechanism is correctly specified. We also prove that the prediction-model based approaches may have smaller MISEs than the complete data when the prediction model is correctly specified, suggesting that those methods can be applied even when there are no missing values in the memberships. Simulation studies are carried out to assess the performances of methods and a real study example is used for illustrative purposes. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页码:163 / 178
页数:16
相关论文
共 50 条
  • [1] Kernel smoothing density estimation when group membership is subject to missing
    Tang, Wan
    He, Hua
    Gunzler, Douglas
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2012, 142 (03) : 685 - 694
  • [2] Prediction model-based kernel density estimation when group membership is subject to missing
    Hua He
    Wenjuan Wang
    Wan Tang
    AStA Advances in Statistical Analysis, 2017, 101 : 267 - 288
  • [3] Prediction model-based kernel density estimation when group membership is subject to missing
    He, Hua
    Wang, Wenjuan
    Tang, Wan
    ASTA-ADVANCES IN STATISTICAL ANALYSIS, 2017, 101 (03) : 267 - 288
  • [4] A Simple Implementation of Doubly Robust Estimation in Logistic Regression With Covariates Missing at Random
    Tchetgen, Eric J. Tchetgen
    EPIDEMIOLOGY, 2009, 20 (03) : 391 - 394
  • [5] Robust kernel density estimation
    Kim, JooSeuk
    Scott, Clayton
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 3381 - 3384
  • [6] Robust Kernel Density Estimation
    Kim, JooSeuk
    Scott, Clayton D.
    JOURNAL OF MACHINE LEARNING RESEARCH, 2012, 13 : 2529 - 2565
  • [7] Probability density estimation with data missing at random when covariables are present
    Wang, Qihua
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2008, 138 (03) : 568 - 587
  • [8] Kernel density estimation for random fields (density estimation for random fields)
    Carbon, M
    Tran, LT
    Wu, B
    STATISTICS & PROBABILITY LETTERS, 1997, 36 (02) : 115 - 125
  • [9] Kernel density estimation with doubly truncated data
    Moreira, Carla
    de Una-Alvarez, Jacobo
    ELECTRONIC JOURNAL OF STATISTICS, 2012, 6 : 501 - 521
  • [10] DOUBLY ROBUST TREATMENT EFFECT ESTIMATION WITH MISSING ATTRIBUTES
    Mayer, Imke
    Sverdrup, Erik
    Gauss, Tobias
    Moyer, Jean-Denis
    Wager, Stefan
    Josse, Julie
    ANNALS OF APPLIED STATISTICS, 2020, 14 (03): : 1409 - 1431