Doubly robust kernel density estimation when group membership is missing at random

被引:0
|
作者
Zhang, Chenguang [1 ]
He, Hua [2 ]
Li, Jian [1 ]
Tang, Wan [1 ]
机构
[1] Tulane Univ, Sch Publ Hlth & Trop Med, Dept Biostat & Data Sci, New Orleans, LA 70118 USA
[2] Tulane Univ, Sch Publ Hlth & Trop Med, Dept Epidemiol, New Orleans, LA USA
关键词
Kernel smoothing; Missing at random; Doubly robust; VERIFICATION; SUBJECT;
D O I
10.1016/j.jspi.2019.09.010
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
When there are subjects with subpopulation memberships missing, the kernel density estimates of the subpopulations based on the subjects with verified memberships may not be valid unless the missingness of the memberships satisfies the stringent missing completely at random assumption. Under more plausible missing at random (MAR) assumptions, the inverse probability weighting (IPW) kernel smoothing method can be employed if the missing mechanism is well understood. On the other hand, if the memberships can be well predicted by a prediction model, one may replace the memberships (either all or only the missing ones), with the probabilities for the memberships to deal with the missing values. By integrating the information of the missing mechanism and the prediction model, we develop a doubly robust kernel density estimate which is valid as long as either of the models is correctly specified. Asymptotic properties of the new estimates are derived and the asymptotic orders of the mean integrated square errors (MISE) are compared with existing methods. We prove that the doubly robust estimate can be more efficient than the IPW estimate when the model for the missing mechanism is correctly specified. We also prove that the prediction-model based approaches may have smaller MISEs than the complete data when the prediction model is correctly specified, suggesting that those methods can be applied even when there are no missing values in the memberships. Simulation studies are carried out to assess the performances of methods and a real study example is used for illustrative purposes. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页码:163 / 178
页数:16
相关论文
共 50 条
  • [41] Robust normal reference bandwidth for kernel density estimation
    Zhang, Jin
    Wang, Xueren
    STATISTICA NEERLANDICA, 2009, 63 (01) : 13 - 23
  • [42] KERNEL ESTIMATION WHEN DENSITY MAY NOT EXIST: A CORRIGENDUM
    Zinde-Walsh, Victoria
    ECONOMETRIC THEORY, 2017, 33 (05) : 1259 - 1263
  • [43] Fast Kernel Density Estimation with Density Matrices and Random Fourier Features
    Gallego, Joseph A.
    Osorio, Juan F.
    Gonzalez, Fabio A.
    ADVANCES IN ARTIFICIAL INTELLIGENCE-IBERAMIA 2022, 2022, 13788 : 160 - 172
  • [44] Robust propensity score weighting estimation under missing at random
    Wang, Hengfang
    Kim, Jae Kwang
    Han, Jeongseop
    Lee, Youngjo
    ELECTRONIC JOURNAL OF STATISTICS, 2024, 18 (02): : 2687 - 2720
  • [45] Efficient Robust Estimation for Linear Models with Missing Response at Random
    Tang, Man-Lai
    Tang, Nian-Sheng
    Zhao, Pu-Ying
    Zhu, Hongtu
    SCANDINAVIAN JOURNAL OF STATISTICS, 2018, 45 (02) : 366 - 381
  • [46] Robust estimation for moment condition models with data missing not at random
    Li, Wei
    Yang, Shu
    Han, Peisong
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2020, 207 : 246 - 254
  • [47] Robust estimation of single index models with responses missing at random
    Ash Abebe
    Huybrechts F. Bindele
    Masego Otlaadisa
    Boikanyo Makubate
    Statistical Papers, 2021, 62 : 2195 - 2225
  • [48] Robust estimation of single index models with responses missing at random
    Abebe, Ash
    Bindele, Huybrechts F.
    Otlaadisa, Masego
    Makubate, Boikanyo
    STATISTICAL PAPERS, 2021, 62 (05) : 2195 - 2225
  • [49] Distributionally Robust Fault Detection by using Kernel Density Estimation
    Xue, Ting
    Zhong, Maiying
    Luo, Lijia
    Li, Linlin
    Ding, Steven X.
    IFAC PAPERSONLINE, 2020, 53 (02): : 652 - 657
  • [50] A bandwidth selection for kernel density estimation of functions of random variables
    Mugdadi, AR
    Ahmad, IA
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2004, 47 (01) : 49 - 62