Multiple Imputation with Predictive Mean Matching Method for Numerical Missing Data

Cited by: 10
Authors
Akmam, Emha Fathul [1 ]
Siswantining, Titin [1 ]
Soemartojo, Saskya Mary [1 ]
Sarwinda, Devvi [1 ]
Affiliation
[1] Univ Indonesia, Dept Math, Fac Math & Nat Sci, Depok, Indonesia
Source
2019 3RD INTERNATIONAL CONFERENCE ON INFORMATICS AND COMPUTATIONAL SCIENCES (ICICOS 2019) | 2019
Keywords
linear regression analysis; multiple imputation; missing values; predictive mean matching
DOI
10.1109/icicos48119.2019.8982510
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Missing data is the condition in which some observations in a data set have missing values or empty entries. It can hinder statistical analysis and may lead to biased conclusions if it is not handled properly. This problem arises in linear regression analysis, among other settings. One way to handle it is a multiple imputation (MI) method called Predictive Mean Matching (PMM). PMM matches the predictive mean of each incomplete observation against the predictive means of the complete observations. To follow the multiple imputation concept, the predictive means of the incomplete observations are estimated with a Bayesian approach, while those of the complete observations are estimated with ordinary least squares. The complete observation whose predictive mean is closest then serves as the donor, and its observed value is imputed for the incomplete one. Simulated data with two variables (x and y), a univariate missing-data pattern (on y), and a missing at random (MAR) mechanism are used to analyze the effectiveness of PMM, based on the estimated relative efficiency for the missing covariate data. The regression analysis uses x as the independent variable and y as the dependent variable. The results show that PMM gives regression coefficients that are significant at the 5% level of significance and loses only 1% of relative efficiency.
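The matching step described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the function name pmm_impute, the default of m = 5 imputations, the noninformative-prior draw of the regression parameters, and the single-nearest-donor rule are all choices made here for the example, and NaN is assumed to mark a missing y.

```python
import numpy as np

def pmm_impute(x, y, m=5, seed=0):
    """Return m imputed copies of y, where NaN marks a missing value.

    Hypothetical helper for illustration; the name and defaults are assumptions.
    """
    rng = np.random.default_rng(seed)
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    miss = np.isnan(y)
    obs = ~miss

    # Design matrix with an intercept column.
    X = np.column_stack([np.ones_like(x), x])
    X_obs, y_obs, X_mis = X[obs], y[obs], X[miss]

    # Ordinary least squares fit on the complete observations.
    XtX_inv = np.linalg.inv(X_obs.T @ X_obs)
    beta_hat = XtX_inv @ X_obs.T @ y_obs
    resid = y_obs - X_obs @ beta_hat
    df = obs.sum() - X.shape[1]

    # Predictive means of the complete observations use the OLS estimate.
    mean_obs = X_obs @ beta_hat

    imputations = []
    for _ in range(m):
        # Bayesian draw (assumed noninformative prior): sigma^2 first,
        # then beta given sigma^2.
        sigma2_star = resid @ resid / rng.chisquare(df)
        beta_star = rng.multivariate_normal(beta_hat, sigma2_star * XtX_inv)

        # Predictive means of the incomplete observations use the drawn beta.
        mean_mis = X_mis @ beta_star

        # Donor = complete observation with the closest predictive mean.
        dist = np.abs(mean_mis[:, None] - mean_obs[None, :])
        donors = dist.argmin(axis=1)

        y_imp = y.copy()
        y_imp[miss] = y_obs[donors]
        imputations.append(y_imp)
    return imputations

# Usage on simulated data: y is missing whenever x is large (a MAR mechanism).
x_demo = np.linspace(0.0, 10.0, 50)
y_demo = 2.0 + 3.0 * x_demo + np.random.default_rng(1).normal(0.0, 1.0, 50)
y_demo[x_demo > 7.0] = np.nan
completed = pmm_impute(x_demo, y_demo, m=5, seed=42)
```

Because every imputed value is copied from an observed y, the completed data stay within the range of the observed data. Each of the m completed data sets would then be analyzed separately and the estimates pooled.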
Pages: 6