Methods for the analysis of explanatory linear regression models with missing data not at random

Authors
Pastor, JBN [1]
Affiliations
[1] Univ Autonoma Barcelona, Dept Psicobiol & Metodol, Bellaterra 08193, Spain
Keywords
nonrandom missing data; regression analysis; incomplete maximum likelihood estimation; multiple imputation; Monte Carlo simulation;
DOI
Not available
Chinese Library Classification number
C [Social Sciences, General];
Discipline classification code
03 ; 0303 ;
Abstract
Since the work of Little and Rubin (1987), no substantial advances have been achieved in the analysis of explanatory regression models for incomplete data that are missing not at random, mainly because of the difficulty of verifying the randomness of the unknown data. In practice, nonrandom missing data are analyzed with techniques designed for datasets whose missing data are random or completely random, such as complete case analysis, mean imputation, regression imputation, maximum likelihood, or multiple imputation. However, the data conditions required to minimize the bias that results from this incorrect analysis have not been fully determined. In the present work, several Monte Carlo simulations were carried out to establish which analysis strategy designed for random missing data performs best when applied to datasets with nonrandom missing data. The factors varied in the simulations are sample size, percentage of missing data, predictive power of the imputation model, and the existence of interaction between predictors. The results show that the smallest bias is obtained with maximum likelihood and multiple imputation, although with a low percentage of missing data, no interaction, and high predictive power of the imputation model (data structures that are frequent in research on child and adolescent psychopathology), acceptable results are obtained with the simpler regression imputation.
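The kind of comparison described in the abstract can be illustrated with a small Monte Carlo sketch. This is not the author's simulation design: the data-generating model, the missingness mechanism, the parameter values, and all function names below are assumptions chosen for illustration. Only complete case analysis, mean imputation, and regression imputation are covered; maximum likelihood and multiple imputation are omitted for brevity, and the interaction factor is not modeled.

```python
import numpy as np


def ols(X, y):
    """Least-squares coefficients for y ~ intercept + X."""
    design = np.column_stack([np.ones(len(y)), X])
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    return coef


def simulate_once(rng, n=200, missing_rate=0.2, rho=0.6, beta2=0.5):
    """One replicate: estimate the x2 slope under three missing-data methods."""
    # Assumed model: two correlated predictors; rho sets how well x1 predicts x2.
    x1 = rng.standard_normal(n)
    x2 = rho * x1 + np.sqrt(1 - rho**2) * rng.standard_normal(n)
    y = 1.0 + 0.5 * x1 + beta2 * x2 + rng.standard_normal(n)

    # Assumed nonrandom mechanism: x2 is more likely to be missing
    # when its own (unobserved) value is high.
    score = x2 + rng.standard_normal(n)
    miss = score > np.quantile(score, 1.0 - missing_rate)
    obs = ~miss

    # Complete case analysis: drop rows with missing x2.
    cc = ols(np.column_stack([x1[obs], x2[obs]]), y[obs])[2]

    # Mean imputation: fill missing x2 with the observed mean.
    x2_mean = np.where(miss, x2[obs].mean(), x2)
    mi = ols(np.column_stack([x1, x2_mean]), y)[2]

    # Regression imputation: predict missing x2 from x1 using observed rows.
    a0, a1 = ols(x1[obs], x2[obs])
    x2_reg = np.where(miss, a0 + a1 * x1, x2)
    ri = ols(np.column_stack([x1, x2_reg]), y)[2]

    return {"complete_case": cc, "mean_imputation": mi, "regression_imputation": ri}


def run_simulation(n_reps=500, seed=0, true_beta2=0.5, **kwargs):
    """Average bias of the x2 slope estimate over n_reps replicates."""
    rng = np.random.default_rng(seed)
    draws = [simulate_once(rng, beta2=true_beta2, **kwargs) for _ in range(n_reps)]
    return {m: float(np.mean([d[m] for d in draws]) - true_beta2) for m in draws[0]}


if __name__ == "__main__":
    for rate in (0.1, 0.3, 0.5):
        print(rate, run_simulation(missing_rate=rate))
```

Varying `n`, `missing_rate`, and `rho` loosely mirrors three of the factors named in the abstract (sample size, percentage of missing data, and predictive power of the imputation model); the numbers this sketch produces should not be read as reproducing the paper's results.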
Pages: 363 - 376
Number of pages: 14
Related papers
50 in total
  • [31] Gaussian Scale Mixture Models for Robust Linear Multivariate Regression with Missing Data
    Ala-Luhtala, Juha
    Piche, Robert
    [J]. COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2016, 45 (03) : 791 - 813
  • [32] Empirical Likelihood for Response Differences in Two Linear Regression Models with Missing Data
    Qin, Yong-song
    Qiu, Tao
    Lei, Qing-zhu
    [J]. ACTA MATHEMATICAE APPLICATAE SINICA-ENGLISH SERIES, 2015, 31 (04) : 963 - 976
  • [33] k-Nearest neighbors local linear regression for functional and missing data at random
    Rachdi, Mustapha
    Laksaci, Ali
    Kaid, Zoulikha
    Benchiha, Abbassia
    Al-Awadhi, Fahimah A.
    [J]. STATISTICA NEERLANDICA, 2021, 75 (01) : 42 - 65
  • [35] Missing-data methods for generalized linear models: A comparative review
    Ibrahim, JG
    Chen, MH
    Lipsitz, SR
    Herring, AH
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2005, 100 (469) : 332 - 346
  • [36] Imputed Empirical Likelihood for Partially Linear Models with Covariate Data Missing at Random
    Yang Yiping
    Zhou Yousheng
    [J]. CONTEMPORARY INNOVATION AND DEVELOPMENT IN MANAGEMENT SCIENCE, 2012, : 790 - 795
  • [37] Missing Data Analysis in Regression
    Marcelino, C. G.
    Leite, G. M. C.
    Celes, P.
    Pedreira, C. E.
    [J]. APPLIED ARTIFICIAL INTELLIGENCE, 2022, 36 (01)
  • [38] The more data, the better? Demystifying deletion-based methods in linear regression with missing data
    Xu, Tianchen
    Chen, Kun
    Li, Gen
    [J]. STATISTICS AND ITS INTERFACE, 2022, 15 (04) : 515 - 526
  • [40] Efficiency transfer for regression models with responses missing at random
    Mueller, Ursula U.
    Schick, Anton
    [J]. BERNOULLI, 2017, 23 (4A) : 2693 - 2719