Methods for the analysis of explanatory linear regression models with missing data not at random

Cited by: 0

Authors:

Pastor, JBN [1]

Affiliation:

[1] Univ Autonoma Barcelona, Dept Psicobiol & Metodol, Bellaterra 08193, Spain

Source:

QUALITY & QUANTITY | 2003 / Vol. 37 / No. 04

Keywords:

nonrandom missing data; regression analysis; incomplete maximum likelihood estimation; multiple imputation; Monte Carlo simulation;

DOI:

Not available

Chinese Library Classification:

C [Social Sciences, General];

Discipline classification code:

03 ; 0303 ;

Abstract:

Since the work of Little and Rubin (1987), no substantial advances have been achieved in the analysis of explanatory regression models for incomplete data that are missing not at random, mainly because of the difficulty of verifying the randomness of the unknown data. In practice, nonrandom missing data are analyzed with techniques designed for datasets with random or completely random missing data, such as complete case analysis, mean imputation, regression imputation, maximum likelihood, or multiple imputation. However, the data conditions required to minimize the bias derived from an incorrect analysis have not been fully determined. In the present work, several Monte Carlo simulations were carried out to establish which analysis strategy designed for random missing data is best applied to datasets with nonrandom missing data. The factors involved in the simulations are sample size, percentage of missing data, predictive power of the imputation model, and existence of interaction between predictors. The results show that the smallest bias is obtained with maximum likelihood and multiple imputation techniques, although with low percentages of missing data, absence of interaction, and high predictive power of the imputation model (data structures that are frequent in research on child and adolescent psychopathology), acceptable results are obtained with the simplest regression imputation.
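
The kind of Monte Carlo comparison described in the abstract can be sketched as follows. This is a minimal illustration, not the design used in the paper: the data-generating model, the coefficient values, the missingness mechanism, and all function and parameter names (`simulate_once`, `run_simulation`, `rho`, `missing_rate`) are assumptions made for the example. It covers only three of the simpler techniques (complete case analysis, mean imputation, regression imputation) and omits the interaction factor, maximum likelihood, and multiple imputation.

```python
import numpy as np

TRUE_B2 = 0.5  # hypothetical true coefficient of x2, chosen for this sketch


def ols(X, y):
    """Ordinary least squares with an intercept; returns the coefficient vector."""
    design = np.column_stack([np.ones(len(y)), X])
    return np.linalg.lstsq(design, y, rcond=None)[0]


def simulate_once(rng, n=200, missing_rate=0.2, rho=0.7):
    """One replication: returns the x2 coefficient estimated by three methods."""
    # rho is the correlation between x1 and x2, i.e. how well x1 predicts x2
    x1 = rng.normal(size=n)
    x2 = rho * x1 + np.sqrt(1 - rho**2) * rng.normal(size=n)
    y = 1.0 + 0.5 * x1 + TRUE_B2 * x2 + rng.normal(size=n)

    # Assumed nonrandom mechanism: large values of x2 are more likely to be missing
    score = x2 + rng.normal(size=n)
    miss = score > np.quantile(score, 1 - missing_rate)
    obs = ~miss

    # Complete case analysis: drop rows where x2 is missing
    b_cc = ols(np.column_stack([x1[obs], x2[obs]]), y[obs])[2]

    # Mean imputation: fill missing x2 with the observed mean
    x2_mean = x2.copy()
    x2_mean[miss] = x2[obs].mean()
    b_mean = ols(np.column_stack([x1, x2_mean]), y)[2]

    # Regression imputation: predict missing x2 from x1 using the observed cases
    a = ols(x1[obs], x2[obs])
    x2_reg = x2.copy()
    x2_reg[miss] = a[0] + a[1] * x1[miss]
    b_reg = ols(np.column_stack([x1, x2_reg]), y)[2]

    return b_cc, b_mean, b_reg


def run_simulation(n_reps=200, seed=0, **kwargs):
    """Average bias of the x2 coefficient over n_reps replications."""
    rng = np.random.default_rng(seed)
    est = np.array([simulate_once(rng, **kwargs) for _ in range(n_reps)])
    bias = est.mean(axis=0) - TRUE_B2
    names = ["complete_case", "mean_imputation", "regression_imputation"]
    return dict(zip(names, bias))


if __name__ == "__main__":
    for name, value in run_simulation().items():
        print(f"{name}: bias = {value:+.3f}")
```

Varying `n`, `missing_rate`, and `rho` in `run_simulation` corresponds loosely to the factors named in the abstract (sample size, percentage of missing data, predictive power of the imputation model).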


Pages: 363-376

Page count: 14

