Methods for the analysis of explanatory linear regression models with missing data not at random

被引：0

作者：

Pastor, JBN ^{[1
]}

机构：

[1] Univ Autonoma Barcelona, Dept Psicobiol & Metodol, Bellaterra 08193, Spain

来源：

QUALITY & QUANTITY | 2003年 / 37卷 / 04期

关键词：

nonrandom missing data; regression analysis; incomplete maximum likelihood estimation; multiple imputation; Monte Carlo simulation;

D O I：

暂无

中图分类号：

C [社会科学总论];

学科分类号：

03 ; 0303 ;

摘要：

Since the work of Little and Rubin ( 1987) not substantial advances in the analysis of explanatory regression models for incomplete data with missing not at random have been achieved, mainly due to the difficulty of verifying the randomness of the unknown data. In practice, the analysis of nonrandom missing data is done with techniques designed for datasets with random or completely random missing data, as complete case analysis, mean imputation, regression imputation, maximum likelihood or multiple imputation. However, the data conditions required to minimize the bias derived from an incorrect analysis have not been fully determined. In the present work, several Monte Carlo simulations have been carried out to establish the best strategy of analysis for random missing data applicable in datasets with nonrandom missing data. The factors involved in simulations are sample size, percentage of missing data, predictive power of the imputation model and existence of interaction between predictors. The results show that the smallest bias is obtained with maximum likelihood and multiple imputation techniques, although with low percentages of missing data, absence of interaction and high predictive power of the imputation model ( frequent data structures in research on child and adolescent psychopathology) acceptable results are obtained with the simplest regression imputation.

引用

页码：363 / 376

页数：14

共 50 条

[1] Methods for the Analysis of Explanatory Linear Regression Models with Missing Data Not at Random
José Blas Navarro Pastor
[J]. Quality and Quantity, 2003, 37 (4) : 363 - 376
[2] Indicator and stratification methods for missing explanatory variables in multiple linear regression
Jones, MP
[J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1996, 91 (433) : 222 - 230
[3] Development of Imputation Methods for Missing Data in Multiple Linear Regression Analysis
Thidarat Thongsri
Klairung Samart
[J]. Lobachevskii Journal of Mathematics, 2022, 43 : 3390 - 3399
[4] Development of Imputation Methods for Missing Data in Multiple Linear Regression Analysis
Thongsri, Thidarat
Samart, Klairung
[J]. LOBACHEVSKII JOURNAL OF MATHEMATICS, 2022, 43 (11) : 3390 - 3399
[5] Local linear regression for generalized linear models with missing data
Wang, CY
Wang, SJ
Gutierrez, RG
Carroll, RJ
[J]. ANNALS OF STATISTICS, 1998, 26 (03): : 1028 - 1050
[6] Unified approach for regression models with nonmonotone missing at random data
Zhao, Yang
Liu, Meng
[J]. ASTA-ADVANCES IN STATISTICAL ANALYSIS, 2021, 105 (01) : 87 - 101
[7] Copula-based regression models with data missing at random
Hamori, Shigeyuki
Motegi, Kaiji
Zhang, Zheng
[J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2020, 180
[8] Unified approach for regression models with nonmonotone missing at random data
Yang Zhao
Meng Liu
[J]. AStA Advances in Statistical Analysis, 2021, 105 : 87 - 101
[9] Bayesian methods for generalized linear models with covariates missing at random
Ibrahim, JG
Chen, MH
Lipsitz, SR
[J]. CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2002, 30 (01): : 55 - 78
[10] Composite Imputation Method for the Multiple Linear Regression with Missing at Random Data
Thongsri, Thidarat
Samart, Klairung
[J]. INTERNATIONAL JOURNAL OF MATHEMATICS AND COMPUTER SCIENCE, 2022, 17 (01): : 51 - 62

← 1 2 3 4 5 →