A Comparison of Estimation Methods for Missing Data in Multiple Linear Regression with Two Independent Variables

被引:0
|
作者
Suraphee, Sujitta [1 ]
Raksmanee, Chancharoen [1 ]
Busaba, Jaruchat [1 ]
Chaisorn, Chanchai [1 ]
Nakornthai, Wilaiwan [1 ]
机构
[1] Mahasarakham Univ, Fac Sci, Dept Math, Maha Sarakham 44150, Thailand
来源
THAILAND STATISTICIAN | 2006年 / 4卷
关键词
EM algorithm; Missing Data; Multiple Regression; Pairwise Deletion; Regression Imputation;
D O I
暂无
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In multiple linear regression, if the incomplete values occur in sample, many researchers will use the statistical software packages which delete the cases that deal with missing value. Then they use the ordinary least square (OLS) to estimate parameters in the regression model. However, the validity of this approach is decreased in making inference because the size of the sample reduces. In this paper, when missing values occur in response variable, we perform a simulation study of imputation based procedures and indicate that missing values should be filled by the EM algorithm and regression imputation methods, which have been more efficient than deletion of observations. However, in case of having the large sample sizes and small variances of errors and small proportions of missing values, pairwise deletion methods can be used efficiently. Further, we analyze completed data by ordinary least square method to predict regression coefficients in the regression model. This leads to a result with minimum mean squared errors.
引用
收藏
页码:13 / 26
页数:14
相关论文
共 50 条