Handling missing data: analysis of a challenging data set using multiple imputation

被引:71
|
作者
Pampaka, Maria [1 ]
Hutcheson, Graeme [1 ]
Williams, Julian [1 ]
机构
[1] Univ Manchester, Manchester Inst Educ, Room B4-10 Ellen Wilkinson Bldg,Oxford Rd, Manchester M13 9PL, Lancs, England
基金
英国经济与社会研究理事会;
关键词
missing data; surveys; multiple imputation; regression; modelling;
D O I
10.1080/1743727X.2014.979146
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
Missing data is endemic in much educational research. However, practices such as step-wise regression common in the educational research literature have been shown to be dangerous when significant data are missing, and multiple imputation (MI) is generally recommended by statisticians. In this paper, we provide a review of these advances and their implications for educational research. We illustrate the issues with an educational, longitudinal survey in which missing data was significant, but for which we were able to collect much of these missing data through subsequent data collection. We thus compare methods, that is, step-wise regression (basically ignoring the missing data) and MI models, with the model from the actual enhanced sample. The value of MI is discussed and the risks involved in ignoring missing data are considered. Implications for research practice are discussed.
引用
收藏
页码:19 / 37
页数:19
相关论文
共 50 条
  • [1] Multiple Imputation A Flexible Tool for Handling Missing Data
    Li, Peng
    Stuart, Elizabeth A.
    Allison, David B.
    JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2015, 314 (18): : 1966 - 1967
  • [2] Handling missing data in nursing research with multiple imputation
    Kneipp, SM
    McIntosh, M
    NURSING RESEARCH, 2001, 50 (06) : 384 - 389
  • [3] Multiple imputation of missing data for survey data analysis
    Lupo, Coralie
    Le Bouquin, Sophie
    Michel, Virginie
    Colin, Pierre
    Chauvin, Claire
    EPIDEMIOLOGIE ET SANTE ANIMALE, 2008, NO 53, 2008, (53): : 73 - 83
  • [4] Handling Missing Values in Longitudinal Panel Data With Multiple Imputation
    Young, Rebekah
    Johnson, David R.
    JOURNAL OF MARRIAGE AND FAMILY, 2015, 77 (01) : 277 - 294
  • [5] Hot Deck Multiple Imputation for Handling Missing Accelerometer Data
    Nicole M. Butera
    Siying Li
    Kelly R. Evenson
    Chongzhi Di
    David M. Buchner
    Michael J. LaMonte
    Andrea Z. LaCroix
    Amy Herring
    Statistics in Biosciences, 2019, 11 : 422 - 448
  • [6] Hot Deck Multiple Imputation for Handling Missing Accelerometer Data
    Butera, Nicole M.
    Li, Siying
    Evenson, Kelly R.
    Di, Chongzhi
    Buchner, David M.
    LaMonte, Michael J.
    LaCroix, Andrea Z.
    Herring, Amy
    STATISTICS IN BIOSCIENCES, 2019, 11 (02) : 422 - 448
  • [7] The use of multiple imputation for the analysis of missing data
    Sinharay, S
    Stern, HS
    Russell, D
    PSYCHOLOGICAL METHODS, 2001, 6 (04) : 317 - 329
  • [8] Regression multiple imputation for missing data analysis
    Yu, Lili
    Liu, Liang
    Peace, Karl E.
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2020, 29 (09) : 2647 - 2664
  • [9] Handling Missing Data in Matched Case-Control Studies Using Multiple Imputation
    Seaman, Shaun R.
    Keogh, Ruth H.
    BIOMETRICS, 2015, 71 (04) : 1150 - 1159
  • [10] Missing Data and Multiple Imputation
    Cummings, Peter
    JAMA PEDIATRICS, 2013, 167 (07) : 656 - 661