Imputation methods for missing data in educational diagnostic evaluation

被引:0
|
作者
Fernandez-Alonso, Ruben [2 ,3 ]
Suarez-Alvarez, Javier
Muniz, Jose [1 ]
机构
[1] Univ Oviedo, Fac Psicol, Oviedo 33003, Spain
[2] Consejeria Educ, Oviedo, Spain
[3] Univ Gobierno Principado Asturias, Oviedo, Spain
关键词
D O I
暂无
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
Imputation methods for missing data in educational diagnostic evaluation. In the diagnostic evaluation of educational systems, self-reports are commonly used to collect data, both cognitive and orectic. For various reasons, in these self-reports, some of the students' data are frequently missing. The main goal of this research is to compare the performance of different imputation methods for missing data in the context of the evaluation of educational systems. On an empirical database of 5,000 subjects, 72 conditions were simulated: three levels of missing data, three types of loss mechanisms, and eight methods of imputation. The levels of missing data were 5%, 10%, and 20%. The loss mechanisms were set at: Missing completely at random, moderately conditioned, and strongly conditioned. The eight imputation methods used were: listwise deletion, replacement by the mean of the scale, by the item mean, the subject mean, the corrected subject mean, multiple regression, and Expectation-Maximization (EM) algorithm, with and without auxiliary variables. The results indicate that the recovery of the data is more accurate when using an appropriate combination of different methods of recovering lost data. When a case is incomplete, the mean of the subject works very well, whereas for completely lost data, multiple imputation with the EM algorithm is recommended. The use of this combination is especially recommended when data loss is greater and its loss mechanism is more conditioned. Lastly, the results are discussed, and some future lines of research are analyzed.
引用
收藏
页码:167 / 175
页数:9
相关论文
共 50 条
  • [41] Missing data incremental imputation through tree based methods
    Conversano, C
    Cappelli, C
    [J]. COMPSTAT 2002: PROCEEDINGS IN COMPUTATIONAL STATISTICS, 2002, : 455 - 460
  • [42] Methods for imputation of missing values in air quality data sets
    Junninen, H
    Niska, H
    Tuppurainen, K
    Ruuskanen, J
    Kolehmainen, M
    [J]. ATMOSPHERIC ENVIRONMENT, 2004, 38 (18) : 2895 - 2907
  • [43] Imputation Methods for Handling Missing Dietary Supplement Dosage Data
    Leung, June
    Dwyer, Johanna
    Hibberd, Patricia
    Jacques, Paul
    Rand, William
    [J]. JOURNAL OF RENAL NUTRITION, 2010, 20 (05) : 342 - 347
  • [44] Evaluation of Multiple Imputation Methods for Missing Diary Data for Statistical Analysis in Dry Eye Studies
    Slade, Lot
    Bateman, Kirk
    Usner, Dale W.
    [J]. INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2018, 59 (09)
  • [45] From Predictive Methods to Missing Data Imputation: An Optimization Approach
    Bertsimas, Dimitris
    Pawlowski, Colin
    Zhuo, Ying Daisy
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2018, 18
  • [46] Comparison of imputation methods for missing production data of dairy cattle
    You, J.
    Ellis, J. L.
    Adams, S.
    Sahar, M.
    Jacobs, M.
    Tulpan, D.
    [J]. ANIMAL, 2023, 17
  • [47] Comparison of missing value imputation methods for crop yield data
    Lokupitiya, Ravindra S.
    Lokupitiya, Erandathie
    Paustian, Keith
    [J]. ENVIRONMETRICS, 2006, 17 (04) : 339 - 349
  • [48] A comparison of multiple imputation methods for missing data in longitudinal studies
    Huque, Md Hamidul
    Carlin, John B.
    Simpson, Julie A.
    Lee, Katherine J.
    [J]. BMC MEDICAL RESEARCH METHODOLOGY, 2018, 18
  • [49] A comparison of multiple imputation methods for missing data in longitudinal studies
    Md Hamidul Huque
    John B. Carlin
    Julie A. Simpson
    Katherine J. Lee
    [J]. BMC Medical Research Methodology, 18
  • [50] Efficient Imputation Methods to Handle Missing Data in Sample Surveys
    Singh, G. N.
    Jaiswal, Ashok K.
    [J]. JOURNAL OF STATISTICAL THEORY AND PRACTICE, 2022, 16 (03)