Nested multiple imputation in large-scale assessments

被引:14
|
作者
Weirich, Sebastian [1 ]
Haag, Nicole [1 ]
Hecht, Martin [1 ]
Boehme, Katrin [1 ]
Siegle, Thilo [1 ]
Luedtke, Oliver [2 ]
机构
[1] Humboldt Univ, Inst Educ Qual Improvement, Unter Linden 6, D-10099 Berlin, Germany
[2] Leibniz Inst Sci & Math Educ IPN, Ctr Int Student Assessment, Olshausenstr 62, D-24118 Kiel, Germany
关键词
Large-scale assessment; Missing data; Imputation; Simulation; Item response theory;
D O I
10.1186/s40536-014-0009-0
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
Background: In order to measure the proficiency of person populations in various domains, large-scale assessments often use marginal maximum likelihood IRT models where person proficiency is modelled as a random variable. Thus, the model does not provide proficiency estimates for any single person. A popular approach to derive these proficiency estimates is the multiple imputation of plausible values (PV) to enable subsequent analyses on complete data sets. The main drawback is that all variables that are to be analyzed later have to be included in the imputation model to allow the distribution of plausible values to be conditional on these variables. These background variables (e.g., sex, age) have to be fully observed which is highly unlikely in practice. In several current large-scale assessment programs missing observations on background variables are dummy coded, and subsequently, dummy codes are used additionally in the PV imputation model. However, this approach is only appropriate for small proportions of missing data. Otherwise the resulting population scores may be biased. Methods: Alternatively, single imputation or multiple imputation methods can be used to account for missing values on background variables. With both imputation methods, the result is a two-step procedure in which the PV imputation is nested within the background variable imputation. In the single+multiple-imputation (SMI), each missing value on background variables is replaced by one value. In the multiple+multiple-imputation (MMI), each missing value is replaced by a set of imputed values. MMI is expected to outperform SMI as SMI ignores the uncertainty due to missing values in the background data. Results: In a simulation study, both methods yielded unbiased population estimates under most conditions. Still, the recovery proportion was slightly higher for the MMI method. Conclusions: The advantages of the MMI method are apparent for fairly high proportions of missing values in combination with fairly high dependency between the latent trait and the probability of missing data on background variables.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Multiple imputation in a large-scale complex survey: a practical guide
    He, Y.
    Zaslavsky, A. M.
    Landrum, M. B.
    Harrington, D. P.
    Catalano, P.
    [J]. STATISTICAL METHODS IN MEDICAL RESEARCH, 2010, 19 (06) : 653 - 670
  • [2] Bayesian multiple imputation for large-scale categorical data with structural zeros
    Manrique-Vallier, Daniel
    Reiter, Jerome P.
    [J]. SURVEY METHODOLOGY, 2014, 40 (01) : 125 - 134
  • [3] On Matrix Sampling and Imputation of Context Questionnaires With Implications for the Generation of Plausible Values in Large-Scale Assessments
    Kaplan, David
    Su, Dan
    [J]. JOURNAL OF EDUCATIONAL AND BEHAVIORAL STATISTICS, 2016, 41 (01) : 57 - 80
  • [4] Statistical inference with large-scale trait imputation
    Ren, Jingchen
    Pan, Wei
    [J]. STATISTICS IN MEDICINE, 2024, 43 (04) : 625 - 641
  • [5] Improved Phasing and Imputation for Large-Scale Data
    Browning, Brian L.
    Browning, Sharon R.
    Tian, Xiaowen
    [J]. GENETIC EPIDEMIOLOGY, 2017, 41 (07) : 673 - 673
  • [6] LARGE-SCALE ASSESSMENTS AND READING THEORIES
    Hachey, Krystal K.
    [J]. 2011 4TH INTERNATIONAL CONFERENCE OF EDUCATION, RESEARCH AND INNOVATION (ICERI), 2011, : 1019 - 1026
  • [7] Nonparametric Bayesian Multiple Imputation for Incomplete Categorical Variables in Large-Scale Assessment Surveys
    Si, Yajuan
    Reiter, Jerome P.
    [J]. JOURNAL OF EDUCATIONAL AND BEHAVIORAL STATISTICS, 2013, 38 (05) : 499 - 521
  • [8] Large-scale international assessments of learning outcomes: balancing the interests of multiple stakeholders
    Li, Guirong
    Shcheglova, Irina
    Bhuradia, Ashutosh
    Li, Yanyan
    Loyalka, Prashant
    Zhou, Olivia
    Hu, Shangfeng
    Yu, Ningning
    Ma, Liping
    Guo, Fei
    Chirikov, Igor
    [J]. JOURNAL OF HIGHER EDUCATION POLICY AND MANAGEMENT, 2021, 43 (02) : 198 - 213
  • [9] RALSA: the R analyzer for large-scale assessments
    Mirazchiyski, Plamen, V
    [J]. LARGE-SCALE ASSESSMENTS IN EDUCATION, 2021, 9 (01)
  • [10] Linking Large-Scale Reading Assessments: Comment
    Hanushek, Eric A.
    [J]. MEASUREMENT-INTERDISCIPLINARY RESEARCH AND PERSPECTIVES, 2016, 14 (01) : 27 - 29