Nested multiple imputation in large-scale assessments

被引：14

作者：

Weirich, Sebastian ^{[1
]}

Haag, Nicole ^{[1
]}

Hecht, Martin ^{[1
]}

Boehme, Katrin ^{[1
]}

Siegle, Thilo ^{[1
]}

Luedtke, Oliver ^{[2
]}

机构：

[1] Humboldt Univ, Inst Educ Qual Improvement, Unter Linden 6, D-10099 Berlin, Germany

[2] Leibniz Inst Sci & Math Educ IPN, Ctr Int Student Assessment, Olshausenstr 62, D-24118 Kiel, Germany

来源：

LARGE-SCALE ASSESSMENTS IN EDUCATION | 2014年 / 2卷 / 01期

关键词：

Large-scale assessment; Missing data; Imputation; Simulation; Item response theory;

D O I：

10.1186/s40536-014-0009-0

中图分类号：

G40 [教育学];

学科分类号：

040101 ; 120403 ;

摘要：

Background: In order to measure the proficiency of person populations in various domains, large-scale assessments often use marginal maximum likelihood IRT models where person proficiency is modelled as a random variable. Thus, the model does not provide proficiency estimates for any single person. A popular approach to derive these proficiency estimates is the multiple imputation of plausible values (PV) to enable subsequent analyses on complete data sets. The main drawback is that all variables that are to be analyzed later have to be included in the imputation model to allow the distribution of plausible values to be conditional on these variables. These background variables (e.g., sex, age) have to be fully observed which is highly unlikely in practice. In several current large-scale assessment programs missing observations on background variables are dummy coded, and subsequently, dummy codes are used additionally in the PV imputation model. However, this approach is only appropriate for small proportions of missing data. Otherwise the resulting population scores may be biased. Methods: Alternatively, single imputation or multiple imputation methods can be used to account for missing values on background variables. With both imputation methods, the result is a two-step procedure in which the PV imputation is nested within the background variable imputation. In the single+multiple-imputation (SMI), each missing value on background variables is replaced by one value. In the multiple+multiple-imputation (MMI), each missing value is replaced by a set of imputed values. MMI is expected to outperform SMI as SMI ignores the uncertainty due to missing values in the background data. Results: In a simulation study, both methods yielded unbiased population estimates under most conditions. Still, the recovery proportion was slightly higher for the MMI method. Conclusions: The advantages of the MMI method are apparent for fairly high proportions of missing values in combination with fairly high dependency between the latent trait and the probability of missing data on background variables.

引用

页数：18

共 50 条

[1] Multiple imputation in a large-scale complex survey: a practical guide
He, Y.
Zaslavsky, A. M.
Landrum, M. B.
Harrington, D. P.
Catalano, P.
[J]. STATISTICAL METHODS IN MEDICAL RESEARCH, 2010, 19 (06) : 653 - 670
[2] Bayesian multiple imputation for large-scale categorical data with structural zeros
Manrique-Vallier, Daniel
Reiter, Jerome P.
[J]. SURVEY METHODOLOGY, 2014, 40 (01) : 125 - 134
[3] On Matrix Sampling and Imputation of Context Questionnaires With Implications for the Generation of Plausible Values in Large-Scale Assessments
Kaplan, David
Su, Dan
[J]. JOURNAL OF EDUCATIONAL AND BEHAVIORAL STATISTICS, 2016, 41 (01) : 57 - 80
[4] Statistical inference with large-scale trait imputation
Ren, Jingchen
Pan, Wei
[J]. STATISTICS IN MEDICINE, 2024, 43 (04) : 625 - 641
[5] Improved Phasing and Imputation for Large-Scale Data
Browning, Brian L.
Browning, Sharon R.
Tian, Xiaowen
[J]. GENETIC EPIDEMIOLOGY, 2017, 41 (07) : 673 - 673
[6] LARGE-SCALE ASSESSMENTS AND READING THEORIES
Hachey, Krystal K.
[J]. 2011 4TH INTERNATIONAL CONFERENCE OF EDUCATION, RESEARCH AND INNOVATION (ICERI), 2011, : 1019 - 1026
[7] Nonparametric Bayesian Multiple Imputation for Incomplete Categorical Variables in Large-Scale Assessment Surveys
Si, Yajuan
Reiter, Jerome P.
[J]. JOURNAL OF EDUCATIONAL AND BEHAVIORAL STATISTICS, 2013, 38 (05) : 499 - 521
[8] Large-scale international assessments of learning outcomes: balancing the interests of multiple stakeholders
Li, Guirong
Shcheglova, Irina
Bhuradia, Ashutosh
Li, Yanyan
Loyalka, Prashant
Zhou, Olivia
Hu, Shangfeng
Yu, Ningning
Ma, Liping
Guo, Fei
Chirikov, Igor
[J]. JOURNAL OF HIGHER EDUCATION POLICY AND MANAGEMENT, 2021, 43 (02) : 198 - 213
[9] RALSA: the R analyzer for large-scale assessments
Mirazchiyski, Plamen, V
[J]. LARGE-SCALE ASSESSMENTS IN EDUCATION, 2021, 9 (01)
[10] Linking Large-Scale Reading Assessments: Comment
Hanushek, Eric A.
[J]. MEASUREMENT-INTERDISCIPLINARY RESEARCH AND PERSPECTIVES, 2016, 14 (01) : 27 - 29

← 1 2 3 4 5 →