Benefits from computerized adaptive testing as seen in simulation studies

被引:11
|
作者
Hornke, LF [1 ]
机构
[1] Aachen Tech Univ, Inst Psychol, D-52056 Aachen, Germany
关键词
adaptive testing; item banking; item pools; matrices; verbal analogies; number problems;
D O I
10.1027//1015-5759.15.2.91
中图分类号
B849 [应用心理学];
学科分类号
040203 ;
摘要
Item parameters for several hundreds of items were estimated based on empirical data from several thousands of subjects. The logistic one-parameter (1PL) and two-parameter (2PL) model estimates were evaluated. However, model fit showed that only a subset of items complied sufficiently, so that the remaining ones were assembled in well-fitting item banks. In several simulation studies 5000 simulated responses were generated in accordance with a computerized adaptive test procedure along with person parameters. A,general reliability of .80 or a standard error of measurement of .44 was used as a stopping rule to end CAT testing. We also recorded how often each item was used by all simulees. Person-parameter estimates based on CAT correlated higher than .90 with true values simulated. For all 1PL fitting item banks most simulees used more than 20 items but less than 30 items to reach the pre-set level of measurement error. However, testing based on item banks that complied to the 2PL revealed that, on average, only 10 items were sufficient to end testing at the same measurement error level. Both clearly demonstrate the precision and economy of computerized: adaptive testing. Empirical evaluations from everyday uses will show whether these trends will holdup in practice. If so, CAT will become possible and reasonable with some 150 well-calibrated 2PL items.
引用
收藏
页码:91 / 98
页数:8
相关论文
共 50 条