Optimizing the Use of Response Times for Item Selection in Computerized Adaptive Testing

被引:20
|
作者
Choe, Edison M. [1 ]
Kern, Justin L. [2 ]
Chang, Hua-Hua [3 ]
机构
[1] GMAC, 11921 Freedom Dr,Suite 300, Reston, VA 20190 USA
[2] Univ Calif Merced, 5200 North Lake Rd, Merced, CA 95343 USA
[3] Univ Illinois, Psychol, 603 East Daniel St, Champaign, IL 61820 USA
关键词
computerized adaptive testing; response time; item selection; item exposure; test overlap; DIFFERENTIAL SPEEDEDNESS; MAXIMUM INFORMATION; ABILITY ESTIMATION; EXPOSURE; MODEL; ACCURACY; CAT;
D O I
10.3102/1076998617723642
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
Despite common operationalization, measurement efficiency of computerized adaptive testing should not only be assessed in terms of the number of items administered but also the time it takes to complete the test. To this end, a recent study introduced a novel item selection criterion that maximizes Fisher information per unit of expected response time (RT), which was shown to effectively reduce the average completion time for a fixed-length test with minimal decrease in the accuracy of ability estimation. As this method also resulted in extremely unbalanced exposure of items, however, a-stratification with b-blocking was recommended as a means for counterbalancing. Although exceptionally effective in this regard, it comes at substantial costs of attenuating the reduction in average testing time, increasing the variance of testing times, and further decreasing estimation accuracy. Therefore, this article investigated several alternative methods for item exposure control, of which the most promising was a simple modification of maximizing Fisher information per unit of centered expected RT. The key advantage of the proposed method is the flexibility in choosing a centering value according to a desired distribution of testing times and level of exposure control. Moreover, the centered expected RT can be exponentially weighted to calibrate the degree of measurement precision. The results of extensive simulations, with item pools and examinees that are both simulated and real, demonstrate that optimally chosen centering and weighting values can markedly reduce the mean and variance of both testing times and test overlap, all without much compromise in estimation accuracy.
引用
收藏
页码:135 / 158
页数:24
相关论文
共 50 条