The Accuracy of Computerized Adaptive Testing in Heterogeneous Populations: A Mixture Item-Response Theory Analysis

被引:4
|
作者
Sawatzky, Richard [1 ,2 ]
Ratner, Pamela A. [3 ]
Kopec, Jacek A. [4 ,5 ]
Wu, Amery D. [6 ]
Zumbo, Bruno D. [6 ,7 ]
机构
[1] Trinity Western Univ, Sch Nursing, Langley, BC, Canada
[2] Providence Hlth Care Res Inst, Ctr Hlth Evaluat & Outcomes Sci, Vancouver, BC, Canada
[3] Univ British Columbia, Fac Educ, Vancouver, BC V5Z 1M9, Canada
[4] Univ British Columbia, Sch Populat & Publ Hlth, Vancouver, BC V5Z 1M9, Canada
[5] Arthrit Res Ctr Canada, Vancouver, BC, Canada
[6] Univ British Columbia, Measurement Evaluat & Res Methodol, Vancouver, BC V5Z 1M9, Canada
[7] Univ British Columbia, Vancouver, BC V5Z 1M9, Canada
来源
PLOS ONE | 2016年 / 11卷 / 03期
关键词
OUTCOMES MEASUREMENT; MODEL-SELECTION; SHORT-FORMS; PERFORMANCE; INSTRUMENTS; VALIDATION; LIKELIHOOD; REGRESSION; VALIDITY; BANKING;
D O I
10.1371/journal.pone.0150563
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background Computerized adaptive testing (CAT) utilizes latent variable measurement model parameters that are typically assumed to be equivalently applicable to all people. Biased latent variable scores may be obtained in samples that are heterogeneous with respect to a specified measurement model. We examined the implications of sample heterogeneity with respect to CAT-predicted patient-reported outcomes (PRO) scores for the measurement of pain. Methods A latent variable mixture modeling (LVMM) analysis was conducted using data collected from a heterogeneous sample of people in British Columbia, Canada, who were administered the 36 pain domain items of the CAT-5D-QOL. The fitted LVMM was then used to produce data for a simulation analysis. We evaluated bias by comparing the referent PRO scores of the LVMM with PRO scores predicted by a "conventional" CAT (ignoring heterogeneity) and a LVMM-based "mixture" CAT (accommodating heterogeneity). Results The LVMM analysis indicated support for three latent classes with class proportions of 0.25, 0.30 and 0.45, which suggests that the sample was heterogeneous. The simulation analyses revealed differences between the referent PRO scores and the PRO scores produced by the "conventional" CAT. The "mixture" CAT produced PRO scores that were nearly equivalent to the referent scores. Conclusion Bias in PRO scores based on latent variable models may result when population heterogeneity is ignored. Improved accuracy could be obtained by using CATs that are parameterized using LVMM.
引用
收藏
页数:16
相关论文
共 50 条