The Accuracy of Computerized Adaptive Testing in Heterogeneous Populations: A Mixture Item-Response Theory Analysis

被引：4

作者：

Sawatzky, Richard ^{[1
,2
]}

Ratner, Pamela A. ^{[3
]}

Kopec, Jacek A. ^{[4
,5
]}

Wu, Amery D. ^{[6
]}

Zumbo, Bruno D. ^{[6
,7
]}

机构：

[1] Trinity Western Univ, Sch Nursing, Langley, BC, Canada

[2] Providence Hlth Care Res Inst, Ctr Hlth Evaluat & Outcomes Sci, Vancouver, BC, Canada

[3] Univ British Columbia, Fac Educ, Vancouver, BC V5Z 1M9, Canada

[4] Univ British Columbia, Sch Populat & Publ Hlth, Vancouver, BC V5Z 1M9, Canada

[5] Arthrit Res Ctr Canada, Vancouver, BC, Canada

[6] Univ British Columbia, Measurement Evaluat & Res Methodol, Vancouver, BC V5Z 1M9, Canada

[7] Univ British Columbia, Vancouver, BC V5Z 1M9, Canada

来源：

PLOS ONE | 2016年 / 11卷 / 03期

关键词：

OUTCOMES MEASUREMENT; MODEL-SELECTION; SHORT-FORMS; PERFORMANCE; INSTRUMENTS; VALIDATION; LIKELIHOOD; REGRESSION; VALIDITY; BANKING;

D O I：

10.1371/journal.pone.0150563

中图分类号：

O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Background Computerized adaptive testing (CAT) utilizes latent variable measurement model parameters that are typically assumed to be equivalently applicable to all people. Biased latent variable scores may be obtained in samples that are heterogeneous with respect to a specified measurement model. We examined the implications of sample heterogeneity with respect to CAT-predicted patient-reported outcomes (PRO) scores for the measurement of pain. Methods A latent variable mixture modeling (LVMM) analysis was conducted using data collected from a heterogeneous sample of people in British Columbia, Canada, who were administered the 36 pain domain items of the CAT-5D-QOL. The fitted LVMM was then used to produce data for a simulation analysis. We evaluated bias by comparing the referent PRO scores of the LVMM with PRO scores predicted by a "conventional" CAT (ignoring heterogeneity) and a LVMM-based "mixture" CAT (accommodating heterogeneity). Results The LVMM analysis indicated support for three latent classes with class proportions of 0.25, 0.30 and 0.45, which suggests that the sample was heterogeneous. The simulation analyses revealed differences between the referent PRO scores and the PRO scores produced by the "conventional" CAT. The "mixture" CAT produced PRO scores that were nearly equivalent to the referent scores. Conclusion Bias in PRO scores based on latent variable models may result when population heterogeneity is ignored. Improved accuracy could be obtained by using CATs that are parameterized using LVMM.

引用

页数：16

共 50 条

[41] Detecting uniform differential item functioning for continuous response computerized adaptive testing
Wang, Chun
Zhu, Ruoyi
APPLIED PSYCHOLOGICAL MEASUREMENT, 2024, 48 (1-2) : 18 - 37
[42] Item Pocket Method to Allow Response Review and Change in Computerized Adaptive Testing
Han, Kyung T.
APPLIED PSYCHOLOGICAL MEASUREMENT, 2013, 37 (04) : 259 - 275
[43] Psychometric properties of the Epworth Sleepiness Scale: A factor analysis and item-response theory approach
Pilcher, June J.
Switzer, Fred S., III
Munc, Alec
Donnelly, Janet
Jellen, Julia C.
Lamm, Claus
CHRONOBIOLOGY INTERNATIONAL, 2018, 35 (04) : 533 - 545
[44] Scaling MacDonald's AT-20 using item-response theory
Lange, R
Houran, J
PERSONALITY AND INDIVIDUAL DIFFERENCES, 1999, 26 (03) : 467 - 475
[45] THE USE OF ITEM-RESPONSE THEORY IN SOCIAL-WORK MEASUREMENT AND RESEARCH
NUGENT, WR
HANKINS, JA
SOCIAL SERVICE REVIEW, 1989, 63 (03) : 447 - 473
[46] Evaluation of adding item-response theory analysis for evaluation of the European Board of Ophthalmology Diploma examination
Mathysen, Danny G. P.
Aclimandos, Wagih
Roelant, Ella
Wouters, Kristien
Creuzot-Garcher, Catherine
Ringens, Peter J.
Hawlina, Marko
Tassignon, Marie-Jose
ACTA OPHTHALMOLOGICA, 2013, 91 (07) : E573 - E577
[47] Predicting item exposure parameters in computerized adaptive testing
Chen, Shu-Ying
Doong, Shing-Hwang
BRITISH JOURNAL OF MATHEMATICAL & STATISTICAL PSYCHOLOGY, 2008, 61 : 75 - 91
[48] Computerized adaptive testing with decision regression trees: an alternative to item response theory for quality of life measurement in multiple sclerosis
Michel, Pierre
Baumstarck, Karine
Loundou, Anderson
Ghattas, Badih
Auquier, Pascal
Boyer, Laurent
PATIENT PREFERENCE AND ADHERENCE, 2018, 12 : 1043 - 1053
[49] Components of the item selection algorithm in computerized adaptive testing
Han, Kyung Tyek
JOURNAL OF EDUCATIONAL EVALUATION FOR HEALTH PROFESSIONS, 2018, 15 : 7
[50] Item calibration error in Computerized Adaptive Testing (CAT)
Yousfi, Safir
INTERNATIONAL JOURNAL OF PSYCHOLOGY, 2008, 43 (3-4) : 620 - 620

← 1 2 3 4 5 →