The Accuracy of Computerized Adaptive Testing in Heterogeneous Populations: A Mixture Item-Response Theory Analysis

Cited by: 4
Authors
Sawatzky, Richard [1 ,2 ]
Ratner, Pamela A. [3 ]
Kopec, Jacek A. [4 ,5 ]
Wu, Amery D. [6 ]
Zumbo, Bruno D. [6 ,7 ]
Affiliations
[1] Trinity Western Univ, Sch Nursing, Langley, BC, Canada
[2] Providence Hlth Care Res Inst, Ctr Hlth Evaluat & Outcomes Sci, Vancouver, BC, Canada
[3] Univ British Columbia, Fac Educ, Vancouver, BC V5Z 1M9, Canada
[4] Univ British Columbia, Sch Populat & Publ Hlth, Vancouver, BC V5Z 1M9, Canada
[5] Arthrit Res Ctr Canada, Vancouver, BC, Canada
[6] Univ British Columbia, Measurement Evaluat & Res Methodol, Vancouver, BC V5Z 1M9, Canada
[7] Univ British Columbia, Vancouver, BC V5Z 1M9, Canada
Source
PLOS ONE | 2016 / Volume 11 / Issue 03
Keywords
OUTCOMES MEASUREMENT; MODEL-SELECTION; SHORT-FORMS; PERFORMANCE; INSTRUMENTS; VALIDATION; LIKELIHOOD; REGRESSION; VALIDITY; BANKING;
DOI
10.1371/journal.pone.0150563
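Chinese Library Classification
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences]
Discipline classification codes
07; 0710; 09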
Abstract
Background: Computerized adaptive testing (CAT) uses latent variable measurement model parameters that are typically assumed to apply equally to all people. Biased latent variable scores may be obtained in samples that are heterogeneous with respect to a specified measurement model. We examined the implications of sample heterogeneity with respect to CAT-predicted patient-reported outcomes (PRO) scores for the measurement of pain.
Methods: A latent variable mixture modeling (LVMM) analysis was conducted using data collected from a heterogeneous sample of people in British Columbia, Canada, who were administered the 36 pain domain items of the CAT-5D-QOL. The fitted LVMM was then used to produce data for a simulation analysis. We evaluated bias by comparing the referent PRO scores of the LVMM with PRO scores predicted by a "conventional" CAT (ignoring heterogeneity) and an LVMM-based "mixture" CAT (accommodating heterogeneity).
Results: The LVMM analysis indicated support for three latent classes with class proportions of 0.25, 0.30 and 0.45, which suggests that the sample was heterogeneous. The simulation analyses revealed differences between the referent PRO scores and the PRO scores produced by the "conventional" CAT. The "mixture" CAT produced PRO scores that were nearly equivalent to the referent scores.
Conclusion: Bias in PRO scores based on latent variable models may result when population heterogeneity is ignored. Improved accuracy could be obtained by using CATs that are parameterized using LVMM.
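The contrast between scoring that ignores latent classes and scoring that accommodates them can be illustrated with a small simulation. The Python sketch below is not the authors' procedure. It assumes a dichotomous two-parameter logistic model as a stand-in for the model fitted in the study, uses invented item parameters, scores every item rather than selecting items adaptively, and builds the "conventional" parameters as a proportion-weighted average across classes. Only the three class proportions come from the abstract. Scores are compared against the simulated latent value, not against LVMM referent scores.

```python
import math
import random

# Class proportions are taken from the abstract; everything else is hypothetical.
CLASS_PROPS = [0.25, 0.30, 0.45]
N_ITEMS = 12
GRID = [-4 + 0.1 * k for k in range(81)]  # quadrature grid for theta


def make_item_params(rng):
    """Draw invented class-specific (discrimination, difficulty) pairs."""
    return [[(rng.uniform(0.8, 2.0), rng.uniform(-2.0, 2.0))
             for _ in range(N_ITEMS)] for _ in CLASS_PROPS]


def p_endorse(theta, a, b):
    """2PL probability of endorsing an item."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))


def likelihood(theta, responses, items):
    """Likelihood of a response pattern at theta under one parameter set."""
    lik = 1.0
    for x, (a, b) in zip(responses, items):
        p = p_endorse(theta, a, b)
        lik *= p if x == 1 else 1.0 - p
    return lik


def prior(theta):
    """Unnormalized standard normal density; the constant cancels in EAP."""
    return math.exp(-0.5 * theta * theta)


def eap_single(responses, items):
    """EAP score assuming one parameter set applies to everyone."""
    w = [likelihood(t, responses, items) * prior(t) for t in GRID]
    return sum(t * wi for t, wi in zip(GRID, w)) / sum(w)


def eap_mixture(responses, params):
    """EAP score that marginalizes over latent class membership."""
    num = den = 0.0
    for pi_c, items in zip(CLASS_PROPS, params):
        w = [likelihood(t, responses, items) * prior(t) for t in GRID]
        num += pi_c * sum(t * wi for t, wi in zip(GRID, w))
        den += pi_c * sum(w)
    return num / den


def simulate(n_people, seed=0):
    """Return (RMSE of single-class scoring, RMSE of mixture scoring)."""
    rng = random.Random(seed)
    params = make_item_params(rng)
    # Assumption: pooled parameters are a proportion-weighted class average.
    pooled = [(sum(p * params[c][j][0] for c, p in enumerate(CLASS_PROPS)),
               sum(p * params[c][j][1] for c, p in enumerate(CLASS_PROPS)))
              for j in range(N_ITEMS)]
    se_conv = se_mix = 0.0
    for _ in range(n_people):
        c = rng.choices(range(len(CLASS_PROPS)), weights=CLASS_PROPS)[0]
        theta = rng.gauss(0, 1)
        x = [1 if rng.random() < p_endorse(theta, a, b) else 0
             for a, b in params[c]]
        se_conv += (eap_single(x, pooled) - theta) ** 2
        se_mix += (eap_mixture(x, params) - theta) ** 2
    return math.sqrt(se_conv / n_people), math.sqrt(se_mix / n_people)


if __name__ == "__main__":
    rmse_conv, rmse_mix = simulate(500)
    print(f"single-class RMSE: {rmse_conv:.3f}  mixture RMSE: {rmse_mix:.3f}")
```

When every class shares the same item parameters, `eap_mixture` reduces to `eap_single`, which is a quick sanity check on the implementation. How far the two scores diverge in practice depends on how different the class-specific parameters are, which this sketch sets arbitrarily.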
Pages: 16