An R toolbox for score-based measurement invariance tests in IRT models

被引:7
|
作者
Schneider, Lennart [1 ,2 ]
Strobl, Carolin [3 ]
Zeileis, Achim [4 ]
Debelak, Rudolf [3 ,5 ]
机构
[1] Univ Tubingen, Tubingen, Germany
[2] Ludwig Maximilian Univ Munich, Munich, Germany
[3] Univ Zurich, Dept Psychol, Zurich, Switzerland
[4] Univ Innsbruck, Innsbruck, Austria
[5] Univ Leipzig, Inst Psychol, Neumarkt 9, D-04109 Leipzig, Germany
基金
瑞士国家科学基金会;
关键词
Differential item functioning; Item response theory; Software tutorial; ITEM RESPONSE THEORY; PARTIAL CREDIT MODEL; PARAMETER INSTABILITY; LIKELIHOOD ESTIMATION; LATENT ABILITY; RASCH MODEL; COVARIATE; PACKAGE; FRAMEWORK;
D O I
10.3758/s13428-021-01689-0
中图分类号
B841 [心理学研究方法];
学科分类号
040201 ;
摘要
The detection of differential item functioning (DIF) is a central topic in psychometrics and educational measurement. In the past few years, a new family of score-based tests of measurement invariance has been proposed, which allows the detection of DIF along arbitrary person covariates in a variety of item response theory (IRT) models. This paper illustrates the application of these tests within the R system for statistical computing, making them accessible to a broad range of users. This presentation also includes IRT models for which these tests have not previously been investigated, such as the generalized partial credit model. The paper has three goals: First, we review the ideas behind score-based tests of measurement invariance. Second, we describe the implementation of these tests within the R system for statistical computing, which is based on the interaction of the R packages mirt, psychotools and strucchange. Third, we illustrate the application of this software and the interpretation of its output in two empirical datasets. The complete R code for reproducing our results is reported in the paper.
引用
收藏
页码:2101 / 2113
页数:13
相关论文
共 50 条
  • [1] An R toolbox for score-based measurement invariance tests in IRT models
    Lennart Schneider
    Carolin Strobl
    Achim Zeileis
    Rudolf Debelak
    [J]. Behavior Research Methods, 2022, 54 : 2101 - 2113
  • [2] Score-based tests of measurement invariance: use in practice
    Wang, Ting
    Merkle, Edgar C.
    Zeileis, Achim
    [J]. FRONTIERS IN PSYCHOLOGY, 2014, 5
  • [3] Score-based tests for detecting heterogeneity in linear mixed models
    Wang, Ting
    Merkle, Edgar C.
    Anguera, Joaquin A.
    Turner, Brandon M.
    [J]. BEHAVIOR RESEARCH METHODS, 2021, 53 (01) : 216 - 231
  • [4] Score-based tests for detecting heterogeneity in linear mixed models
    Ting Wang
    Edgar C. Merkle
    Joaquin A. Anguera
    Brandon M. Turner
    [J]. Behavior Research Methods, 2021, 53 : 216 - 231
  • [5] Fit Indices for Measurement Invariance Tests in the Thurstonian IRT Model
    Lee, HyeSun
    Smith, Weldon Z.
    [J]. APPLIED PSYCHOLOGICAL MEASUREMENT, 2020, 44 (04) : 282 - 295
  • [6] Score-based Tests for Item Factor Analysis
    Wang, Ting
    [J]. MULTIVARIATE BEHAVIORAL RESEARCH, 2015, 50 (06) : 728 - 729
  • [7] Score-based measurement invariance checks for Bayesian maximum-a-posteriori estimates in item response theory
    Debelak, Rudolf
    Pawel, Samuel
    Strobl, Carolin
    Merkle, Edgar C.
    [J]. BRITISH JOURNAL OF MATHEMATICAL & STATISTICAL PSYCHOLOGY, 2022, 75 (03): : 728 - 752
  • [8] Score-based Generative Models with Levy Processes
    Yoon, Eunbi
    Park, Keehun
    Kim, Sungwoong
    Lim, Sungbin
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [9] Score-based diffusion models for accelerated MRI
    Chung, Hyungjin
    Ye, Jong Chul
    [J]. MEDICAL IMAGE ANALYSIS, 2022, 80
  • [10] Score-Based Hypothesis Testing for Unnormalized Models
    Wu, Suya
    Diao, Enmao
    Elkhalil, Khalil
    Ding, Jie
    Tarokh, Vahid
    [J]. IEEE ACCESS, 2022, 10 : 71936 - 71950