Modeling and Evaluating Repeatability and Reproducibility of Ordinal Classifications

被引:23
|
作者
de Mast, Jeroen [1 ]
van Wieringen, Wessel N. [2 ,3 ]
机构
[1] Univ Amsterdam, Inst Business & Ind Stat, NL-1018 TV Amsterdam, Netherlands
[2] Vrije Univ Amsterdam, Med Ctr, Dept Epidemiol & Biostat, NL-1007 MB Amsterdam, Netherlands
[3] Vrije Univ Amsterdam, Dept Math, NL-1081 HV Amsterdam, Netherlands
关键词
Agreement; Attribute data; Categorical data; Concordance; Gauge capability; Gauge repeatability and reproducibility; Item response theory; Measurement system analysis; Ordinal data; MEASUREMENT SYSTEM-ANALYSIS; AGREEMENT; KAPPA;
D O I
10.1198/TECH.2009.08052
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
This paper argues that currently available methods for the assessment of the repeatability and reproducibility of ordinal classifications are not satisfactory. The paper aims to study whether we can modify a class of models from Item Response Theory. well established for the study of the reliability of categorical measurements in psychometrics and education, for use in business and industry, and whether the resulting approaches offer a satisfactory solution. The fitted models can be presented graphically, but also allow the calculation of probabilities of correct ordering and consistent classification. In addition, the model-based approach allows refined diagnostics. giving the user insight into the workings of a classification procedure, which is vital information for a user willing to improve a poor classification procedure. The approach is illustrated from a real-life example, and the proposed analysis is contrasted to two popular alternative analyses, based on Goodman and Kruskal's gamma and Kendall's coefficient of concordance. The datasets and mathematical proofs are available as online supplemental materials.
引用
收藏
页码:94 / 106
页数:13
相关论文
共 50 条
  • [1] Analysis of Repeatability and Reproducibility Studies With Ordinal Measurements
    Culp, Stacey L.
    Ryan, Kenneth J.
    Chen, Juan
    Hamada, Michael S.
    [J]. TECHNOMETRICS, 2018, 60 (04) : 545 - 556
  • [2] A Novel Approach to Evaluate Repeatability and Reproducibility for Ordinal Data
    Deldossi, Laura
    Zappa, Diego
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2014, 43 (04) : 851 - 866
  • [3] Evaluating the Gauge Repeatability and Reproducibility for Different Industries
    Jeh-Nan Pan
    [J]. Quality and Quantity, 2006, 40 : 499 - 518
  • [4] Evaluating the gauge repeatability and reproducibility for different industries
    Pan, Jeh-Nan
    [J]. QUALITY & QUANTITY, 2006, 40 (04) : 499 - 518
  • [5] Statistical Considerations for Evaluating Biofidelity, Repeatability, and Reproducibility of ATDs
    Nusholtz, Guy S.
    Aoun, Zine
    Di Domenico, Laura
    Hsu, Timothy
    Gracian, Manuel A.
    Prado, Jesus A.
    [J]. SAE INTERNATIONAL JOURNAL OF TRANSPORTATION SAFETY, 2013, 1 (01) : 200 - 218
  • [6] REPEATABILITY AND REPRODUCIBILITY
    MANDEL, J
    [J]. MATERIALS RESEARCH AND STANDARDS, 1971, 11 (08): : 8 - &
  • [7] Reproducibility and Repeatability
    Slezak, P.
    Waczulikova, I.
    [J]. PHYSIOLOGICAL RESEARCH, 2011, 60 (01) : 203 - 204
  • [8] REPEATABILITY AND REPRODUCIBILITY
    SCOTT, CG
    [J]. JOURNAL OF LIQUID CHROMATOGRAPHY, 1978, 1 (06): : R9 - R9
  • [10] The repeatability of code defect classifications
    El Emam, K
    Wieczorek, I
    [J]. NINTH INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING, PROCEEDINGS, 1998, : 322 - 333