Comparison of confidence interval methods for an intra-class correlation coefficient (ICC)

被引:45
|
作者
Ionan, Alexei C. [1 ]
Polley, Mei-Yin C. [2 ]
McShane, Lisa M. [2 ]
Dobbin, Kevin K. [3 ]
机构
[1] Univ Georgia, Dept Stat, Athens, GA 30602 USA
[2] NCI, Biometr Res Branch, Rockville, MD USA
[3] Univ Georgia, Dept Epidemiol & Biostat, Athens, GA 30602 USA
关键词
Confidence interval; Credible interval; Generalized confidence interval; Intraclass correlation coefficient; Modified large sample; ASSESSING AGREEMENT; VARIANCE; BAYES; SAMPLE; MODELS;
D O I
10.1186/1471-2288-14-121
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: The intraclass correlation coefficient (ICC) is widely used in biomedical research to assess the reproducibility of measurements between raters, labs, technicians, or devices. For example, in an inter-rater reliability study, a high ICC value means that noise variability (between-raters and within-raters) is small relative to variability from patient to patient. A confidence interval or Bayesian credible interval for the ICC is a commonly reported summary. Such intervals can be constructed employing either frequentist or Bayesian methodologies. Methods: This study examines the performance of three different methods for constructing an interval in a two-way, crossed, random effects model without interaction: the Generalized Confidence Interval method (GCI), the Modified Large Sample method (MLS), and a Bayesian method based on a noninformative prior distribution (NIB). Guidance is provided on interval construction method selection based on study design, sample size, and normality of the data. We compare the coverage probabilities and widths of the different interval methods. Results: We show that, for the two-way, crossed, random effects model without interaction, care is needed in interval method selection because the interval estimates do not always have properties that the user expects. While different methods generally perform well when there are a large number of levels of each factor, large differences between the methods emerge when the number of one or more factors is limited. In addition, all methods are shown to lack robustness to certain hard-to-detect violations of normality when the sample size is limited. Conclusions: Decision rules and software programs for interval construction are provided for practical implementation in the two-way, crossed, random effects model without interaction. All interval methods perform similarly when the data are normal and there are sufficient numbers of levels of each factor. The MLS and GCI methods outperform the NIB when one of the factors has a limited number of levels and the data are normally distributed or nearly normally distributed. None of the methods work well if the number of levels of a factor are limited and data are markedly non-normal. The software programs are implemented in the popular R language.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Comparison of confidence interval methods for an intra-class correlation coefficient (ICC)
    Alexei C Ionan
    Mei-Yin C Polley
    Lisa M McShane
    Kevin K Dobbin
    [J]. BMC Medical Research Methodology, 14
  • [2] Sample size methods for constructing confidence intervals for the intra-class correlation coefficient
    Dobbin, Kevin K.
    Ionan, Alexei C.
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2015, 85 : 67 - 83
  • [3] On intra-class correlation coefficient estimation
    Nabendu Pal
    Wooi K. Lim
    [J]. Statistical Papers, 2004, 45 : 369 - 392
  • [4] On intra-class correlation coefficient estimation
    Pal, N
    Lim, WK
    [J]. STATISTICAL PAPERS, 2004, 45 (03) : 369 - 392
  • [5] Asymptotic Confidence Interval, Sample Size Formulas and Comparison Test for the Agreement Intra-Class Correlation Coefficient in Inter-Rater Reliability Studies
    Bourredjem, Abderrahmane
    Cardot, Herve
    Devilliers, Herve
    [J]. STATISTICS IN MEDICINE, 2024,
  • [6] Measuring fMRI reliability with the intra-class correlation coefficient
    Caceres, Alejandro
    Hall, Deanna L.
    Zelaya, Fernando O.
    Williams, Steven C. R.
    Mehta, Mitul A.
    [J]. NEUROIMAGE, 2009, 45 (03) : 758 - 768
  • [7] A COMPARISON OF CONFIDENCE-INTERVAL METHODS FOR THE INTRACLASS CORRELATION-COEFFICIENT
    DONNER, A
    WELLS, G
    [J]. BIOMETRICS, 1986, 42 (02) : 401 - 412
  • [8] A Cross-National Comparison of Intra-Class Correlation Coefficient in Educational Achievement Outcomes
    Zopluoglu, Cengiz
    [J]. JOURNAL OF MEASUREMENT AND EVALUATION IN EDUCATION AND PSYCHOLOGY-EPOD, 2012, 3 (01): : 242 - 278
  • [9] MARGINAL LIKELIHOOD INFERENCE FOR THE INTRA-CLASS CORRELATION-COEFFICIENT
    HAQ, MS
    NG, VM
    [J]. COMMUNICATIONS IN STATISTICS PART A-THEORY AND METHODS, 1979, 8 (02): : 179 - 189
  • [10] A Comparison of Confidence Interval Methods for the Concordance Correlation Coefficient and Intraclass Correlation Coefficient with Small Number of Raters
    Feng, Dai
    Svetnik, Vladimir
    Coimbra, Alexandre
    Baumgartner, Richard
    [J]. JOURNAL OF BIOPHARMACEUTICAL STATISTICS, 2014, 24 (02) : 272 - 293