An overview on assessing agreement with continuous measurements

Cited by: 241
Authors
Barnhart, Huiman X.
Haber, Michael J.
Lin, Lawrence I.
Affiliations
[1] Duke Univ, Dept Biostat & Bioinformat, Durham, NC 27715 USA
[2] Duke Univ, Duke Clin Res Inst, Durham, NC 27715 USA
[3] Emory Univ, Rollins Sch Publ Hlth, Dept Biostat, Atlanta, GA USA
[4] Baxter Healthcare Co, Div Tech Resources, Round Lake, IL USA
Funding
National Institutes of Health (USA);
Keywords
accuracy; agreement; coefficient of individual agreement; concordance correlation coefficient; coverage probability; generalizability; intraclass correlation coefficient; limits of agreement; method comparison; precision; reliability; repeatability; reproducibility; tolerance interval; total deviation index; validity;
DOI
10.1080/10543400701376480
Chinese Library Classification number
R9 [Pharmacy];
Discipline classification code
1007;
Abstract
Reliable and accurate measurements serve as the basis for evaluation in many scientific disciplines. Issues related to reliable and accurate measurement have evolved over many decades, dating back to the nineteenth century and the pioneering work of Galton (1886), Pearson (1896, 1899, 1901), and Fisher (1925). Requiring a new measurement to be identical to the truth is often impractical, either because (1) we are willing to accept a measurement up to some tolerable (or acceptable) error, or (2) the truth is simply not available to us, either because it is not measurable or is only measurable with some degree of error. To deal with issues related to both (1) and (2), a number of concepts, methods, and theories have been developed in various disciplines. Some of these concepts have been used across disciplines, while others have been limited to a particular field but may have potential uses in other disciplines. In this paper, we elucidate and contrast fundamental concepts employed in different disciplines and unite these concepts into one common theme: assessing closeness (agreement) of observations. We focus on assessing agreement with continuous measurements and classify different statistical approaches as (1) descriptive tools; (2) unscaled summary indices based on absolute differences of measurements; and (3) scaled summary indices attaining values between -1 and 1 for various data structures, and for cases with and without a reference. We also identify gaps that require further research and discuss future directions in assessing agreement.
Pages: 529-569
Page count: 41