Estimating the agreement and diagnostic accuracy of two diagnostic tests when one test is conducted on only a subsample of specimens

被引：21

作者：

Katki, Hormuzd A. ^{[1
]}

Li, Yan ^{[2
]}

Edelstein, David W. ^{[3
]}

Castle, Philip E. ^{[4
]}

机构：

[1] NCI, Div Canc Epidemiol & Genet, Rockville, MD USA

[2] Univ Texas Arlington, Dept Math, Arlington, TX 76019 USA

[3] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA

[4] Amer Soc Clin Pathologists, Washington, DC USA

来源：

STATISTICS IN MEDICINE | 2012年 / 31卷 / 05期

关键词：

verification bias; symmetry test; kappa; two-phase design; HPV; sensitivity; specificity; gold standard; DOUBLE SAMPLING SCHEME; DISEASE VERIFICATION; GOLD STANDARD; BINOMIAL DATA; SENSITIVITY; SPECIFICITY; DESIGNS; 2-STAGE; ERROR; BIAS;

D O I：

10.1002/sim.4422

中图分类号：

Q [生物科学];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

We focus on the efficient usage of specimen repositories for the evaluation of new diagnostic tests and for comparing new tests with existing tests. Typically, all pre-existing diagnostic tests will already have been conducted on all specimens. However, we propose retesting only a judicious subsample of the specimens by the new diagnostic test. Subsampling minimizes study costs and specimen consumption, yet estimates of agreement or diagnostic accuracy potentially retain adequate statistical efficiency. We introduce methods to estimate agreement statistics and conduct symmetry tests when the second test is conducted on only a subsample and no gold standard exists. The methods treat the subsample as a stratified two-phase sample and use inverse-probability weighting. Strata can be any information available on all specimens and can be used to oversample the most informative specimens. The verification bias framework applies if the test conducted on only the subsample is a gold standard. We also present inverse-probability-weighting-based estimators of diagnostic accuracy that take advantage of stratification. We present three examples demonstrating that adequate statistical efficiency can be achieved under subsampling while greatly reducing the number of specimens requiring retesting. Naively using standard estimators that ignore subsampling can lead to drastically misleading estimates. Through simulation, we assess the finite-sample properties of our estimators and consider other possible sampling designs for our examples that could have further improved statistical efficiency. To help promote subsampling designs, our R package CompareTests computes all of our agreement and diagnostic accuracy statistics. Copyright (c) 2011 John Wiley & Sons, Ltd.

引用

页码：436 / 448

页数：13

共 50 条

[1] Estimating and comparing diagnostic tests' accuracy when the gold standard is not binary
Obuchowski, NA
ACADEMIC RADIOLOGY, 2005, 12 (09) : 1198 - 1204
[2] Agreement between two diagnostic tests when accounting for test-retest variation: application to FFR versus iFR
Zhu, Hongjian
Lai, Dejian
Johnson, Nils P.
JOURNAL OF APPLIED STATISTICS, 2016, 43 (09) : 1673 - 1689
[3] Assessing the gain in diagnostic performance when combining two diagnostic tests
Macaskill, P
Walter, SD
Irwig, L
Franco, EL
STATISTICS IN MEDICINE, 2002, 21 (17) : 2527 - 2546
[4] On implementation of the Gibbs sampler for estimating the accuracy of multiple diagnostic tests
Principato, Fabio
Vullo, Angela
Matranga, Domenica
JOURNAL OF APPLIED STATISTICS, 2010, 37 (08) : 1335 - 1354
[5] Comparing two diagnostic tests when two tests are applied to same patients and test scores are given in categories
Aoyama, Yoshiko
Murotani, Kenta
Yanagawa, Takashi
Nagata, Shuji
SANKHYA-SERIES B-APPLIED AND INTERDISCIPLINARY STATISTICS, 2012, 74 (01): : 44 - 55
[6] Reliability, agreement, and diagnostic accuracy of the Modified Lateral Scapular Slide test
Shadmehr, A.
Sarafraz, H.
Blooki, M. Heidari
Jalaie, S. H.
Morais, N.
MANUAL THERAPY, 2016, 24 : 18 - 24
[7] ESTIMATING THE COMPARATIVE ACCURACY OF DIAGNOSTIC TESTS: AN EXAMPLE USING TYPHOID FEVER
Arora, P.
Thorlund, K.
Brenner, D. R.
Andrews, J. R.
VALUE IN HEALTH, 2017, 20 (09) : A574 - A575
[8] Estimating diagnostic accuracy of multiple binary tests with an imperfect reference standard
Albert, Paul S.
STATISTICS IN MEDICINE, 2009, 28 (05) : 780 - 797
[9] The effect of warming specimens of rapid urease test on its diagnostic accuracy
Nasiri, Jafar
Allandin, Arshya
Imani, Reza
Kheiri, Soleiman
Khoshdel, Abolfazl
REVISTA LATINOAMERICANA DE HIPERTENSION, 2019, 14 (01): : 15 - +
[10] ROC CURVES, TEST ACCURACY, AND THE DESCRIPTION OF DIAGNOSTIC-TESTS
MOSSMAN, D
SOMOZA, E
JOURNAL OF NEUROPSYCHIATRY AND CLINICAL NEUROSCIENCES, 1991, 3 (03) : 330 - 333

← 1 2 3 4 5 →