An Evaluation of Consensus Techniques for Diagnostic Interpretation

被引:0
|
作者
Sauter, Jake N. [1 ]
LaBarre, Victoria M. [2 ]
Furst, Jacob D. [3 ]
Raicu, Daniela S. [3 ]
机构
[1] SUNY Coll Oswego, 7060 Route 104, Oswego, NY USA
[2] McLennan Community Coll, 140 Coll Dr, Waco, TX USA
[3] Coll Comp & Digital Media, 243 South Wabash Ave, Chicago, IL USA
基金
美国国家科学基金会;
关键词
Belief Decision Tree; LIDC; Leverage Label Variability; PULMONARY NODULES; LUNG;
D O I
10.1117/12.2293778
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
Learning diagnostic labels from image content has been the standard in computer-aided diagnosis. Most computer-aided diagnosis systems use low-level image features extracted directly from image content to train and test machine learning classifiers for diagnostic label prediction. When the ground truth for the diagnostic labels is not available, reference truth is generated from the experts diagnostic interpretations of the image/region of interest. More specifically, when the label is uncertain, e.g. when multiple experts label an image and their interpretations are different, techniques to handle the label variability are necessary. In this paper, we compare three consensus techniques that are typically used to encode the variability in the experts labeling of the medical data: mean, median and mode, and their effects on simple classifiers that can handle deterministic labels (decision trees) and probabilistic vectors of labels (belief decision trees). Given that the NIH/NCI Lung Image Database Consortium (LIDC) data provides interpretations for lung nodules by up to four radiologists, we leverage the LIDC data to evaluate and compare these consensus approaches when creating computer-aided diagnosis systems for lung nodules. First, low-level image features of nodules are extracted and paired with their radiologists semantic ratings (1= most likely benign, 5 = most likely malignant); second, machine learning multi-class classifiers that handle deterministic labels (decision trees) and probabilistic vectors of labels (belief decision trees) are built to predict the lung nodules semantic ratings. We show that the mean-based consensus generates the most robust classifier overall when compared to the median- and mode-based consensus. Lastly, the results of this study show that, when building CAD systems with uncertain diagnostic interpretation, it is important to evaluate different strategies for encoding and predicting the diagnostic label.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] Role of imaging techniques in the diagnostic evaluation of nipple discharge
    Brandon, CJ
    Youree, CC
    Sewell, CL
    Hall, D
    RADIOLOGY, 1996, 201 : 18 - 18
  • [32] EVALUATION OF DIAGNOSTIC-TECHNIQUES FOR NATURAL PORCINE SCHISTOSOMIASIS
    AGRAWAL, MC
    PANESAR, N
    INDIAN VETERINARY JOURNAL, 1987, 64 (11): : 918 - 920
  • [33] Evaluation of the diagnostic techniques for Oral Lichen Planus.
    Paparella, ML
    Masquijo, P
    Brandizzi, D
    Diaz, J
    Aguas, S
    Keszler, A
    Itoiz, ME
    Cabrini, RL
    Lanfranchi, HE
    JOURNAL OF DENTAL RESEARCH, 2003, 82 : 39 - 39
  • [34] LUNG-CANCER DIAGNOSIS - EVALUATION OF DIAGNOSTIC TECHNIQUES
    HEALY, TM
    BORRIE, J
    NEW ZEALAND MEDICAL JOURNAL, 1975, 81 (539) : 423 - 424
  • [35] Impact of Consensus Conference Review on Diagnostic Disagreements in the Evaluation of Cervical Biopsies
    Layfield, Lester
    Hammer, Richard D.
    Frazier, Shellaine R.
    Esebua, Magda
    Bivin, William W.
    Laziuk, Katsiaryna
    Nguyen, Van T.
    Johannesen, Eric
    Schmidt, Robert L.
    MODERN PATHOLOGY, 2017, 30 : 510A - 510A
  • [36] Helical CT evaluation of the spleen: Techniques, applications, and pitfalls of interpretation
    Urban, BA
    Fishman, EK
    RADIOLOGY, 1996, 201 : 556 - 556
  • [38] Non diagnostic information and the effects of context evaluation in the moderation of False Consensus
    Verlhiac, JF
    SWISS JOURNAL OF PSYCHOLOGY, 1999, 58 (01): : 12 - 21
  • [39] Impact of Consensus Conference Review on Diagnostic Disagreements in the Evaluation of Cervical Biopsies
    Layfield, Lester
    Hammer, Richard D.
    Frazier, Shellaine R.
    Esebua, Magda
    Bivin, William W.
    Laziuk, Katsiaryna
    Nguyen, Van T.
    Johannesen, Eric
    Schmidt, Robert L.
    LABORATORY INVESTIGATION, 2017, 97 : 510A - 510A
  • [40] INTERPRETATION TECHNIQUES
    KLINT, P
    SOFTWARE-PRACTICE & EXPERIENCE, 1981, 11 (09): : 963 - 973