Between Always and Never: Evaluating Uncertainty in Radiology Reports Using Natural Language Processing

被引:16
|
作者
Callen, Andrew L. [1 ]
Dupont, Sara M. [2 ]
Price, Adi [3 ]
Laguna, Ben [3 ]
McCoy, David [3 ]
Do, Bao [4 ]
Talbott, Jason [3 ]
Kohli, Marc [3 ]
Narvid, Jared [3 ]
机构
[1] Univ Colorado, Dept Radiol, Anschutz Med Campus, Denver, CO 80045 USA
[2] Sublte Med Inc, Menlo Pk, CA USA
[3] Univ Calif San Francisco, Dept Radiol & Biomed Imaging, San Francisco, CA 94143 USA
[4] Stanford Univ, Med Ctr, Dept Radiol, Stanford, CA 94305 USA
基金
美国国家卫生研究院;
关键词
Diagnostic uncertainty; Natural language processing; MALPRACTICE; INFORMATION; ACCURACY; MEDICINE; ERRORS;
D O I
10.1007/s10278-020-00379-1
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
The ideal radiology report reduces diagnostic uncertainty, while avoiding ambiguity whenever possible. The purpose of this study was to characterize the use of uncertainty terms in radiology reports at a single institution and compare the use of these terms across imaging modalities, anatomic sections, patient characteristics, and radiologist characteristics. We hypothesized that there would be variability among radiologists and between subspecialities within radiology regarding the use of uncertainty terms and that the length of the impression of a report would be a predictor of use of uncertainty terms. Finally, we hypothesized that use of uncertainty terms would often be interpreted by human readers as "hedging." To test these hypotheses, we applied a natural language processing (NLP) algorithm to assess and count the number of uncertainty terms within radiology reports. An algorithm was created to detect usage of a published set of uncertainty terms. All 642,569 radiology report impressions from 171 reporting radiologists were collected from 2011 through 2015. For validation, two radiologists without knowledge of the software algorithm reviewed report impressions and were asked to determine whether the report was "uncertain" or "hedging." The relationship between the presence of 1 or more uncertainty terms and the human readers' assessment was compared. There were significant differences in the proportion of reports containing uncertainty terms across patient admission status and across anatomic imaging subsections. Reports with uncertainty were significantly longer than those without, although report length was not significantly different between subspecialities or modalities. There were no significant differences in rates of uncertainty when comparing the experience of the attending radiologist. When compared with reader 1 as a gold standard, accuracy was 0.91, sensitivity was 0.92, specificity was 0.9, and precision was 0.88, with an F1-score of 0.9. When compared with reader 2, accuracy was 0.84, sensitivity was 0.88, specificity was 0.82, and precision was 0.68, with an F1-score of 0.77. Substantial variability exists among radiologists and subspecialities regarding the use of uncertainty terms, and this variability cannot be explained by years of radiologist experience or differences in proportions of specific modalities. Furthermore, detection of uncertainty terms demonstrates good test characteristics for predicting human readers' assessment of uncertainty.
引用
收藏
页码:1194 / 1201
页数:8
相关论文
共 50 条
  • [41] Characterization of Change and Significance for Clinical Findings in Radiology Reports Through Natural Language Processing
    Saeed Hassanpour
    Graham Bay
    Curtis P. Langlotz
    Journal of Digital Imaging, 2017, 30 : 314 - 322
  • [42] Natural language processing of radiology reports for identification of skeletal site-specific fractures
    Wang, Yanshan
    Mehrabi, Saeed
    Sohn, Sunghwan
    Atkinson, Elizabeth J.
    Amin, Shreyasee
    Liu, Hongfang
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2019, 19 (Suppl 3)
  • [43] Qualifying Certainty in Radiology Reports through Deep Learning?Based Natural Language Processing
    Liu, F.
    Zhou, P.
    Baccei, S. J.
    Masciocchi, M. J.
    Amornsiripanitch, N.
    Kiefe, C., I
    Rosen, M. P.
    AMERICAN JOURNAL OF NEURORADIOLOGY, 2021, 42 (10) : 1755 - 1761
  • [44] Characterization of Change and Significance for Clinical Findings in Radiology Reports Through Natural Language Processing
    Hassanpour, Saeed
    Bay, Graham
    Langlotz, Curtis P.
    JOURNAL OF DIGITAL IMAGING, 2017, 30 (03) : 314 - 322
  • [45] The reporting quality of natural language processing studies: systematic review of studies of radiology reports
    Emma M. Davidson
    Michael T. C. Poon
    Arlene Casey
    Andreas Grivas
    Daniel Duma
    Hang Dong
    Víctor Suárez-Paniagua
    Claire Grover
    Richard Tobin
    Heather Whalley
    Honghan Wu
    Beatrice Alex
    William Whiteley
    BMC Medical Imaging, 21
  • [46] A Preliminary Study of Extracting Pulmonary Nodules and Nodule Characteristics from Radiology Reports Using Natural Language Processing
    Yang, Shuang
    Yang, Xi
    Lyu, Tianchen
    He, Xing
    Braithwaite, Dejana
    Mehta, Hiren J.
    Guo, Yi
    Wu, Yonghui
    Bian, Jiang
    2022 IEEE 10TH INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI 2022), 2022, : 618 - 619
  • [47] Natural Language Processing Techniques for Extracting and Categorizing Finding Measurements in Narrative Radiology Reports
    Sevenster, M.
    Buurman, J.
    Liu, P.
    Peters, J. F.
    Chang, P. J.
    APPLIED CLINICAL INFORMATICS, 2015, 6 (03): : 600 - 610
  • [48] Natural language processing of radiology reports for identification of skeletal site-specific fractures
    Yanshan Wang
    Saeed Mehrabi
    Sunghwan Sohn
    Elizabeth J. Atkinson
    Shreyasee Amin
    Hongfang Liu
    BMC Medical Informatics and Decision Making, 19
  • [49] The reporting quality of natural language processing studies: systematic review of studies of radiology reports
    Davidson, Emma M.
    Poon, Michael T. C.
    Casey, Arlene
    Grivas, Andreas
    Duma, Daniel
    Dong, Hang
    Suarez-Paniagua, Victor
    Grover, Claire
    Tobin, Richard
    Whalley, Heather
    Wu, Honghan
    Alex, Beatrice
    Whiteley, William
    BMC MEDICAL IMAGING, 2021, 21 (01)
  • [50] Practical Examples of Natural Language Processing in Radiology
    Lacson, Ronilda
    Khorasani, Ramin
    JOURNAL OF THE AMERICAN COLLEGE OF RADIOLOGY, 2011, 8 (12) : 872 - 874