Qualifying Certainty in Radiology Reports through Deep Learning?Based Natural Language Processing

被引:10
|
作者
Liu, F. [1 ,2 ]
Zhou, P. [2 ]
Baccei, S. J. [2 ,3 ]
Masciocchi, M. J. [2 ,3 ]
Amornsiripanitch, N. [2 ,3 ]
Kiefe, C., I [1 ]
Rosen, M. P. [2 ,3 ]
机构
[1] Univ Massachusetts, Sch Med, Dept Populat & Quantitat Hlth Sci, Worcester, MA 01655 USA
[2] Univ Massachusetts, Sch Med, Dept Radiol, Room S2-824,55 Lake Ave North, Worcester, MA 01655 USA
[3] UMass Mem Med Ctr, Dept Radiol, Worcester, MA USA
关键词
BIOMEDICAL TEXT; PHYSICIANS;
D O I
10.3174/ajnr.A7241
中图分类号
R74 [神经病学与精神病学];
学科分类号
摘要
BACKGROUND AND PURPOSE: Communication gaps exist between radiologists and referring physicians in conveying diagnostic certainty. We aimed to explore deep learning?based bidirectional contextual language models for automatically assessing diagnostic certainty expressed in the radiology reports to facilitate the precision of communication. MATERIALS AND METHODS: We randomly sampled 594 head MR imaging reports from an academic medical center. We asked 3 board-certified radiologists to read sentences from the Impression section and assign each sentence 1 of the 4 certainty categories: ?Non-Definitive,? ?Definitive-Mild,? ?Definitive-Strong,? ?Other.? Using the annotated 2352 sentences, we developed and validated a natural language-processing system based on the start-of-the-art bidirectional encoder representations from transformers (BERT), which can capture contextual uncertainty semantics beyond the lexicon level. Finally, we evaluated 3 BERT variant models and reported standard metrics including sensitivity, specificity, and area under the curve. RESULTS: A ? score of 0.74 was achieved for interannotator agreement on uncertainty interpretations among 3 radiologists. For the 3 BERT variant models, the biomedical variant (BioBERT) achieved the best macro-average area under the curve of 0.931 (compared with 0.928 for the BERT-base and 0.925 for the clinical variant [ClinicalBERT]) on the validation data. All 3 models yielded high macro-average specificity (93.13%?93.65%), while the BERT-base obtained the highest macro-average sensitivity of 79.46% (compared with 79.08% for BioBERT and 78.52% for ClinicalBERT). The BioBERT model showed great generalizability on the heldout test data with a macro-average sensitivity of 77.29%, specificity of 92.89%, and area under the curve of 0.93. CONCLUSIONS: A deep transfer learning model can be developed to reliably assess the level of uncertainty communicated in a radiology report.
引用
收藏
页码:1755 / 1761
页数:7
相关论文
共 50 条
  • [1] Machine learning based natural language processing of radiology reports in orthopaedic trauma
    Olthof, A. W.
    Shouche, P.
    Fennema, E. M.
    IJpma, F. F. A.
    Koolstra, R. H. C.
    Stirler, V. M. A.
    van Ooijen, P. M. A.
    Cornelissen, L. J.
    [J]. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2021, 208
  • [2] Application of Natural Language Processing and Machine Learning to Radiology Reports
    Jeon, Seoungdeok
    Colburn, Zachary
    Sakai, Joshua
    Hung, Ling-Hong
    Yeung, Ka Yee
    [J]. 12TH ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS (ACM-BCB 2021), 2021,
  • [3] Deep Learning-based Assessment of Oncologic Outcomes from Natural Language Processing of Structured Radiology Reports
    Fink, Matthias A.
    Kades, Klaus
    Bischoff, Arved
    Moll, Martin
    Schnell, Merle
    Kuechler, Maike
    Koehler, Gregor
    Sellner, Jan
    Heussel, Claus Peter
    Kauczor, Hans-Ulrich
    Schlemmer, Heinz-Peter
    Maier-Hein, Klaus
    Weber, Tim F.
    Kleesiek, Jens
    [J]. RADIOLOGY-ARTIFICIAL INTELLIGENCE, 2022, 4 (05)
  • [4] Classification of Diagnostic Certainty in Radiology Reports with Deep Learning
    Sugimoto, Kento
    Wada, Shoya
    Konishi, Shozo
    Okada, Katsuki
    Manabe, Shirou
    Matsumura, Yasushi
    Takeda, Toshihiro
    [J]. MEDINFO 2023 - THE FUTURE IS ACCESSIBLE, 2024, 310 : 569 - 573
  • [5] Development of a Deep Learning Natural Language Processing Model for Classification of Lung Cancer Radiology Reports
    Mithun, S.
    Jha, A. K.
    Sherkhane, U. B.
    Jaiswar, V.
    Nautiyal, A.
    Purandare, N. C.
    Rangarajan, V.
    Dekker, A.
    Wee, L.
    [J]. EUROPEAN JOURNAL OF NUCLEAR MEDICINE AND MOLECULAR IMAGING, 2021, 48 (SUPPL 1) : S330 - S330
  • [6] Characterizing and Quantifying Diagnostic (Un)Certainty in Medical Reports Through Natural Language Processing
    Isenegger, Kathleen
    Dong, Yilan
    Shang, Mengyuan
    Furst, Jacob
    Stan-Raicu, Daniela
    [J]. 2019 6TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI 2019), 2019, : 914 - 919
  • [7] Assessment of Deep Natural Language Processing in Ascertaining Oncologic Outcomes From Radiology Reports
    Kehl, Kenneth L.
    Elmarakeby, Haitham
    Nishino, Mizuki
    Van Allen, Eliezer M.
    Lepisto, Eva M.
    Hassett, Michael J.
    Johnson, Bruce E.
    Schrag, Deborah
    [J]. JAMA ONCOLOGY, 2019, 5 (10) : 1421 - 1429
  • [8] Deep Learning for Natural Language Processing in Radiology-Fundamentals and a Systematic Review
    Sorin, Vera
    Barash, Yiftach
    Konen, Eli
    Klang, Eyal
    [J]. JOURNAL OF THE AMERICAN COLLEGE OF RADIOLOGY, 2020, 17 (05) : 639 - 648
  • [9] Characterization of Change and Significance for Clinical Findings in Radiology Reports Through Natural Language Processing
    Saeed Hassanpour
    Graham Bay
    Curtis P. Langlotz
    [J]. Journal of Digital Imaging, 2017, 30 : 314 - 322
  • [10] Quality Management of Pulmonary Nodule Radiology Reports Based on Natural Language Processing
    Fei, Xiaolu
    Chen, Pengyu
    Wei, Lan
    Huang, Yue
    Xin, Yi
    Li, Jia
    [J]. BIOENGINEERING-BASEL, 2022, 9 (06):