Development of image-based decision support systems utilizing information extracted from radiological free-text report databases with text-based transformers

被引:2
|
作者
Nowak, Sebastian [1 ]
Schneider, Helen [2 ]
Layer, Yannik C. [1 ]
Theis, Maike [1 ]
Biesner, David [2 ]
Block, Wolfgang [1 ]
Wulff, Benjamin [2 ]
Attenberger, Ulrike I. [1 ]
Sifa, Rafet [2 ]
Sprinkart, Alois M. [1 ]
机构
[1] Univ Hosp Bonn, Dept Diagnost & Intervent Radiol, Bonn, Germany
[2] Fraunhofer Inst Intelligent Anal & Informat Syst I, St Augustin, Germany
关键词
Radiology; Deep learning; Intensive care units; Thorax;
D O I
10.1007/s00330-023-10373-0
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
ObjectivesTo investigate the potential and limitations of utilizing transformer-based report annotation for on-site development of image-based diagnostic decision support systems (DDSS).MethodsThe study included 88,353 chest X-rays from 19,581 intensive care unit (ICU) patients. To label the presence of six typical findings in 17,041 images, the corresponding free-text reports of the attending radiologists were assessed by medical research assistants ("gold labels"). Automatically generated "silver" labels were extracted for all reports by transformer models trained on gold labels. To investigate the benefit of such silver labels, the image-based models were trained using three approaches: with gold labels only (MG), with silver labels first, then with gold labels (MS/G), and with silver and gold labels together (MS+G). To investigate the influence of invested annotation effort, the experiments were repeated with different numbers (N) of gold-annotated reports for training the transformer and image-based models and tested on 2099 gold-annotated images. Significant differences in macro-averaged area under the receiver operating characteristic curve (AUC) were assessed by non-overlapping 95% confidence intervals.ResultsUtilizing transformer-based silver labels showed significantly higher macro-averaged AUC than training solely with gold labels (N = 1000: MG 67.8 [66.0-69.6], MS/G 77.9 [76.2-79.6]; N = 14,580: MG 74.5 [72.8-76.2], MS/G 80.9 [79.4-82.4]). Training with silver and gold labels together was beneficial using only 500 gold labels (MS+G 76.4 [74.7-78.0], MS/G 75.3 [73.5-77.0]).ConclusionsTransformer-based annotation has potential for unlocking free-text report databases for the development of image-based DDSS. However, on-site development of image-based DDSS could benefit from more sophisticated annotation pipelines including further information than a single radiological report.Clinical relevance statementLeveraging clinical databases for on-site development of artificial intelligence (AI)-based diagnostic decision support systems by text-based transformers could promote the application of AI in clinical practice by circumventing highly regulated data exchanges with third parties.Key Points center dot The amount of data from a database that can be used to develop AI-assisted diagnostic decision systems is often limited by the need for time-consuming identification of pathologies by radiologists.center dot The transformer-based structuring of free-text radiological reports shows potential to unlock corresponding image databases for on-site development of image-based diagnostic decision support systems.center dot However, the quality of image annotations generated solely on the content of a single radiology report may be limited by potential inaccuracies and incompleteness of this report.Key Points center dot The amount of data from a database that can be used to develop AI-assisted diagnostic decision systems is often limited by the need for time-consuming identification of pathologies by radiologists.center dot The transformer-based structuring of free-text radiological reports shows potential to unlock corresponding image databases for on-site development of image-based diagnostic decision support systems.center dot However, the quality of image annotations generated solely on the content of a single radiology report may be limited by potential inaccuracies and incompleteness of this report. Key Points center dot The amount of data from a database that can be used to develop AI-assisted diagnostic decision systems is often limited by the need for time-consuming identification of pathologies by radiologists.center dot The transformer-based structuring of free-text radiological reports shows potential to unlock corresponding image databases for on-site development of image-based diagnostic decision support systems.center dot However, the quality of image annotations generated solely on the content of a single radiology report may be limited by potential inaccuracies and incompleteness of this report.
引用
收藏
页码:2895 / 2904
页数:10
相关论文
共 17 条
  • [1] Searching for information in a time-pressured setting: experiences with a Text-based and an Image-based decision support system
    Aminilari, M
    Pakath, R
    [J]. DECISION SUPPORT SYSTEMS, 2005, 41 (01) : 37 - 68
  • [2] FILTERING AND CONDENSING IN TEXT-BASED DECISION SUPPORT SYSTEMS
    MORRIS, AH
    [J]. PROCEEDINGS OF THE TWENTY-FIRST, ANNUAL HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES, VOLS 1-4: ARCHITECTURE TRACK, SOFTWARE TRACK, DECISION SUPPORT AND KNOWLEDGE BASED SYSTEMS TRACK, APPLICATIONS TRACK, 1988, : 77 - 85
  • [3] Transformer-based structuring of free-text radiology report databases
    Nowak, S.
    Biesner, D.
    Layer, Y. C.
    Theis, M.
    Schneider, H.
    Block, W.
    Wulff, B.
    Attenberger, U. I.
    Sifa, R.
    Sprinkart, A. M.
    [J]. EUROPEAN RADIOLOGY, 2023, 33 (06) : 4228 - 4236
  • [4] Transformer-based structuring of free-text radiology report databases
    S. Nowak
    D. Biesner
    Y. C. Layer
    M. Theis
    H. Schneider
    W. Block
    B. Wulff
    U. I. Attenberger
    R. Sifa
    A. M. Sprinkart
    [J]. European Radiology, 2023, 33 : 4228 - 4236
  • [5] Using Image-based and Text-based Information for Sales Prediction: A Deep Neural Network Model Completed Research
    Wang, Ying
    Guo, Yue
    Song, Jaeki
    [J]. AMCIS 2018 PROCEEDINGS, 2018,
  • [6] Ontology-based clinical information extraction from physician's free-text notes
    Yehia, Engy
    Boshnak, Hussein
    AbdelGaber, Sayed
    Abdo, Amany
    Elzanfaly, Doaa S.
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2019, 98
  • [7] Emotional expression in breast cancer support groups and emotional information available in observational and text-based coding systems
    Dubenko, Lynn M.
    Greenberg, Melanie
    Piemme, Karen Altree
    Yutsis, Maya
    Giese-Davis, Janine
    Golant, Mitch
    [J]. ANNALS OF BEHAVIORAL MEDICINE, 2008, 35 : S219 - S219
  • [8] Emotional expression in breast cancer support groups and emotional information available in observational and text-based coding systems
    Dubenko, L.
    Greenberg, M.
    Altree, Piemme K.
    Yutsis, M.
    Golant, M.
    Giese-Davis, J.
    [J]. PSYCHO-ONCOLOGY, 2008, 17 (03) : S92 - S93
  • [9] Extracting Structured Genotype Information from Free-Text HLA Reports Using a Rule-Based Approach
    Lee, Kye Hwa
    Kim, Hyo Jung
    Kim, Yi-Jun
    Kim, Ju Han
    Song, Eun Young
    [J]. JOURNAL OF KOREAN MEDICAL SCIENCE, 2020, 35 (12)
  • [10] Large language model-based information extraction from free-text radiology reports: a scoping review protocol
    Reichenpfader, Daniel
    Muller, Henning
    Denecke, Kerstin
    [J]. BMJ OPEN, 2023, 13 (12):