Improving Chest X-Ray Report Generation by Leveraging Warm Starting

被引:24
|
作者
Nicolson, Aaron [1 ]
Dowling, Jason [1 ]
Koopman, Bevan [1 ]
机构
[1] CSIRO Hlth & Biosecur, Australian eHlth Res Ctr, Brisbane, Australia
关键词
Chest X-ray report generation; Image captioning; Multi-modal learning warm starting; ARTIFICIAL-INTELLIGENCE; RADIOLOGY;
D O I
10.1016/j.artmed.2023.102633
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatically generating a report from a patient's Chest X-Rays (CXRs) is a promising solution to reducing clinical workload and improving patient care. However, current CXR report generators -- which are predominantly encoder-to-decoder models -- lack the diagnostic accuracy to be deployed in a clinical setting. To improve CXR report generation, we investigate warm starting the encoder and decoder with recent open-source computer vision and natural language processing checkpoints, such as the Vision Transformer (ViT) and PubMedBERT. To this end, each checkpoint is evaluated on the MIMIC-CXR and IU X-Ray datasets. Our experimental investigation demonstrates that the Convolutional vision Transformer (CvT) ImageNet-21K and the Distilled Generative Pre-trained Transformer 2 (DistilGPT2) checkpoints are best for warm starting the encoder and decoder, respectively. Compared to the state-of-the-art (M2 Transformer Progressive), CvT2DistilGPT2 attained an improvement of 8.3\% for CE F-1, 1.8\% for BLEU-4, 1.6\% for ROUGE-L, and 1.0\% for METEOR. The reports generated by CvT2DistilGPT2 have a higher similarity to radiologist reports than previous approaches. This indicates that leveraging warm starting improves CXR report generation. Code and checkpoints for CvT2DistilGPT2 are available at this https://github.com/achre/cvt2distiglgpt2.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Baselines for Chest X-Ray Report Generation
    Boag, William
    Hsu, Tzu-Ming Harry
    McDermott, Matthew
    Berner, Gabriela
    Alsentzer, Emily
    Szolovits, Peter
    MACHINE LEARNING FOR HEALTH WORKSHOP, VOL 116, 2019, 116 : 126 - 140
  • [2] Clinically Accurate Chest X-Ray Report Generation
    Liu, Guanxiong
    Hsu, Tzu-Ming Harry
    McDermott, Matthew
    Boag, Willie
    Weng, Wei-Hung
    Szolovits, Peter
    Ghassemi, Marzyeh
    MACHINE LEARNING FOR HEALTHCARE CONFERENCE, VOL 106, 2019, 106
  • [3] Contrastive Attention for Automatic Chest X-ray Report Generation
    Liu, Fenglin
    Yin, Changchang
    Wu, Xian
    Ge, Shen
    Zhang, Ping
    Sun, Xu
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 269 - 280
  • [4] Variational Topic Inference for Chest X-Ray Report Generation
    Najdenkoska, Ivona
    Zhen, Xiantong
    Worring, Marcel
    Shao, Ling
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT III, 2021, 12903 : 625 - 635
  • [5] Deep learning for report generation on chest X-ray images
    Ouis, Mohammed Yasser
    Akhloufi, Moulay A.
    COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2024, 111
  • [6] TiBiX: Leveraging Temporal Information for Bidirectional X-Ray and Report Generation
    Sanjeev, Santosh
    Maani, Fadillah Adamsyah
    Abzhanov, Arsen
    Papineni, Vijay Ram
    Almakky, Ibrahim
    Papiez, Bartlomiej W.
    Yaqub, Mohammad
    DEEP GENERATIVE MODELS, DGM4MICCAI 2024, 2025, 15224 : 169 - 179
  • [7] Controllable Chest X-Ray Report Generation from Longitudinal Representations
    Serra, Francesco Dalla
    Wang, Chaoyang
    Deligianni, Fani
    Dalton, Jeffrey
    O'Neil, Alison Q.
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, : 4891 - 4904
  • [8] Weakly Supervised Contrastive Learning for Chest X-Ray Report Generation
    Yan, An
    He, Zexue
    Lu, Xing
    Du, Jiang
    Chang, Eric
    Gentili, Amilcare
    McAuley, Julian
    Hsu, Chun-Nan
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 4009 - 4015
  • [9] Evaluating progress in automatic chest X-ray radiology report generation
    Yu, Feiyang
    Endo, Mark
    Krishnan, Rayan
    Pan, Ian
    Tsai, Andy
    Reis, Eduardo Pontes
    Fonseca, Eduardo Kaiser Ururahy Nunes
    Lee, Henrique Min Ho
    Abad, Zahra Shakeri Hossein
    Ng, Andrew Y.
    Langlotz, Curtis P.
    Venugopal, Vasantha Kumar
    Rajpurkar, Pranav
    PATTERNS, 2023, 4 (09):
  • [10] Improving the Fairness of Chest X-ray Classifiers
    Zhang, Haoran
    Dullerud, Natalie
    Roth, Karsten
    Oakden-Rayner, Lauren
    Pfohl, Stephen
    Ghassemi, Marzyeh
    CONFERENCE ON HEALTH, INFERENCE, AND LEARNING, VOL 174, 2022, 174 : 204 - 233