Multi-task vision transformer using low-level chest X-ray feature corpus for COVID-19 diagnosis and severity quantification

被引:59
|
作者
Park, Sangjoon [1 ]
Kim, Gwanghyun [1 ]
Oh, Yujin [1 ]
Seo, Joon Beom [2 ]
Lee, Sang Min [2 ]
Kim, Jin Hwan [3 ]
Moon, Sungjun [4 ]
Lim, Jae-Kwang [5 ]
Ye, Jong Chul [1 ]
机构
[1] Korea Adv Inst Sci & Technol KAIST, Daejeon, South Korea
[2] Univ Ulsan, Asan Med Ctr, Coll Med, Seoul, South Korea
[3] Chungnam Natl Univ, Coll Med, Daejeon, South Korea
[4] Yeungnam Univ, Coll Med, Daegu, South Korea
[5] Kyungpook Natl Univ, Sch Med, Daegu, South Korea
基金
新加坡国家研究基金会;
关键词
Coronavirus disease-19; Chest X-ray; Vision transformer; Multi-task learning;
D O I
10.1016/j.media.2021.102299
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Developing a robust algorithm to diagnose and quantify the severity of the novel coronavirus disease 2019 (COVID-19) using Chest X-ray (CXR) requires a large number of well-curated COVID-19 datasets, which is difficult to collect under the global COVID-19 pandemic. On the other hand, CXR data with other findings are abundant. This situation is ideally suited for the Vision Transformer (ViT) architecture, where a lot of unlabeled data can be used through structural modeling by the self-attention mechanism. However, the use of existing ViT may not be optimal, as the feature embedding by direct patch flattening or ResNet backbone in the standard ViT is not intended for CXR. To address this problem, here we propose a novel Multi-task ViT that leverages low-level CXR feature corpus obtained from a backbone network that extracts common CXR findings. Specifically, the backbone network is first trained with large public datasets to detect common abnormal findings such as consolidation, opacity, edema, etc. Then, the embedded features from the backbone network are used as corpora for a versatile Transformer model for both the diagnosis and the severity quantification of COVID-19. We evaluate our model on various external test datasets from totally different institutions to evaluate the generalization capability. The experimental results confirm that our model can achieve state-of-the-art performance in both diagnosis and severity quantification tasks with outstanding generalization capability, which are sine qua non of widespread deployment. (c) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Multi-task driven explainable diagnosis of COVID-19 using chest X-ray images
    Malhotra, Aakarsh
    Mittal, Surbhi
    Majumdar, Puspita
    Chhabra, Saheb
    Thakral, Kartik
    Vatsa, Mayank
    Singh, Richa
    Chaudhury, Santanu
    Pudrod, Ashwin
    Agrawal, Anjali
    PATTERN RECOGNITION, 2022, 122
  • [2] Multi-task driven explainable diagnosis of COVID-19 using chest X-ray images
    Malhotra, Aakarsh
    Mittal, Surbhi
    Majumdar, Puspita
    Chhabra, Saheb
    Thakral, Kartik
    Vatsa, Mayank
    Singh, Richa
    Chaudhury, Santanu
    Pudrod, Ashwin
    Agrawal, Anjali
    Pattern Recognition, 2022, 122
  • [3] PneuNet: deep learning for COVID-19 pneumonia diagnosis on chest X-ray image analysis using Vision Transformer
    Tianmu Wang
    Zhenguo Nie
    Ruijing Wang
    Qingfeng Xu
    Hongshi Huang
    Handing Xu
    Fugui Xie
    Xin-Jun Liu
    Medical & Biological Engineering & Computing, 2023, 61 : 1395 - 1408
  • [4] PneuNet: deep learning for COVID-19 pneumonia diagnosis on chest X-ray image analysis using Vision Transformer
    Wang, Tianmu
    Nie, Zhenguo
    Wang, Ruijing
    Xu, Qingfeng
    Huang, Hongshi
    Xu, Handing
    Xie, Fugui
    Liu, Xin-Jun
    MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2023, 61 (06) : 1395 - 1408
  • [5] COVID-19 Chest X-ray Classification and Severity Assessment Using Convolutional and Transformer Neural Networks
    Tuan Le Dinh
    Lee, Suk-Hwan
    Kwon, Seong-Geun
    Kwon, Ki-Ryong
    APPLIED SCIENCES-BASEL, 2022, 12 (10):
  • [6] Multi-task supervised contrastive learning for chest X-ray diagnosis: A two-stage hierarchical classification framework for COVID-19 diagnosis
    Chen, Guan-Ying
    Lin, Chih-Ting
    APPLIED SOFT COMPUTING, 2024, 155
  • [7] Multi-task supervised contrastive learning for chest X-ray diagnosis: A two-stage hierarchical classification framework for COVID-19 diagnosis
    Chen, Guan-Ying
    Lin, Chih-Ting
    Applied Soft Computing, 2024, 155
  • [8] Multi-task localization of the hemidiaphragms and lung segmentation in portable chest X-ray images of COVID-19 patients
    Moris, Daniel, I
    de Moura, Joaquim
    Aslani, Shahab
    Jacob, Joseph
    Novo, Jorge
    Ortega, Marcos
    DIGITAL HEALTH, 2024, 10
  • [9] Utility of the serial portable chest x-ray for the diagnosis and quantification of COVID-19 patients
    Khan, Sohail Ahmed
    Manohar, Murli
    Khan, Maria
    Hasan, Nighat
    Zaheer, Sidra
    Asad, Faisal
    Adil, Syed Omair
    JOURNAL OF TAIBAH UNIVERSITY MEDICAL SCIENCES, 2023, 18 (02): : 321 - 330
  • [10] Feature-level ensemble approach for COVID-19 detection using chest X-ray images
    Ho, Thi Kieu Khanh
    Gwak, Jeonghwan
    PLOS ONE, 2022, 17 (07):