Towards efficient diagnostics: refining vision transformers for medical image multi-label classification

Authors
Cayce, Garrett I. [1]
Hand, Benjamin M. [1]
Kurz, Aidan G. [1]
Bailey, Colleen P. [1]
Affiliation
[1] Univ North Texas, Dept Elect Engn, Denton, TX 76207 USA
Keywords
chest x-ray; multi-label classification; vision transformer
DOI
10.1117/12.3013977
Chinese Library Classification (CLC) number
O43 [Optics]
Discipline classification code
070207; 0803
Abstract
Medical imaging, including chest X-rays, is an important tool in modern healthcare: it enables early and accurate disease diagnosis and facilitates timely interventions to mitigate health issues. By capturing images of critical internal organs like the lungs and heart, X-rays enable doctors to make informed diagnoses and treatment decisions, especially concerning respiratory and cardiac conditions. Early and accurate diagnosis, particularly when multiple pathologies are present, is paramount, as it greatly impacts patient outcomes by enabling timely and specific treatments. Recently, multi-label classification has become increasingly important in medical imaging, since several pathologies can be present within a single X-ray. While traditional convolutional neural networks (CNNs) have played a pivotal role in enhancing the accuracy of X-ray diagnoses, the expanding complexity of multi-label imaging demands more sophisticated methods. Vision Transformers (ViTs) have emerged as a promising approach in medical image classification, showcasing their ability to effectively process X-ray images and identify pathologies within them. While traditional ViTs perform well, they have significant drawbacks. Most ViT models use a large number of parameters, often ranging from millions to billions. Such parameter-intensive designs, while powerful, are computationally heavy. This not only increases resource requirements, but also raises concerns about their feasibility and scalability in real-world, time-sensitive healthcare settings. We propose a novel Vision Transformer architecture aimed at effectively classifying multi-label X-ray images while significantly enhancing the efficiency of ViT-based multi-label medical image classification methods. By optimizing model architectures and exploring techniques for parameter reduction, we seek to develop more streamlined and resource-efficient approaches without completely sacrificing the efficacy of these methods. Our work endeavors to bridge the gap between cutting-edge technology and practical healthcare applications, promising a more efficient and accessible future for medical image analysis.
Pages: 5