A deep fusion-based vision transformer for breast cancer classification

Cited by: 1
Authors
Fiaz, Ahsan [1]
Raza, Basit [1]
Faheem, Muhammad [2]
Raza, Aadil [3]
Affiliations
[1] COMSATS Univ Islamabad CUI, Dept Comp Sci, Islamabad, Pakistan
[2] Univ Vaasa, Sch Technol & Innovat, Vaasa 65200, Finland
[3] COMSATS Univ Islamabad CUI, Dept Phys, Islamabad, Pakistan
Keywords
artificial intelligence; breast cancer; classification; deep learning; histopathology images; machine learning
DOI
10.1049/htl2.12093
Chinese Library Classification (CLC) number
R318 [Biomedical engineering]
Discipline classification code
0831
Abstract
Breast cancer is one of the most common causes of death in women in the modern world. Cancerous tissue detection in histopathological images relies on complex features related to tissue structure and staining properties. Convolutional neural network (CNN) models like ResNet50, Inception-V1, and VGG-16, while useful in many applications, cannot capture the patterns of cell layers and staining properties. Most previous approaches, such as stain normalization and instance-based vision transformers, either miss important features or do not process the whole image effectively. Therefore, a deep fusion-based vision transformer model (DFViT) that combines CNNs and transformers for better feature extraction is proposed. DFViT captures local and global patterns more effectively by fusing RGB and stain-normalized images. The model is trained and tested on several datasets, such as BreakHis, breast cancer histology (BACH), and UCSC cancer genomics (UC). The results demonstrate outstanding accuracy, F1 score, precision, and recall, setting a new milestone in histopathological image analysis for diagnosing breast cancer.
Pages: 471 - 484
Page count: 14
Related papers
50 in total
  • [21] EMViT-BCC: Enhanced Mobile Vision Transformer for Breast Cancer Classification
    Potsangbam, Jacinta
    Devi, Salam Shuleenda
    INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2025, 35 (02)
  • [22] Vision transformer-based multimodal fusion network for classification of tumor malignancy on breast ultrasound: A retrospective multicenter study
    Li, Mengying
    Fang, Yin
    Shao, Jiong
    Jiang, Yan
    Xu, Guoping
    Cui, Xin-wu
    Wu, Xinglong
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2025, 196
  • [23] Hybrid Sparse Transformer and Wavelet Fusion-Based Deep Unfolding Network for Hyperspectral Snapshot Compressive Imaging
    Ying, Yangke
    Wang, Jin
    Shi, Yunhui
    Ling, Nam
    SENSORS, 2024, 24 (19)
  • [24] Breast Ultrasound Image BI-RADS Classification Based on Vision Transformer
    Wei, Yanbo
    Ye, Junbo
    Li, Xiaofeng
    Zhao, Yuanyuan
    Wang, Yanwei
    INTERNATIONAL JOURNAL OF MULTIPHYSICS, 2024, 18 (02) : 32 - 39
  • [25] Attention Multisource Fusion-Based Deep Few-Shot Learning for Hyperspectral Image Classification
    Liang, Xuejian
    Zhang, Ye
    Zhang, Junping
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 : 8773 - 8788
  • [26] A Vision Transformer Network with Wavelet-Based Features for Breast Ultrasound Classification
    He, Chenyang
    Diao, Yan
    Ma, Xingcong
    Yu, Shuo
    He, Xin
    Mao, Guochao
    Wei, Xinyu
    Zhang, Yu
    Zhao, Yang
    IMAGE ANALYSIS & STEREOLOGY, 2024, 43 (02) : 185 - 194
  • [27] GSC-DVIT: A vision transformer based deep learning model for lung cancer classification in CT images
    Mannepalli, Durgaprasad
    Tak, Tan Kuan
    Krishnan, Sivaneasan Bala
    Sreenivas, Velagapudi
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 103
  • [28] A sensor fusion-based classification system for thermoplastic recycling
    Satorres Martinez, S.
    Lopez Paniza, J. M.
    Cobo Ramirez, M.
    Gomez Ortega, J.
    Gamez Garcia, J.
    PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTOMATION AND COMPUTING (ICAC 12), 2012 : 290 - 295
  • [29] Vision Transformer based Audio Classification using Patch-level Feature Fusion
    Luo, Juan
    Yang, Jielong
    Chng, Eng Siong
    Zhong, Xionghu
    PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022 : 22 - 26
  • [30] DeepMetaForge: A Deep Vision-Transformer Metadata-Fusion Network for Automatic Skin Lesion Classification
    Vachmanus, Sirawich
    Noraset, Thanapon
    Piyanonpong, Waritsara
    Rattananukrom, Teerapong
    Tuarob, Suppawong
    IEEE ACCESS, 2023, 11 : 145467 - 145484