Transformer-based multiple instance learning network with 2D positional encoding for histopathology image classification

Cited by: 0
Authors
Bin Yang [1]
Lei Ding [2]
Jianqiang Li [2]
Yong Li [2]
Guangzhi Qu [2]
Jingyi Wang [3]
Qiang Wang [2]
Bo Liu [2]
Affiliations
[1] Academy of Military Science, Center for Strategic Assessment and Consulting
[2] Beijing University of Technology, Faculty of Information Technology
[3] Oakland University, Computer Science and Engineering Department
[4] Massey University, School of Mathematical and Computational Sciences
Keywords
Weakly supervised training; Image classification; Multiple instance learning
DOI
10.1007/s40747-025-01779-y
Abstract
Digital medical imaging, particularly pathology images, is essential for cancer diagnosis, but the extremely high resolution of these images makes direct model training difficult. Although weakly supervised learning has reduced the need for manual annotations, many multiple instance learning (MIL) methods struggle to capture the spatial relationships that matter in histopathological images. Existing methods that incorporate positional information often overlook fine-grained spatial correlations or use positional encoding strategies that do not fully capture the spatial structure of pathology images. To address this issue, we propose a new framework named TMIL (Transformer-based Multiple Instance Learning Network with 2D positional encoding), which uses multiple instance learning for weakly supervised classification of histopathological images. TMIL incorporates a Transformer-based 2D positional encoding module to model positional information and explore correlations between instances. Furthermore, TMIL divides histopathological images into pseudo-bags and trains patch-level feature vectors with deep metric learning to improve classification performance. The proposed approach is evaluated on a public colorectal adenoma dataset. The experimental results show that TMIL outperforms existing MIL methods, achieving an AUC of 97.28% and an ACC of 95.19%. These findings suggest that TMIL’s integration of deep metric learning and positional encoding offers a promising approach for improving the efficiency and accuracy of pathology image analysis in cancer diagnosis.
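The two ideas named in the abstract, a 2D positional encoding over patch positions and a split of each slide into pseudo-bags, can be illustrated with a minimal sketch. This record does not give TMIL's actual formulas, so everything below is an assumption made for illustration: the sinusoidal form is borrowed from the original Transformer and applied separately to row and column, the pseudo-bags are formed by a random partition, and the function names (`sincos_1d`, `positional_encoding_2d`, `split_into_pseudo_bags`) are hypothetical.

```python
import math
import random


def sincos_1d(pos, dim):
    """Sinusoidal encoding of one scalar position into `dim` values (dim must be even)."""
    out = []
    for i in range(0, dim, 2):
        freq = 1.0 / (10000 ** (i / dim))
        out.append(math.sin(pos * freq))
        out.append(math.cos(pos * freq))
    return out


def positional_encoding_2d(row, col, dim):
    """Encode a patch's (row, col) grid position.

    Assumption: the first half of the channels encodes the row and the
    second half encodes the column. TMIL's real module may differ.
    """
    if dim % 4 != 0:
        raise ValueError("dim must be divisible by 4")
    half = dim // 2
    return sincos_1d(row, half) + sincos_1d(col, half)


def split_into_pseudo_bags(patches, num_bags, seed=0):
    """Randomly partition one slide's patches into `num_bags` pseudo-bags.

    Assumption: each pseudo-bag would inherit the slide-level label.
    """
    rng = random.Random(seed)
    shuffled = list(patches)
    rng.shuffle(shuffled)
    # Strided slicing gives disjoint bags whose sizes differ by at most one.
    return [shuffled[i::num_bags] for i in range(num_bags)]


# Hypothetical slide: a 4 x 5 grid of patches identified by (row, col).
patches = [(r, c) for r in range(4) for c in range(5)]
encodings = {p: positional_encoding_2d(p[0], p[1], dim=8) for p in patches}
pseudo_bags = split_into_pseudo_bags(patches, num_bags=3)
```

In a pipeline of this kind, each encoding would typically be added to the corresponding patch feature vector before the Transformer layers, so that attention can take patch layout into account. Whether TMIL adds, concatenates, or learns its encoding is not stated in this record.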
Related papers
50 in total
  • [21] Transformer-Based Skin Carcinoma Classification using Histopathology Images via Incremental Learning
    Imran, Muhammad
    Akram, Muhammad Usman
    Salam, Anum Abdul
    2024 14TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION SYSTEMS, ICPRS, 2024,
  • [22] Transformer based multiple instance learning for WSI breast cancer classification
    Gao, Chengyang
    Sun, Qiule
    Zhu, Wen
    Zhang, Lizhi
    Zhang, Jianxin
    Liu, Bin
    Zhang, Junxing
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 89
  • [23] Contrastive Transformer-Based Multiple Instance Learning for Weakly Supervised Polyp Frame Detection
    Tian, Yu
    Pang, Guansong
    Liu, Fengbei
    Liu, Yuyuan
    Wang, Chong
    Chen, Yuanhong
    Verjans, Johan
    Carneiro, Gustavo
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT III, 2022, 13433 : 88 - 98
  • [24] An EM based multiple instance learning method for image classification
    Pao, H. T.
    Chuang, S. C.
    Xu, Y. Y.
    Fu, Hsin-Chia
    EXPERT SYSTEMS WITH APPLICATIONS, 2008, 35 (03) : 1468 - 1472
  • [25] A 3-D-Swin Transformer-Based Hierarchical Contrastive Learning Method for Hyperspectral Image Classification
    Huang, Xin
    Dong, Mengjie
    Li, Jiayi
    Guo, Xian
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [26] CRAT: Advanced transformer-based deep learning algorithms in OCT image classification
    Yang, Mingming
    Du, Junhui
    Lv, Ruichan
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 104
  • [27] TransPath: Transformer-Based Self-supervised Learning for Histopathological Image Classification
    Wang, Xiyue
    Yang, Sen
    Zhang, Jun
    Wang, Minghui
    Zhang, Jing
    Huang, Junzhou
    Yang, Wei
    Han, Xiao
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT VIII, 2021, 12908 : 186 - 195
  • [28] 2D medical image synthesis using transformer-based denoising diffusion probabilistic model
    Pan, Shaoyan
    Wang, Tonghe
    Qiu, Richard L. J.
    Axente, Marian
    Chang, Chih-Wei
    Peng, Junbo
    Patel, Ashish B.
    Shelton, Joseph
    Patel, Sagar A.
    Roper, Justin
    Yang, Xiaofeng
    PHYSICS IN MEDICINE AND BIOLOGY, 2023, 68 (10):
  • [29] MIL-VT: Multiple Instance Learning Enhanced Vision Transformer for Fundus Image Classification
    Yu, Shuang
    Ma, Kai
    Bi, Qi
    Bian, Cheng
    Ning, Munan
    He, Nanjun
    Li, Yuexiang
    Liu, Hanruo
    Zheng, Yefeng
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT VIII, 2021, 12908 : 45 - 54
  • [30] Breast Ultrasound Image Classification Based on Multiple-Instance Learning
    Ding, Jianrui
    Cheng, H. D.
    Huang, Jianhua
    Liu, Jiafeng
    Zhang, Yingtao
    JOURNAL OF DIGITAL IMAGING, 2012, 25 : 620 - 627