Transformer-based multi-task learning for classification and segmentation of gastrointestinal tract endoscopic images

被引:11
|
作者
Tang, Suigu [1 ]
Yu, Xiaoyuan [1 ]
Cheang, Chak Fong [1 ]
Liang, Yanyan [1 ]
Zhao, Penghui [1 ]
Yu, Hon Ho [2 ]
Choi, I. Cheong [2 ]
机构
[1] Macau Univ Sci & Technol, Fac Innovat Engn, Sch Comp Sci & Engn, Cotai, Macao, Peoples R China
[2] Kiang Wu Hosp, Macau, Macau, Peoples R China
关键词
Transformer; Multi-task learning; Classification; Segmentation; Active learning; DEEP; NETWORK; DIAGNOSIS;
D O I
10.1016/j.compbiomed.2023.106723
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Despite being widely utilized to help endoscopists identify gastrointestinal (GI) tract diseases using classifica-tion and segmentation, models based on convolutional neural network (CNN) have difficulties in distinguishing the similarities among some ambiguous types of lesions presented in endoscopic images, and in the training when lacking labeled datasets. Those will prevent CNN from further improving the accuracy of diagnosis. To address these challenges, we first proposed a Multi-task Network (TransMT-Net) capable of simultaneously learning two tasks (classification and segmentation), which has the transformer designed to learn global features and can combine the advantages of CNN in learning local features so that to achieve a more accurate prediction in identifying the lesion types and regions in GI tract endoscopic images. We further adopted the active learning in TransMT-Net to tackle the labeled image-hungry problem. A dataset was created from the CVC-ClinicDB dataset, Macau Kiang Wu Hospital, and Zhongshan Hospital to evaluate the model performance. Then, the experimental results show that our model not only achieved 96.94% accuracy in the classification task and 77.76% Dice Similarity Coefficient in the segmentation task but also outperformed those of other models on our test set. Meanwhile, active learning also produced positive results for the performance of our model with a small-scale initial training set, and even its performance with 30% of the initial training set was comparable to that of most comparable models with the full training set. Consequently, the proposed TransMT-Net has demonstrated its potential performance in GI tract endoscopic images and it through active learning can alleviate the shortage of labeled images.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Predicting Outcomes for Cancer Patients with Transformer-Based Multi-task Learning
    Gerrard, Leah
    Peng, Xueping
    Clarke, Allison
    Schlegel, Clement
    Jiang, Jing
    AI 2021: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, 13151 : 381 - 392
  • [2] MFUnetr: A transformer-based multi-task learning network for multi-organ segmentation from partially labeled datasets
    Hao, Qin
    Tian, Shengwei
    Yu, Long
    Wang, Junwen
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 85
  • [3] Multi-task Active Learning for Pre-trained Transformer-based Models
    Rotman, Guy
    Reichart, Roi
    TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2022, 10 : 1209 - 1228
  • [4] A transformer-based multi-task deep learning model for simultaneous infiltrated brain area identification and segmentation of gliomas
    Li, Yin
    Zheng, Kaiyi
    Li, Shuang
    Yi, Yongju
    Li, Min
    Ren, Yufan
    Guo, Congyue
    Zhong, Liming
    Yang, Wei
    Li, Xinming
    Yao, Lin
    CANCER IMAGING, 2023, 23 (01)
  • [5] HTML']HTML: Hierarchical Transformer-based Multi-task Learning for Volatility Prediction
    Yang, Linyi
    Ng, Tin Lok James
    Smyth, Barry
    Dong, Riuhai
    WEB CONFERENCE 2020: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2020), 2020, : 441 - 451
  • [6] A transformer-based multi-task deep learning model for simultaneous infiltrated brain area identification and segmentation of gliomas
    Yin Li
    Kaiyi Zheng
    Shuang Li
    Yongju Yi
    Min Li
    Yufan Ren
    Congyue Guo
    Liming Zhong
    Wei Yang
    Xinming Li
    Lin Yao
    Cancer Imaging, 23
  • [7] A Novel Multi-Task Learning Network Based on Melanoma Segmentation and Classification with Skin Lesion Images
    Alenezi, Fayadh
    Armghan, Ammar
    Polat, Kemal
    DIAGNOSTICS, 2023, 13 (02)
  • [8] Multi-task learning for segmentation and classification of breast tumors from ultrasound images
    He Q.
    Yang Q.
    Su H.
    Wang Y.
    Computers in Biology and Medicine, 2024, 173
  • [9] Transformer-based transfer learning and multi-task learning for improving the performance of speech emotion recognition
    Park, Sunchan
    Kim, Hyung Soon
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2021, 40 (05): : 515 - 522
  • [10] A transformer-based multi-task deep learning model for simultaneous T-stage identification and segmentation of nasopharyngeal carcinoma
    Yang, Kaifan
    Dong, Xiuyu
    Tang, Fan
    Ye, Feng
    Chen, Bei
    Liang, Shujun
    Zhang, Yu
    Xu, Yikai
    FRONTIERS IN ONCOLOGY, 2024, 14