Transformer-based multi-task learning for classification and segmentation of gastrointestinal tract endoscopic images

被引:11
|
作者
Tang, Suigu [1 ]
Yu, Xiaoyuan [1 ]
Cheang, Chak Fong [1 ]
Liang, Yanyan [1 ]
Zhao, Penghui [1 ]
Yu, Hon Ho [2 ]
Choi, I. Cheong [2 ]
机构
[1] Macau Univ Sci & Technol, Fac Innovat Engn, Sch Comp Sci & Engn, Cotai, Macao, Peoples R China
[2] Kiang Wu Hosp, Macau, Macau, Peoples R China
关键词
Transformer; Multi-task learning; Classification; Segmentation; Active learning; DEEP; NETWORK; DIAGNOSIS;
D O I
10.1016/j.compbiomed.2023.106723
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Despite being widely utilized to help endoscopists identify gastrointestinal (GI) tract diseases using classifica-tion and segmentation, models based on convolutional neural network (CNN) have difficulties in distinguishing the similarities among some ambiguous types of lesions presented in endoscopic images, and in the training when lacking labeled datasets. Those will prevent CNN from further improving the accuracy of diagnosis. To address these challenges, we first proposed a Multi-task Network (TransMT-Net) capable of simultaneously learning two tasks (classification and segmentation), which has the transformer designed to learn global features and can combine the advantages of CNN in learning local features so that to achieve a more accurate prediction in identifying the lesion types and regions in GI tract endoscopic images. We further adopted the active learning in TransMT-Net to tackle the labeled image-hungry problem. A dataset was created from the CVC-ClinicDB dataset, Macau Kiang Wu Hospital, and Zhongshan Hospital to evaluate the model performance. Then, the experimental results show that our model not only achieved 96.94% accuracy in the classification task and 77.76% Dice Similarity Coefficient in the segmentation task but also outperformed those of other models on our test set. Meanwhile, active learning also produced positive results for the performance of our model with a small-scale initial training set, and even its performance with 30% of the initial training set was comparable to that of most comparable models with the full training set. Consequently, the proposed TransMT-Net has demonstrated its potential performance in GI tract endoscopic images and it through active learning can alleviate the shortage of labeled images.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Microcrack Segmentation of 3D CT Images Based on Multi-Task Learning
    Peng, Junjie
    Li, Wenbin
    Liao, Suyu
    Zhu, Yining
    IEEE Access, 2024, 12 : 138192 - 138200
  • [22] Segmentation of Remote Sensing Images Based on U-Net Multi-Task Learning
    Ruiwen, Ni
    Ye, Mu
    Ji, Li
    Tong, Zhang
    Tianye, Luo
    Ruilong, Feng
    He, Gong
    Tianli, Hu
    Yu, Sun
    Ying, Guo
    Shijun, Li
    Tyasi, Thobela Louis
    Computers, Materials and Continua, 2022, 73 (02): : 3263 - 3274
  • [23] Multi-Task Model for Esophageal Lesion Analysis Using Endoscopic Images: Classification with Image Retrieval and Segmentation with Attention
    Yu, Xiaoyuan
    Tang, Suigu
    Cheang, Chak Fong
    Yu, Hon Ho
    Choi, I. Cheong
    SENSORS, 2022, 22 (01)
  • [24] Peripapillary Atrophy Segmentation in Fundus Images via Multi-task Learning
    Wei, Xiao
    Jiang, Bo
    Ling, Yuye
    Jin, Peiyao
    Wang, Yifan
    Wang, Xinbing
    Zhou, Chenghu
    MEDICAL IMAGING 2023, 2023, 12464
  • [25] Multi-task learning for gland segmentation
    Rezazadeh, Iman
    Duygulu, Pinar
    SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (01) : 1 - 9
  • [26] Multi-Task Learning for Subspace Segmentation
    Wang, Yu
    Wipf, David
    Ling, Qing
    Chen, Wei
    Wassell, Ian
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 37, 2015, 37 : 1209 - 1217
  • [27] Multi-task learning for gland segmentation
    Iman Rezazadeh
    Pinar Duygulu
    Signal, Image and Video Processing, 2023, 17 : 1 - 9
  • [28] Multi-task learning for segmentation and classification of tumors in 3D automated breast ultrasound images☆
    Zhou, Yue
    Chen, Houjin
    Li, Yanfeng
    Liu, Qin
    Xu, Xuanang
    Wang, Shu
    Yap, Pew-Thian
    Shen, Dinggang
    MEDICAL IMAGE ANALYSIS, 2021, 70
  • [29] A statistical categorization-based curriculum learning approach for multi-task classification of images
    Ozan Veranyurt
    C. Okan Sakar
    Applied Intelligence, 2025, 55 (6)
  • [30] TransNuSeg: A Lightweight Multi-task Transformer for Nuclei Segmentation
    He, Zhenqi
    Unberath, Mathias
    Ke, Jing
    Shen, Yiqing
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT IV, 2023, 14223 : 206 - 215