Transformer-based multi-task learning for classification and segmentation of gastrointestinal tract endoscopic images

被引：11

作者：

Tang, Suigu ^{[1
]}

Yu, Xiaoyuan ^{[1
]}

Cheang, Chak Fong ^{[1
]}

Liang, Yanyan ^{[1
]}

Zhao, Penghui ^{[1
]}

Yu, Hon Ho ^{[2
]}

Choi, I. Cheong ^{[2
]}

机构：

[1] Macau Univ Sci & Technol, Fac Innovat Engn, Sch Comp Sci & Engn, Cotai, Macao, Peoples R China

[2] Kiang Wu Hosp, Macau, Macau, Peoples R China

来源：

COMPUTERS IN BIOLOGY AND MEDICINE | 2023年 / 157卷

关键词：

Transformer; Multi-task learning; Classification; Segmentation; Active learning; DEEP; NETWORK; DIAGNOSIS;

D O I：

10.1016/j.compbiomed.2023.106723

中图分类号：

Q [生物科学];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Despite being widely utilized to help endoscopists identify gastrointestinal (GI) tract diseases using classifica-tion and segmentation, models based on convolutional neural network (CNN) have difficulties in distinguishing the similarities among some ambiguous types of lesions presented in endoscopic images, and in the training when lacking labeled datasets. Those will prevent CNN from further improving the accuracy of diagnosis. To address these challenges, we first proposed a Multi-task Network (TransMT-Net) capable of simultaneously learning two tasks (classification and segmentation), which has the transformer designed to learn global features and can combine the advantages of CNN in learning local features so that to achieve a more accurate prediction in identifying the lesion types and regions in GI tract endoscopic images. We further adopted the active learning in TransMT-Net to tackle the labeled image-hungry problem. A dataset was created from the CVC-ClinicDB dataset, Macau Kiang Wu Hospital, and Zhongshan Hospital to evaluate the model performance. Then, the experimental results show that our model not only achieved 96.94% accuracy in the classification task and 77.76% Dice Similarity Coefficient in the segmentation task but also outperformed those of other models on our test set. Meanwhile, active learning also produced positive results for the performance of our model with a small-scale initial training set, and even its performance with 30% of the initial training set was comparable to that of most comparable models with the full training set. Consequently, the proposed TransMT-Net has demonstrated its potential performance in GI tract endoscopic images and it through active learning can alleviate the shortage of labeled images.

引用

页数：11

共 50 条

[21] Microcrack Segmentation of 3D CT Images Based on Multi-Task Learning
Peng, Junjie
Li, Wenbin
Liao, Suyu
Zhu, Yining
IEEE Access, 2024, 12 : 138192 - 138200
[22] Segmentation of Remote Sensing Images Based on U-Net Multi-Task Learning
Ruiwen, Ni
Ye, Mu
Ji, Li
Tong, Zhang
Tianye, Luo
Ruilong, Feng
He, Gong
Tianli, Hu
Yu, Sun
Ying, Guo
Shijun, Li
Tyasi, Thobela Louis
Computers, Materials and Continua, 2022, 73 (02): : 3263 - 3274
[23] Multi-Task Model for Esophageal Lesion Analysis Using Endoscopic Images: Classification with Image Retrieval and Segmentation with Attention
Yu, Xiaoyuan
Tang, Suigu
Cheang, Chak Fong
Yu, Hon Ho
Choi, I. Cheong
SENSORS, 2022, 22 (01)
[24] Peripapillary Atrophy Segmentation in Fundus Images via Multi-task Learning
Wei, Xiao
Jiang, Bo
Ling, Yuye
Jin, Peiyao
Wang, Yifan
Wang, Xinbing
Zhou, Chenghu
MEDICAL IMAGING 2023, 2023, 12464
[25] Multi-task learning for gland segmentation
Rezazadeh, Iman
Duygulu, Pinar
SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (01) : 1 - 9
[26] Multi-Task Learning for Subspace Segmentation
Wang, Yu
Wipf, David
Ling, Qing
Chen, Wei
Wassell, Ian
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 37, 2015, 37 : 1209 - 1217
[27] Multi-task learning for gland segmentation
Iman Rezazadeh
Pinar Duygulu
Signal, Image and Video Processing, 2023, 17 : 1 - 9
[28] Multi-task learning for segmentation and classification of tumors in 3D automated breast ultrasound images☆
Zhou, Yue
Chen, Houjin
Li, Yanfeng
Liu, Qin
Xu, Xuanang
Wang, Shu
Yap, Pew-Thian
Shen, Dinggang
MEDICAL IMAGE ANALYSIS, 2021, 70
[29] A statistical categorization-based curriculum learning approach for multi-task classification of images
Ozan Veranyurt
C. Okan Sakar
Applied Intelligence, 2025, 55 (6)
[30] TransNuSeg: A Lightweight Multi-task Transformer for Nuclei Segmentation
He, Zhenqi
Unberath, Mathias
Ke, Jing
Shen, Yiqing
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT IV, 2023, 14223 : 206 - 215

← 1 2 3 4 5 →