Diverter transformer-based multi-encoder-multi-decoder network model for medical retinal blood vessel image segmentation

被引：0

作者：

Wu, Chengwei ^{[1
]}

Guo, Min ^{[1
]}

Ma, Miao ^{[1
]}

Wang, Kaiguang ^{[1
]}

机构：

[1] Shaanxi Normal Univ, Sch Comp Sci, Minist Educ, Key Lab Modern Teaching Technol, Xian 710119, Peoples R China

来源：

BIOMEDICAL SIGNAL PROCESSING AND CONTROL | 2024年 / 93卷

关键词：

Medical image processing; Encoder-decoder architecture; Local context; Retinal vessel segmentation; U-NET;

D O I：

10.1016/j.bspc.2024.106132

中图分类号：

R318 [生物医学工程];

学科分类号：

0831 ;

摘要：

The retinal blood vessel is an essential part of the fundus structure. It is important to accurately analyze the structure and distribution of retinal vessels, which can help make accurate medical diagnoses. However, it is still challenging to extract detailed information due to the problems of fuzzy edges, low resolution, and lots of noise in retinal blood vessel medical images. To extract the image detail information effectively, we propose a new diverter transformer -based multi -encoder -multi -decoder network model in this paper. The network model consists of a feature encoder module and a feature decoder module. Among them, the feature encoding module consists of a diverter transformer with a diverter adaptive mechanism, three encoder units with a convolution layer and max -pooling layer, and the two decoder units in the feature decoding module consist of an inverse convolution layer and an up -sampling layer, respectively. The Local Context Module (LCNet Module) in the feature encoding module learns richer local context feature information layer by layer through changing the width of the network while downsampling; the Global Encoder Module1 (G -Encoder Module1) and the Global Encoder Module2 (G -Encoder Module2) extract the global feature representation of retinal blood vessel images by performing a max -pooling operation to transform the input data into a vector of fixed dimensions, thus helping the network model to better understand and extract the global feature representation of retinal blood vessel images. The two decoder units in the feature decoding module receive local and global feature information from three encoder units, LCNet Module, G -Encoder Module1 and G -Encoder Module2, respectively. Decoder Module1 generates segmentation prediction by layer -by -layer up -sampling operation, and Decoder Module2 recovers the feature information by downsampling and decoding operations and fuses the recovered feature information to output, obtaining the final segmentation of the retinal blood vessels. The proposed diverter transformer -based multi -encoder -multi -decoder network model is validated on the DRIVE and STARE datasets with other classical and state-of-the-art network models, and its segmentation accuracy is 97.25% and 97.93%, respectively. Compared with the classical U -Net model, the improvement is 2.24% and 1.42%, respectively. Compared with the state-of-the-art SPNet model, the accuracy is increased by 0.61% on DRIVE and 1.01% on STARE. It indicates that the network model proposed in this paper has a significant competitive advantage in improving the segmentation performance of retinal blood vessel images.

引用

页数：24

共 50 条

[41] TransDiffSeg: Transformer-Based Conditional Diffusion Segmentation Model for Abdominal Multi-Objective
Gu, WenWen
Zhang, GuoDong
Ju, RongHui
Wang, SuRan
Li, YanLin
Liang, TingYu
Guo, Wei
Gong, ZhaoXuan
[J]. JOURNAL OF IMAGING INFORMATICS IN MEDICINE, 2024,
[42] Multi-Level Attention Network for Retinal Vessel Segmentation
Yuan, Yuchen
Zhang, Lei
Wang, Lituan
Huang, Haiying
[J]. IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2022, 26 (01) : 312 - 323
[43] Encoder-Decoder with Multi-scale Information Fusion for Semantic Image Segmentation
Ma, Xinxin
Liu, Kai
Ding, Chongyang
Yan, Lin
Duan, Meiyu
[J]. ELEVENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2019), 2020, 11373
[44] ELiFormer: A hierarchical Transformer based Model with Efficient Encoder and Lightweight Decoder for Semantic Segmentation
Wu, Zixuan
Zhou, Yue
[J]. 2024 2ND ASIA CONFERENCE ON COMPUTER VISION, IMAGE PROCESSING AND PATTERN RECOGNITION, CVIPPR 2024, 2024,
[45] Blood Vessel Segmentation of Retinal Image Based on Dense-U-Net Network
Li, Zhenwei
Jia, Mengli
Yang, Xiaoli
Xu, Mengying
[J]. MICROMACHINES, 2021, 12 (12)
[46] Encoder-Decoder Network for Brain Tumor Segmentation on Multi-sequence MRI
Iantsen, Andrei
Jaouen, Vincent
Visvikis, Dimitris
Hatt, Mathieu
[J]. BRAINLESION: GLIOMA, MULTIPLE SCLEROSIS, STROKE AND TRAUMATIC BRAIN INJURIES (BRAINLES 2019), PT II, 2020, 11993 : 296 - 302
[47] Dual encoder network with transformer-CNN for multi-organ segmentation
Zhifang Hong
Mingzhi Chen
Weijie Hu
Shiyu Yan
Aiping Qu
Lingna Chen
Junxi Chen
[J]. Medical & Biological Engineering & Computing, 2023, 61 : 661 - 671
[48] Dual encoder network with transformer-CNN for multi-organ segmentation
Hong, Zhifang
Chen, Mingzhi
Hu, Weijie
Yan, Shiyu
Qu, Aiping
Chen, Lingna
Chen, Junxi
[J]. MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2023, 61 (03) : 661 - 671
[49] MCV-UNet: a modified convolution & transformer hybrid encoder-decoder network with multi-scale information fusion for ultrasound image semantic segmentation
Xu, Zihong
Wang, Ziyang
[J]. PEERJ COMPUTER SCIENCE, 2024, 10
[50] MCV-UNet: a modified convolution & transformer hybrid encoder-decoder network with multi-scale information fusion for ultrasound image semantic segmentation
Xu, Zihong
Wang, Ziyang
[J]. PeerJ Computer Science, 2024, 10

← 1 2 3 4 5 →