Diffusion model-based text-guided enhancement network for medical image segmentation

Cited by: 1

Authors:

Dong, Zhiwei [1]

Yuan, Genji [1]

Hua, Zhen [1]

Li, Jinjiang [2]

Affiliations:

[1] Shandong Technol & Business Univ, Sch Comp Sci & Technol, Yantai, Peoples R China

[2] Shandong Technol & Business Univ, Sch Informat & Elect Engn, Yantai, Peoples R China

Source:

EXPERT SYSTEMS WITH APPLICATIONS | 2024 / Volume 249

Funding:

National Natural Science Foundation of China;

Keywords:

Denoising diffusion model; Text attention mechanism; Guided feature enhancement; Medical image segmentation; CONVOLUTIONAL NEURAL-NETWORK; CELL-NUCLEI; MISDIAGNOSIS; CLASSIFICATION;

DOI:

10.1016/j.eswa.2024.123549

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In recent years, denoising diffusion models have achieved remarkable success in generating pixel-level representations with semantic values for image generation modeling. In this study, we propose a novel end-to-end framework, called TGEDiff, focusing on medical image segmentation. TGEDiff fuses a textual attention mechanism with the diffusion model by introducing an additional auxiliary categorization task, so that textual information guides the diffusion model toward excellent pixel-level representations. To overcome the limited perceptual fields of the independent feature encoders within the diffusion model, we introduce a multi-kernel excitation module to extend the model's perceptual capability. Meanwhile, a guided feature enhancement module is introduced in Denoising-UNet to focus the model's attention on important regions and attenuate the influence of noise and irrelevant background in medical images. We critically evaluated TGEDiff on three datasets (Kvasir-SEG, Kvasir-Sessile, and GLaS), and TGEDiff achieved significant improvements over the state-of-the-art approach on all three datasets, with F1 score and mIoU improving by 0.88% and 1.09%, 3.21% and 3.43%, and 1.29% and 2.34%, respectively. These data validate that TGEDiff has excellent performance in medical image segmentation. TGEDiff is expected to facilitate accurate diagnosis and treatment of medical diseases through more precise deconvolutional structural segmentation.
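
The abstract reports gains in F1 score and mIoU. As a reading aid, the sketch below shows how these two overlap metrics are commonly computed for a single pair of binary segmentation masks. It is not taken from the paper: the function name `f1_and_iou`, the nested-list mask format, and the convention that two empty masks count as a perfect match are all assumptions made for illustration. mIoU is usually the mean of such IoU values over classes or images; this record does not state which averaging the authors used.

```python
def f1_and_iou(pred, target):
    """Return (F1, IoU) for two equally sized binary masks.

    Masks are nested lists of 0/1 values (hypothetical format chosen
    for this sketch; real pipelines typically use arrays or tensors).
    """
    tp = fp = fn = 0
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            if p and t:
                tp += 1          # foreground in both masks
            elif p and not t:
                fp += 1          # predicted foreground, actually background
            elif t and not p:
                fn += 1          # missed foreground pixel

    f1_denominator = 2 * tp + fp + fn
    iou_denominator = tp + fp + fn
    # Assumed convention: if both masks are empty, treat it as a perfect match.
    f1 = 2 * tp / f1_denominator if f1_denominator else 1.0
    iou = tp / iou_denominator if iou_denominator else 1.0
    return f1, iou


# Toy example: 2 true positives, 1 false positive, 1 false negative.
pred = [[1, 1, 0],
        [0, 1, 0]]
target = [[1, 0, 0],
          [0, 1, 1]]
f1, iou = f1_and_iou(pred, target)
print(f"F1 = {f1:.4f}, IoU = {iou:.4f}")
```

For binary masks the F1 score equals the Dice coefficient, and the two metrics are related by IoU = F1 / (2 - F1), so they always rank a pair of predictions the same way on a single image.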


Pages: 18

Related records: 50 in total

[31] TexFusion: Synthesizing 3D Textures with Text-Guided Image Diffusion Models
Cao, Tianshi
Kreis, Karsten
Fidler, Sanja
Sharp, Nicholas
Yin, Kangxue
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 4146 - 4158
[32] Text-Guided Sketch-to-Photo Image Synthesis
Osahor, Uche
Nasrabadi, Nasser M.
IEEE ACCESS, 2022, 10 : 98278 - 98289
[33] Hardware Resilience Properties of Text-Guided Image Classifiers
Wasim, Syed Talal
Soboka, Kabila Haile
Mahmoud, Abdulrahman
Khan, Salman
Brooks, David
Wei, Gu-Yeon
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[34] DiCTI: Diffusion-based Clothing Designer via Text-guided Input
Lampe, Ajda
Stopar, Julija
Jain, Deepak K.
Omachi, Shinichiro
Peer, Peter
Struc, Vitomir
2024 IEEE 18TH INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION, FG 2024, 2024,
[35] A Medical Image Segmentation Network with Boundary Enhancement
Sun Junmei
Ge Qingqing
Li Xiumei
Zhao Baoqi
JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2022, 44 (05) : 1643 - 1652
[36] FocusGAN: Preserving Background in Text-Guided Image Editing
Zhao, Liuqing
Li, Linyan
Hu, Fuyuan
Xia, Zhenping
Yao, Rui
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2021, 35 (16)
[37] Learning semantic alignment from image for text-guided image inpainting
Yucheng Xie
Zehang Lin
Zhenguo Yang
Huan Deng
Xingcai Wu
Xudong Mao
Qing Li
Wenyin Liu
The Visual Computer, 2022, 38 : 3149 - 3161
[38] Target-Free Text-Guided Image Manipulation
Fan, Wan-Cyuan
Yang, Cheng-Fu
Yang, Chiao-An
Wang, Yu-Chiang Frank
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 1, 2023, : 588 - 596
[39] AMC: Adaptive Multi-expert Collaborative Network for Text-guided Image Retrieval
Zhu, Hongguang
Wei, Yunchao
Zhao, Yao
Zhang, Chunjie
Huang, Shujuan
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (06)
[40] FusionDeformer: text-guided mesh deformation using diffusion models
Xu, Hao
Wu, Yiqian
Tang, Xiangjun
Zhang, Jing
Zhang, Yang
Zhang, Zhebin
Li, Chen
Jin, Xiaogang
VISUAL COMPUTER, 2024, 40 (07) : 4701 - 4712
