Text-Guided Cross-Position Attention for Segmentation: Case of Medical Image

被引:1
|
作者
Lee, Go-Eun [1 ]
Kim, Seon Ho [2 ]
Cho, Jungchan [3 ]
Choi, Sang Tae [4 ]
Choi, Sang-Il [1 ]
机构
[1] Dankook Univ, Yongin, Gyeonggi Do, South Korea
[2] Univ Southern Calif, Los Angeles, CA 90007 USA
[3] Gachon Univ, Seongnam, Gyeonggi Do, South Korea
[4] Chung Ang Univ, Coll Med, Seoul, South Korea
基金
新加坡国家研究基金会;
关键词
Image Segmentation; Multi Modal Learning; Cross Position Attention; Text-Guided Attention; Medical Image; TRANSFORMER;
D O I
10.1007/978-3-031-43904-9_52
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a novel text-guided cross-position attention module which aims at applying a multi-modality of text and image to position attention in medical image segmentation. To match the dimension of the text feature to that of the image feature map, we multiply learnable parameters by text features and combine the multi-modal semantics via cross-attention. It allows a model to learn the dependency between various characteristics of text and image. Our proposed model demonstrates superior performance compared to other medical models using image-only data or image-text data. Furthermore, we utilize our module as a region of interest (RoI) generator to classify the inflammation of the sacroiliac joints. The RoIs obtained from the model contribute to improve the performance of classification models.
引用
下载
收藏
页码:537 / 546
页数:10
相关论文
共 50 条
  • [41] DCACNet: Dual context aggregation and attention-guided cross deconvolution network for medical image segmentation
    Lu, Hongchun
    Tian, Shengwei
    Yu, Long
    Liu, Lu
    Cheng, Junlong
    Wu, Weidong
    Kang, Xiaojing
    Zhang, Dezhi
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2022, 214
  • [42] TolerantGAN: Text-Guided Image Manipulation Tolerant to Real-World Image
    Watanabe, Yuto
    Togo, Ren
    Maeda, Keisuke
    Ogawa, Takahiro
    Haseyama, Miki
    IEEE OPEN JOURNAL OF SIGNAL PROCESSING, 2024, 5 : 150 - 159
  • [43] Few Shot Medical Image Segmentation with Cross Attention Transformer
    Lin, Yi
    Chen, Yufan
    Cheng, Kwang-Ting
    Chen, Hao
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT II, 2023, 14221 : 233 - 243
  • [44] TIC: text-guided image colorization using conditional generative model
    Ghosh, Subhankar
    Roy, Prasun
    Bhattacharya, Saumik
    Pal, Umapada
    Blumenstein, Michael
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (14) : 41121 - 41136
  • [45] TIC: text-guided image colorization using conditional generative model
    Subhankar Ghosh
    Prasun Roy
    Saumik Bhattacharya
    Umapada Pal
    Michael Blumenstein
    Multimedia Tools and Applications, 2024, 83 : 41121 - 41136
  • [46] Imagen Editor and EditBench: Advancing and Evaluating Text-Guided Image Inpainting
    Wang, Su
    Saharia, Chitwan
    Montgomery, Ceslee
    Pont-Tuset, Jordi
    Noy, Shai
    Pellegrini, Stefano
    Onoe, Yasumasa
    Laszlo, Sarah
    Fleet, David J.
    Soricut, Radu
    Baldridge, Jason
    Norouzi, Mohammad
    Anderson, Peter
    Chan, William
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 18359 - 18369
  • [47] CycleNet: Rethinking Cycle Consistency in Text-Guided Diffusion for Image Manipulation
    Xu, Sihan
    Ma, Ziqiao
    Huang, Yidong
    Lee, Honglak
    Chai, Joyce
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [48] Where you edit is what you get: Text-guided image editing with region-based attention
    Xiao, Changming
    Yang, Qi
    Xu, Xiaoqiang
    Zhang, Jianwei
    Zhou, Feng
    Zhang, Changshui
    PATTERN RECOGNITION, 2023, 139
  • [49] Text Segmentation by Cross Segment Attention
    Lukasik, Michal
    Dadachev, Boris
    Papineni, Kishore
    Simoes, Goncalo
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 4707 - 4716
  • [50] MISL: Multi-grained image-text semantic learning for text-guided image inpainting
    Wu, Xingcai
    Zhao, Kejun
    Huang, Qianding
    Wang, Qi
    Yang, Zhenguo
    Hao, Gefei
    PATTERN RECOGNITION, 2024, 145