Object-stable unsupervised dual contrastive learning image-to-image translation with query-selected attention and convolutional block attention module

被引:0
|
作者
Oh, Yunseok [1 ,2 ]
Oh, Seonhye [1 ,3 ]
Noh, Sangwoo [4 ]
Kim, Hangyu [5 ]
Seo, Hyeon [1 ,6 ]
机构
[1] Gyeongsang Natl Univ, Dept AI Convergence Engn, Jinju Si, Gyeongsangnam D, South Korea
[2] Korea Res Inst Def Technol Planning & Adv, Precedent Study Team C4ISR Syst, Jinju Si 52851, Gyeongsangnam D, South Korea
[3] Korea Res Inst Def Technol Planning & Advancement, Guided & Firepower Syst Technol Planning Team, Jinju si, Gyeongsangnam D, South Korea
[4] Korea Res Inst Def Technol Planning & Advancement, Syst Technol Planning Team C4ISR, Jinju Si, Gyeongsangnam D, South Korea
[5] NAVER Cloud, Clova Speech, Seongnam Si, Gyeonggi Do, South Korea
[6] Gyeongsang Natl Univ, Dept Comp Sci, Jinju Si, Gyeongsangnam D, South Korea
来源
PLOS ONE | 2023年 / 18卷 / 11期
基金
新加坡国家研究基金会;
关键词
D O I
10.1371/journal.pone.0293885
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Recently, contrastive learning has gained popularity in the field of unsupervised image-to-image (I2I) translation. In a previous study, a query-selected attention (QS-Attn) module, which employed an attention matrix with a probability distribution, was used to maximize the mutual information between the source and translated images. This module selected significant queries using an entropy metric computed from the attention matrix. However, it often selected many queries with equal significance measures, leading to an excessive focus on the background. In this study, we proposed a dual-learning framework with QS-Attn and convolutional block attention module (CBAM) called object-stable dual contrastive learning generative adversarial network (OS-DCLGAN). In this paper, we utilize a CBAM, which learns what and where to emphasize or suppress, thereby refining intermediate features effectively. This CBAM was integrated before the QS-Attn module to capture significant domain information for I2I translation tasks. The proposed framework outperformed recently introduced approaches in various I2I translation tasks, showing its effectiveness and versatility. The code is available at https://github.com/RedPotatoChip/OSUDL
引用
收藏
页数:17
相关论文
共 45 条
  • [1] Dual Contrastive Learning for Unsupervised Image-to-Image Translation
    Han, Junlin
    Shoeiby, Mehrdad
    Petersson, Lars
    Armin, Mohammad Ali
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 746 - 755
  • [2] Contrastive learning for unsupervised image-to-image translation
    Lee, Hanbit
    Seol, Jinseok
    Lee, Sang-goo
    Park, Jaehui
    Shim, Junho
    [J]. APPLIED SOFT COMPUTING, 2024, 151
  • [3] Unsupervised Attention-guided Image-to-Image Translation
    Mejjati, Youssef A.
    Richardt, Christian
    Tompkin, James
    Cosker, Darren
    Kim, Kwang In
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [4] Truly Unsupervised Image-to-Image Translation with Contrastive Representation Learning
    Hong, Zhiwei
    Feng, Jianxing
    Jiang, Tao
    [J]. COMPUTER VISION - ACCV 2022, PT III, 2023, 13843 : 239 - 255
  • [5] Unsupervised Image-to-Image Translation with Self-Attention Networks
    Kang, Taewon
    Lee, Kwang Hee
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP 2020), 2020, : 102 - 108
  • [6] Multi-attention bidirectional contrastive learning method for unpaired image-to-image translation
    Yang, Benchen
    Liu, Xuzhao
    Li, Yize
    Jin, Haibo
    Qu, Yetian
    [J]. PLOS ONE, 2024, 19 (04):
  • [7] DualGAN: Unsupervised Dual Learning for Image-to-Image Translation
    Yi, Zili
    Zhang, Hao
    Tan, Ping
    Gong, Minglun
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2868 - 2876
  • [8] QS-Attn: Query-Selected Attention for Contrastive Learning in I2I Translation
    Hu, Xueqi
    Zhou, Xinyue
    Huang, Qiusheng
    Shi, Zhengyi
    Sun, Li
    Li, Qingli
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 18270 - 18279
  • [9] Unsupervised image-to-image translation by semantics consistency and self-attention
    Zhibin Zhang
    Wanli Xue
    Guokai Fu
    [J]. Optoelectronics Letters, 2022, 18 : 175 - 180
  • [10] Unsupervised image-to-image translation by semantics consistency and self-attention
    Zhang Zhibin
    Xue Wanli
    Fu Guokai
    [J]. OPTOELECTRONICS LETTERS, 2022, 18 (03) : 175 - 180