Domain Adaptive Cross-Modal Image Retrieval via Modality and Domain Translations

被引:0
|
作者
Yanagi, Rintaro [1 ]
Togo, Ren [2 ]
Ogawa, Takahiro [3 ]
Haseyama, Miki [3 ]
机构
[1] Hokkaido Univ, Grad Sch Informat Sci & Technol, Sapporo, Hokkaido 0600814, Japan
[2] Hokkaido Univ, Educ & Res Ctr Math & Data Sci, Sapporo, Hokkaido 0600812, Japan
[3] Hokkaido Univ, Fac Informat Sci & Technol, Div Media & Network Technol, Sapporo, Hokkaido 0600814, Japan
关键词
cross-modal retrieval; text-to-image generative adversarial network; style transfer; domain adaptation;
D O I
10.1587/transfun.2020IMP0011
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Various cross-modal retrieval methods that can retrieve images related to a query sentence without text annotations have been proposed. Although a high level of retrieval performance is achieved by these methods, they have been developed for a single domain retrieval setting. When retrieval candidate images come from various domains, the retrieval performance of these methods might be decreased. To deal with this problem, we propose a new domain adaptive cross-modal retrieval method. By translating a modality and domains of a query and candidate images, our method can retrieve desired images accurately in a different domain retrieval setting. Experimental results for clipart and painting datasets showed that the proposed method has better retrieval performance than that of other conventional and state-of-the-art methods.
引用
收藏
页码:866 / 875
页数:10
相关论文
共 50 条
  • [1] Cross-Domain Image Captioning via Cross-Modal Retrieval and Model Adaptation
    Zhao, Wentian
    Wu, Xinxiao
    Luo, Jiebo
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 1180 - 1192
  • [2] Adaptive Cross-Modal Prototypes for Cross-Domain Visual-Language Retrieval
    Liu, Yang
    Chen, Qingchao
    Albanie, Samuel
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 14949 - 14959
  • [3] Multi-level Alignment Network for Domain Adaptive Cross-modal Retrieval
    Dong, Jianfeng
    Long, Zhongzi
    Mao, Xiaofeng
    Lin, Changting
    He, Yuan
    Ji, Shouling
    NEUROCOMPUTING, 2021, 440 : 207 - 219
  • [4] Domain Invariant Subspace Learning for Cross-Modal Retrieval
    Liu, Chenlu
    Xu, Xing
    Yang, Yang
    Lu, Huimin
    Shen, Fumin
    Ji, Yanli
    MULTIMEDIA MODELING, MMM 2018, PT II, 2018, 10705 : 94 - 105
  • [5] Modality-specific Adaptive Scaling Method for Cross-modal Retrieval
    Chen, Baitao
    Ke, Xiao
    2022 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, COMPUTER VISION AND MACHINE LEARNING (ICICML), 2022, : 202 - 205
  • [6] Cross-Domain Transfer Hashing for Efficient Cross-Modal Retrieval
    Li, Fengling
    Wang, Bowen
    Zhu, Lei
    Li, Jingjing
    Zhang, Zheng
    Chang, Xiaojun
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (10) : 9664 - 9677
  • [7] Image-Text Cross-Modal Retrieval via Modality-Specific Feature Learning
    Wang, Jian
    He, Yonghao
    Kang, Cuicui
    Xiang, Shiming
    Pan, Chunhong
    ICMR'15: PROCEEDINGS OF THE 2015 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2015, : 347 - 354
  • [8] CHEF: Cross-Modal Hierarchical Embeddings for Food Domain Retrieval
    Pham, Hai X.
    Guerrero, Ricardo
    Li, Jiatong
    Pavlovic, Vladimir
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 2423 - 2430
  • [9] Cross-modal domain adaptation for text-based regularization of image semantics in image retrieval systems
    Pereira, Jose Costa
    Vasconcelos, Nuno
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2014, 124 : 123 - 135
  • [10] Modality-specific adaptive scaling and attention network for cross-modal retrieval
    Ke, Xiao
    Chen, Baitao
    Cai, Yuhang
    Liu, Hao
    Guo, Wenzhong
    Chen, Weibin
    NEUROCOMPUTING, 2025, 612