Domain Adaptive Cross-Modal Image Retrieval via Modality and Domain Translations

被引:0
|
作者
Yanagi, Rintaro [1 ]
Togo, Ren [2 ]
Ogawa, Takahiro [3 ]
Haseyama, Miki [3 ]
机构
[1] Hokkaido Univ, Grad Sch Informat Sci & Technol, Sapporo, Hokkaido 0600814, Japan
[2] Hokkaido Univ, Educ & Res Ctr Math & Data Sci, Sapporo, Hokkaido 0600812, Japan
[3] Hokkaido Univ, Fac Informat Sci & Technol, Div Media & Network Technol, Sapporo, Hokkaido 0600814, Japan
关键词
cross-modal retrieval; text-to-image generative adversarial network; style transfer; domain adaptation;
D O I
10.1587/transfun.2020IMP0011
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Various cross-modal retrieval methods that can retrieve images related to a query sentence without text annotations have been proposed. Although a high level of retrieval performance is achieved by these methods, they have been developed for a single domain retrieval setting. When retrieval candidate images come from various domains, the retrieval performance of these methods might be decreased. To deal with this problem, we propose a new domain adaptive cross-modal retrieval method. By translating a modality and domains of a query and candidate images, our method can retrieve desired images accurately in a different domain retrieval setting. Experimental results for clipart and painting datasets showed that the proposed method has better retrieval performance than that of other conventional and state-of-the-art methods.
引用
收藏
页码:866 / 875
页数:10
相关论文
共 50 条
  • [21] Adaptive Adversarial Learning based cross-modal retrieval
    Li, Zhuoyi
    Lu, Huibin
    Fu, Hao
    Wang, Zhongrui
    Gu, Guanghun
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 123
  • [22] Texture BERT for Cross-modal Texture Image Retrieval
    Xu, Zelai
    Yu, Tan
    Li, Ping
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 4610 - 4614
  • [23] Online Cross-Modal Hashing for Web Image Retrieval
    Xie, Liang
    Shen, Jialie
    Zhu, Lei
    THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 294 - 300
  • [24] Cross-Modal Coherence for Text-to-Image Retrieval
    Alikhani, Malihe
    Han, Fangda
    Ravi, Hareesh
    Kapadia, Mubbasir
    Pavlovic, Vladimir
    Stone, Matthew
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 10427 - 10435
  • [25] Cross-modal Attribute Based Facial Image Retrieval
    Mali, Dasharath
    Biswas, Soma
    2015 FIFTH NATIONAL CONFERENCE ON COMPUTER VISION, PATTERN RECOGNITION, IMAGE PROCESSING AND GRAPHICS (NCVPRIPG), 2015,
  • [26] Database-adaptive Re-ranking for Enhancing Cross-modal Image Retrieval
    Yanagi, Rintaro
    Togo, Ren
    Ogawa, Takahiro
    Haseyama, Miki
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 3816 - 3825
  • [27] Modality-Dependent Cross-Modal Retrieval Based on Graph Regularization
    Wang, Guanhua
    Ji, Hua
    Kong, Dexin
    Zhang, Na
    MOBILE INFORMATION SYSTEMS, 2020, 2020
  • [28] MODALITY-SPECIFIC STRUCTURE PRESERVING HASHING FOR CROSS-MODAL RETRIEVAL
    Liu, Xingbo
    Nie, Xiushan
    Sun, Haoliang
    Cui, Chaoran
    Yin, Yilong
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 1678 - 1682
  • [29] Modality-specific matrix factorization hashing for cross-modal retrieval
    Xiong, Haixia
    Ou, Weihua
    Yan, Zengxian
    Gou, Jianping
    Zhou, Quan
    Wang, Anzhi
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2020, 13 (11) : 5067 - 5081
  • [30] Modality-specific matrix factorization hashing for cross-modal retrieval
    Haixia Xiong
    Weihua Ou
    Zengxian Yan
    Jianping Gou
    Quan Zhou
    Anzhi Wang
    Journal of Ambient Intelligence and Humanized Computing, 2022, 13 : 5067 - 5081