Semi-supervised Coupled Dictionary Learning for Cross-modal Retrieval in Internet Images and Texts

Cited by: 22
Authors
Xu, Xing [1]
Yang, Yang [2]
Shimada, Atsushi [1]
Taniguchi, Rin-ichiro [1]
He, Li [3]
Affiliations
[1] Kyushu Univ, Fukuoka, Japan
[2] Univ Elect Sci & Technol China, Chengdu, Peoples R China
[3] Qualcomm R&D Ctr, San Diego, CA USA
Keywords
Cross-modal Retrieval; Semi-supervised learning; Coupled Dictionary Learning; SPACE;
DOI
10.1145/2733373.2806346
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Massive amounts of images and texts are now emerging on the Internet, creating demand for effective cross-modal retrieval. To eliminate the heterogeneity between the image and text modalities, existing subspace learning methods try to learn a common latent subspace in which cross-modal matching can be performed. However, these methods usually require fully paired samples (images with corresponding texts) and ignore the class label information that accompanies the paired samples. Indeed, class label information can reduce the semantic gap between different modalities and explicitly guide the subspace learning procedure. In addition, the large quantities of unpaired samples (images or texts) may provide useful side information to enrich the representations from the learned subspace. Thus, in this paper we propose a novel model for the cross-modal retrieval problem. It consists of 1) a semi-supervised coupled dictionary learning step that generates homogeneously sparse representations for different modalities based on both paired and unpaired samples; and 2) a coupled feature mapping step that projects the sparse representations of different modalities into a common subspace defined by class label information to perform cross-modal matching. Experiments on the large-scale web image dataset MIRFlickr-1M, under both fully paired and unpaired settings, show the effectiveness of the proposed model on the cross-modal retrieval task.
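The two-step pipeline described in the abstract can be illustrated with a rough NumPy sketch. This is not the authors' formulation: the abstract does not state the objective function, so everything below is an assumption made for illustration. The dictionaries `D_img` and `D_txt` are fixed random matrices (the coupled, semi-supervised dictionary learning itself is omitted), sparse coding uses plain ISTA, the "coupled feature mapping" is replaced by two independent ridge regressions onto one-hot class labels, and the data are synthetic. All function and variable names are hypothetical.

```python
import numpy as np

def soft_threshold(x, t):
    # Proximal operator of the L1 norm.
    return np.sign(x) * np.maximum(np.abs(x) - t, 0.0)

def sparse_codes(X, D, lam=0.1, n_iter=100):
    # ISTA for min_A 0.5 * ||X - D A||_F^2 + lam * ||A||_1.
    # X: (d, n) features, D: (d, k) dictionary, returns A: (k, n).
    step = 1.0 / (np.linalg.norm(D, 2) ** 2 + 1e-12)
    A = np.zeros((D.shape[1], X.shape[1]))
    for _ in range(n_iter):
        grad = D.T @ (D @ A - X)
        A = soft_threshold(A - step * grad, lam * step)
    return A

def ridge_map(A, Y, reg=1e-2):
    # W minimizing ||W A - Y||_F^2 + reg * ||W||_F^2, with W: (c, k).
    M = A @ A.T + reg * np.eye(A.shape[0])
    return np.linalg.solve(M, A @ Y.T).T

def rank_by_cosine(query, gallery):
    # Indices of gallery columns sorted by decreasing cosine similarity.
    q = query / (np.linalg.norm(query) + 1e-12)
    G = gallery / (np.linalg.norm(gallery, axis=0, keepdims=True) + 1e-12)
    return np.argsort(-(q @ G))

# Synthetic paired data (assumption): class-dependent means plus noise.
rng = np.random.default_rng(0)
n, c, k = 60, 3, 16
labels = rng.integers(0, c, size=n)
Y = np.eye(c)[labels].T                                  # (c, n) one-hot labels
X_img = rng.normal(size=(20, c))[:, labels] + 0.1 * rng.normal(size=(20, n))
X_txt = rng.normal(size=(10, c))[:, labels] + 0.1 * rng.normal(size=(10, n))

# Step 1 (simplified): sparse codes against fixed random dictionaries.
D_img = rng.normal(size=(20, k))
D_img /= np.linalg.norm(D_img, axis=0)
D_txt = rng.normal(size=(10, k))
D_txt /= np.linalg.norm(D_txt, axis=0)
A_img = sparse_codes(X_img, D_img)
A_txt = sparse_codes(X_txt, D_txt)

# Step 2 (simplified): map both code sets into the label-defined space.
W_img = ridge_map(A_img, Y)
W_txt = ridge_map(A_txt, Y)
P_img = W_img @ A_img                                    # (c, n)
P_txt = W_txt @ A_txt                                    # (c, n)

# Image-to-text retrieval: rank all texts for the first image.
ranking = rank_by_cosine(P_img[:, 0], P_txt)
```

Because both modalities end up in the same `c`-dimensional label space, an image query can be compared with text items directly. A faithful implementation would learn the two dictionaries jointly from paired and unpaired samples, which this sketch leaves out.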
Pages: 847-850
Page count: 4