Scalable multi-label canonical correlation analysis for cross-modal retrieval

被引:14
|
作者
Shu, Xin [1 ,2 ]
Zhao, Guoying [2 ]
机构
[1] Nanjing Agr Univ, Coll Artificial Intelligence, 1 Wei Gang, Nanjing, Peoples R China
[2] Univ Oulu, Ctr Machine Vis & Signal Anal, Oulu, Finland
基金
芬兰科学院; 中国国家自然科学基金;
关键词
Canonical correlation analysis; Semantic transformation; Cross-modal retrieval; Singular value decomposition;
D O I
10.1016/j.patcog.2021.107905
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-label canonical correlation analysis (ml-CCA) has been developed for cross-modal retrieval. However, the computation of ml-CCA involves dense matrices eigendecomposition, which can be computationally expensive. In addition, ml-CCA only takes semantic correlation into account which ignores the cross-modal feature correlation. In this paper, we propose a novel framework to simultaneously integrate the semantic correlation and feature correlation for cross-modal retrieval. By using the semantic transformation, we show that our model can avoid computing the covariance matrix explicitly which is a huge save of computational cost. Further analysis shows that our proposed method can be solved via singular value decomposition which has linear time complexity. Experimental results on three multi-label datasets have demonstrated the accuracy and efficiency of our proposed method. ? 2021 Elsevier Ltd. All rights reserved.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Multi-view Multi-label Canonical Correlation Analysis for Cross-modal Matching and Retrieval
    Sanghavi, Rushil
    Verma, Yashaswi
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 4700 - 4709
  • [2] Semantically-enhanced kernel canonical correlation analysis: a multi-label cross-modal retrieval
    Yuhua Jia
    Liang Bai
    Shuang Liu
    Peng Wang
    Jinlin Guo
    Yuxiang Xie
    [J]. Multimedia Tools and Applications, 2019, 78 : 13169 - 13188
  • [3] Semantically-enhanced kernel canonical correlation analysis: a multi-label cross-modal retrieval
    Jia, Yuhua
    Bai, Liang
    Liu, Shuang
    Wang, Peng
    Guo, Jinlin
    Xie, Yuxiang
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (10) : 13169 - 13188
  • [4] Multi-Label Cross-modal Retrieval
    Ranjan, Viresh
    Rasiwasia, Nikhil
    Jawahar, C. V.
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 4094 - 4102
  • [5] Graph Convolutional Multi-Label Hashing for Cross-Modal Retrieval
    Shen, Xiaobo
    Chen, Yinfan
    Liu, Weiwei
    Zheng, Yuhui
    Sun, Quan-Sen
    Pan, Shirui
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [6] Hypergraph clustering based multi-label cross-modal retrieval
    Guo, Shengtang
    Zhang, Huaxiang
    Liu, Li
    Liu, Dongmei
    Lu, Xu
    Li, Liujian
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 103
  • [7] DEEP PAIRWISE RANKING WITH MULTI-LABEL INFORMATION FOR CROSS-MODAL RETRIEVAL
    Jian, Yangwo
    Xiao, Jing
    Cao, Yang
    Khan, Asad
    Zhu, Jia
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 1810 - 1815
  • [8] Multi-label double-layer learning for cross-modal retrieval
    He, Jianfeng
    Ma, Bingpeng
    Wang, Shuhui
    Liu, Yugui
    Huang, Qingming
    [J]. NEUROCOMPUTING, 2018, 275 : 1893 - 1902
  • [9] Multi-label adversarial fine-grained cross-modal retrieval
    Sun, Chunpu
    Zhang, Huaxiang
    Liu, Li
    Liu, Dongmei
    Wang, Lin
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2023, 117
  • [10] Adaptive multi-label structure preserving network for cross-modal retrieval
    Zhu, Jie
    Zhang, Hui
    Chen, Junfen
    Xie, Bojun
    Liu, Jianan
    Zhang, Junsan
    [J]. INFORMATION SCIENCES, 2024, 682