Multi-view Collective Tensor Decomposition for Cross-modal Hashing

被引:5
|
作者
Cui, Limeng [1 ]
Chen, Zhensong [2 ]
Zhang, Jiawei [3 ]
He, Lifang [4 ]
Shi, Yong [5 ]
Yu, Philip S. [6 ]
机构
[1] Univ Chinese Acad Sci, Sch Comp & Control Engn, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Sch Econ & Management, Beijing, Peoples R China
[3] Florida State Univ, Dept Comp Sci, IFM Lab, Tallahassee, FL 32306 USA
[4] Cornell Univ, Weill Cornell Med, New York, NY 10021 USA
[5] Chinese Acad Sci, Key Lab Big Data Min & Knowledge Management, Beijing, Peoples R China
[6] Univ Illinois, Dept Comp Sci, Chicago, IL USA
基金
中国国家自然科学基金; 美国国家科学基金会;
关键词
Cross-modal hashing; tensor factorization; metric learning; multiview learning; CODES;
D O I
10.1145/3206025.3206065
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multimedia data available in various disciplines are usually heterogeneous, containing representations in multi-views, where the cross-modal search techniques become necessary and useful. It is a challenging problem due to the heterogeneity of data with multiple modalities, multi-views in each modality and the diverse data categories. In this paper, we propose a novel multi-view cross-modal hashing method named Multi-view Collective Tensor Decomposition ( MCTD) to fuse these data effectively, which can exploit the complementary feature extracted from multi-modality multi-view while simultaneously discovering multiple separated subspaces by leveraging the data categories as supervision information. Our contributions are summarized as follows: 1) we exploit tensor modeling to get better representation of the complementary features and redefine a latent representation space; 2) a block-diagonal loss is proposed to explicitly pursue a more discriminative latent tensor space by exploring supervision information; 3) we propose a new feature projection method to characterize the data and to generate the latent representation for incoming new queries. An optimization algorithm is proposed to solve the objective function designed for MCTD, which works under an iterative updating procedure. Experimental results prove the state-of-the-art precision of MCTD compared with competing methods.
引用
收藏
页码:73 / 81
页数:9
相关论文
共 50 条
  • [1] Multi-view collective tensor decomposition for cross-modal hashing
    Limeng Cui
    Jiawei Zhang
    Lifang He
    Philip S. Yu
    [J]. International Journal of Multimedia Information Retrieval, 2019, 8 : 47 - 59
  • [2] Multi-view collective tensor decomposition for cross-modal hashing
    Cui, Limeng
    Zhang, Jiawei
    He, Lifang
    Yu, Philip S.
    [J]. INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2019, 8 (01) : 47 - 59
  • [3] ROBUST MULTI-VIEW HASHING FOR CROSS-MODAL RETRIEVAL
    Wang, Haitao
    Chen, Hui
    Meng, Min
    Wu, JiGang
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 1012 - 1017
  • [4] Semi-supervised cross-modal hashing with multi-view graph representation
    Shen, Xiao
    Zhang, Haofeng
    Li, Lunbo
    Yang, Wankou
    Liu, Li
    [J]. INFORMATION SCIENCES, 2022, 604 : 45 - 60
  • [5] Learning discriminative hashing codes for cross-modal retrieval based on multi-view features
    Jun Yu
    Xiao-Jun Wu
    Josef Kittler
    [J]. Pattern Analysis and Applications, 2020, 23 : 1421 - 1438
  • [6] Learning discriminative hashing codes for cross-modal retrieval based on multi-view features
    Yu, Jun
    Wu, Xiao-Jun
    Kittler, Josef
    [J]. PATTERN ANALYSIS AND APPLICATIONS, 2020, 23 (03) : 1421 - 1438
  • [7] MULTI-VIEW FUSION THROUGH CROSS-MODAL RETRIEVAL
    Cui, Limeng
    Chen, Zhensong
    Zhang, Jiawei
    He, Lifang
    Shi, Yong
    Yu, Philip S.
    [J]. 2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 1977 - 1981
  • [8] Collective Reconstructive Embeddings for Cross-Modal Hashing
    Hu, Mengqiu
    Yang, Yang
    Shen, Fumin
    Xie, Ning
    Hong, Richang
    Shen, Heng Tao
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (06) : 2770 - 2784
  • [9] MVItem: A Benchmark for Multi-View Cross-Modal Item Retrieval
    Li, Bo
    Zhu, Jiansheng
    Dai, Linlin
    Jing, Hui
    Huang, Zhizheng
    Sui, Yuteng
    [J]. IEEE ACCESS, 2024, 12 : 119563 - 119576
  • [10] Collective Affinity Learning for Partial Cross-Modal Hashing
    Guo, Jun
    Zhu, Wenwu
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 1344 - 1355