Adaptive multi-label structure preserving network for cross-modal retrieval

被引:0
|
作者
Zhu, Jie [1 ]
Zhang, Hui [1 ]
Chen, Junfen [1 ]
Xie, Bojun [1 ]
Liu, Jianan [2 ]
Zhang, Junsan [3 ]
机构
[1] Hebei Univ, Coll Math & Informat Sci, Key Lab Machine Learning & Computat Intelligence, Baoding 071002, Peoples R China
[2] Hebei Univ Architecture, Informat Engn Coll, Zhangjiakou 075000, Peoples R China
[3] China Univ Petr East China, Coll Comp Sci & Technol, Qingdao 266580, Peoples R China
基金
中国国家自然科学基金;
关键词
Graph convolutional networks; Multi-label graph; Multi-label correlation matrices; Group GCN;
D O I
10.1016/j.ins.2024.121279
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To integrate the label structures, which describe semantic correlations among labels, into the learned common representations, many existing methods leverage the label embeddings learned according to label structures, to map the data of different modalities into a common representation space. However, these methods cannot fully discover the semantic correlation between labels. In this paper, we propose an Adaptive Multi-label Structure Preserving Network (AMLSPN) to dynamically learn the multi-label correlations and multi-label embeddings for learning common representations, which can preserve the label structures. Our method introduces a series of multi- label correlation matrices to capture the structures of multi-label nodes in the multi-label graph. Moreover, we present a novel hierarchical correlation loss to supervise the learning process of these multi-label correlation matrices. Additionally, we introduce a group GCN to enhance the training speed of our model. Extensive evaluations on three benchmark datasets demonstrate that our proposed AMLSPN outperforms the state-of-the-art methods.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Multi-Label Cross-modal Retrieval
    Ranjan, Viresh
    Rasiwasia, Nikhil
    Jawahar, C. V.
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 4094 - 4102
  • [2] Deep Ranking Distribution Preserving Hashing for Robust Multi-Label Cross-Modal Retrieval
    Song, Ge
    Huang, Kai
    Su, Hanwen
    Song, Fengyi
    Yang, Ming
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 7027 - 7042
  • [3] Graph Convolutional Multi-Label Hashing for Cross-Modal Retrieval
    Shen, Xiaobo
    Chen, Yinfan
    Liu, Weiwei
    Zheng, Yuhui
    Sun, Quan-Sen
    Pan, Shirui
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [4] Hypergraph clustering based multi-label cross-modal retrieval
    Guo, Shengtang
    Zhang, Huaxiang
    Liu, Li
    Liu, Dongmei
    Lu, Xu
    Li, Liujian
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 103
  • [5] Multi-label semantics preserving based deep cross-modal hashing
    Zou, Xitao
    Wang, Xinzhi
    Bakker, Erwin M.
    Wu, Song
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2021, 93
  • [6] DEEP PAIRWISE RANKING WITH MULTI-LABEL INFORMATION FOR CROSS-MODAL RETRIEVAL
    Jian, Yangwo
    Xiao, Jing
    Cao, Yang
    Khan, Asad
    Zhu, Jia
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 1810 - 1815
  • [7] Multi-label double-layer learning for cross-modal retrieval
    He, Jianfeng
    Ma, Bingpeng
    Wang, Shuhui
    Liu, Yugui
    Huang, Qingming
    [J]. NEUROCOMPUTING, 2018, 275 : 1893 - 1902
  • [8] Multi-label adversarial fine-grained cross-modal retrieval
    Sun, Chunpu
    Zhang, Huaxiang
    Liu, Li
    Liu, Dongmei
    Wang, Lin
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2023, 117
  • [9] Scalable multi-label canonical correlation analysis for cross-modal retrieval
    Shu, Xin
    Zhao, Guoying
    [J]. PATTERN RECOGNITION, 2021, 115
  • [10] Deep robust multilevel semantic hashing for multi-label cross-modal retrieval
    Song, Ge
    Tan, Xiaoyang
    Zhao, Jun
    Yang, Ming
    [J]. PATTERN RECOGNITION, 2021, 120