Deep Metric Multi-View Hashing for Multimedia Retrieval

被引:1
|
作者
Zhu, Jian [1 ]
Ruan, Xiaohu [2 ]
Cheng, Yongli [3 ]
Huang, Zhangmin [1 ]
Cui, Yu [1 ]
Zeng, Lingfang [1 ]
机构
[1] Zhejiang Lab, Hangzhou, Zhejiang, Peoples R China
[2] Vivo AI Lab, Hangzhou, Zhejiang, Peoples R China
[3] Fuzhou Univ, Fuzhou, Fujian, Peoples R China
基金
国家重点研发计划;
关键词
Multi-view hash; Multi-modal hash; Deep metric learning; Multimedia retrieval;
D O I
10.1109/ICME55011.2023.00335
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning the hash representation of multi-view heterogeneous data is an important task in multimedia retrieval. However, existing methods fail to effectively fuse the multi-view features and utilize the metric information provided by the dissimilar samples, leading to limited retrieval precision. Current methods utilize weighted sum or concatenation to fuse the multi-view features. We argue that these fusion methods cannot capture the interaction among different views. Furthermore, these methods ignored the information provided by the dissimilar samples. We propose a novel deep metric multi-view hashing (DMMVH) method to address the mentioned problems. Extensive empirical evidence is presented to show that gate-based fusion is better than typical methods. We introduce deep metric learning to the multi-view hashing problems, which can utilize metric information of dissimilar samples. On the MIR-Flickr25K, MS COCO, and NUS-WIDE, our method outperforms the current state-of-the-art methods by a large margin (up to 15.28 mean Average Precision (mAP) improvement).
引用
收藏
页码:1955 / 1960
页数:6
相关论文
共 50 条
  • [1] Fast metric multi-view hashing for multimedia retrieval
    Zhu, Jian
    Hu, Pengbo
    Li, Bingqian
    Zhou, Yi
    [J]. INFORMATION FUSION, 2024, 103
  • [2] Deep Fusion Multi-View Hashing for Multimedia Retrieval
    Zhu, Jian
    Sheng, Mingkai
    Ke, Mingda
    Cheng, Wen
    Zhang, Liying
    Zeng, Lingfang
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON KNOWLEDGE GRAPH, ICKG, 2023, : 91 - 99
  • [3] Central Similarity Multi-view Hashing for Multimedia Retrieval
    Zhu, Jian
    Cheng, Wen
    Cui, Yu
    Tang, Chang
    Dai, Yuyang
    Li, Yong
    Zeng, Lingfang
    [J]. WEB AND BIG DATA, PT II, APWEB-WAIM 2023, 2024, 14332 : 486 - 500
  • [4] Boosted Curriculum Multi-View Hashing for Multimedia Retrieval
    Zhu, Jian
    Huang, Zhangmin
    Liu, Lei
    Tang, Chang
    Dai, Li-Rong
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 2065 - 2069
  • [5] Deep Multi-View Enhancement Hashing for Image Retrieval
    Yan, Chenggang
    Gong, Biao
    Wei, Yuxuan
    Gao, Yue
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (04) : 1445 - 1451
  • [6] Multi-view Latent Hashing for Efficient Multimedia Search
    Shen, Xiaobo
    Shen, Fumin
    Sun, Quan-Sen
    Yuan, Yun-Hao
    [J]. MM'15: PROCEEDINGS OF THE 2015 ACM MULTIMEDIA CONFERENCE, 2015, : 831 - 834
  • [7] Dynamic Multi-View Hashing for Online Image Retrieval
    Xie, Liang
    Shen, Jialie
    Han, Jungong
    Zhu, Lei
    Shao, Ling
    [J]. PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3133 - 3139
  • [8] Discrete Multi-view Hashing for Effective Image Retrieval
    Yang, Rui
    Shi, Yuliang
    Xu, Xin-Shun
    [J]. PROCEEDINGS OF THE 2017 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR'17), 2017, : 180 - 188
  • [9] Efficient fine-texture image retrieval using deep multi-view hashing
    Xiang, Jun
    Zhang, Ning
    Pan, Ruru
    Gao, Weidong
    [J]. COMPUTERS & GRAPHICS-UK, 2021, 101 : 93 - 105
  • [10] ROBUST MULTI-VIEW HASHING FOR CROSS-MODAL RETRIEVAL
    Wang, Haitao
    Chen, Hui
    Meng, Min
    Wu, JiGang
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 1012 - 1017