TelecomNet: Tag-Based Weakly-Supervised Modally Cooperative Hashing Network for Image Retrieval

被引:17
|
作者
Zhao, Wei [1 ]
Xu, Cai [1 ]
Guan, Ziyu [1 ]
Wu, Xunlian [1 ]
Zhao, Wanqing [2 ]
Miao, Qiguang [3 ]
He, Xiaofei [4 ]
Wang, Quan [3 ]
机构
[1] Xidian Univ, Sch Comp Sci & Technol, State Key Lab Integrated Serv Networks, Xian 710071, Shaanxi, Peoples R China
[2] Northwestern Univ, Sch Informat & Technol, Xian 710127, Shaanxi, Peoples R China
[3] Xidian Univ, Sch Comp Sci & Technol, Xian 710071, Shaanxi, Peoples R China
[4] Zhejiang Univ, Coll Comp Sci, State Key Lab CAD&CG, Hangzhou 310058, Zhejiang, Peoples R China
基金
中国国家自然科学基金;
关键词
Semantics; Training; Tagging; Image retrieval; Correlation; Training data; Binary codes; multimedia retrieval; multi-modal learning; weakly-supervised learning; COMPLETION;
D O I
10.1109/TPAMI.2021.3114089
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We are concerned with using user-tagged images to learn proper hashing functions for image retrieval. The benefits are two-fold: (1) we could obtain abundant training data for deep hashing models; (2) tagging data possesses richer semantic information which could help better characterize similarity relationships between images. However, tagging data suffers from noises, vagueness and incompleteness. Different from previous unsupervised or supervised hashing learning, we propose a novel weakly-supervised deep hashing framework which consists of two stages: weakly-supervised pre-training and supervised fine-tuning. The second stage is as usual. In the first stage, we propose two formulations Tag-basEd weakLy-supErvised Modally COoperative hashing Network (TelecomNet) and Generalized TelecomNet (GTelecomNet). Rather than performing supervision on tags, TelecomNet first learns an observed semantic embedding vector for each image from attached tags and then uses it to guide hashing learning. GTelecomNet introduces a novel semantic network to exploit more precise semantic information. By carefully designing the optimization problem, they can well leverage tagging information and image content for hashing learning. The framework is general and does not depend on specific deep hashing methods. Empirical results on real world datasets show that they significantly increase the performance of state-of-the-art deep hashing methods.
引用
收藏
页码:7940 / 7954
页数:15
相关论文
共 50 条
  • [1] Tag-based Weakly-supervised Hashing for Image Retrieval
    Guan, Ziyu
    Xie, Fei
    Zhao, Wanqing
    Wang, Xiaopeng
    Chen, Long
    Zhao, Wei
    Peng, Jinye
    [J]. PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 3776 - 3782
  • [2] Weakly-supervised Semantic Guided Hashing for Social Image Retrieval
    Zechao Li
    Jinhui Tang
    Liyan Zhang
    Jian Yang
    [J]. International Journal of Computer Vision, 2020, 128 : 2265 - 2278
  • [3] Weakly-supervised Semantic Guided Hashing for Social Image Retrieval
    Li, Zechao
    Tang, Jinhui
    Zhang, Liyan
    Yang, Jian
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2020, 128 (8-9) : 2265 - 2278
  • [4] Deep Enhanced Weakly-Supervised Hashing With Iterative Tag Refinement
    Wang, Min
    Zhou, Wengang
    Tian, Qi
    Li, Houqiang
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 2779 - 2790
  • [5] Efficient weakly-supervised discrete hashing for large-scale social image retrieval
    Cui, Hui
    Zhu, Lei
    Cui, Chaoran
    Nie, Xiushan
    Zhang, Huaxiang
    [J]. PATTERN RECOGNITION LETTERS, 2020, 130 : 174 - 181
  • [6] Weakly-Supervised Deep Image Hashing based on Cross-Modal Transformer
    Yang, Ching-Ching
    Chu, Wei-Ta
    Dubey, Shiv Ram
    [J]. 2023 18TH INTERNATIONAL CONFERENCE ON MACHINE VISION AND APPLICATIONS, MVA, 2023,
  • [7] Weakly Supervised Deep Image Hashing through Tag Embeddings
    Gattupalli, Vijetha
    Zhuo, Yaoxin
    Li, Baoxin
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 10367 - 10376
  • [8] WEAKLY-SUPERVISED MOMENT RETRIEVAL NETWORK FOR VIDEO CORPUS MOMENT RETRIEVAL
    Yoon, Sunjae
    Kim, Dahyun
    Hong, Ji Woo
    Kim, Junyeong
    Kim, Kookhoi
    Yoo, Chang D.
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 534 - 538
  • [9] Weakly Supervised Multimodal Hashing for Scalable Social Image Retrieval
    Tang, Jinhui
    Li, Zechao
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2018, 28 (10) : 2730 - 2741
  • [10] WEAKLY SUPERVISED LOCALITY SENSITIVE HASHING FOR DUPLICATE IMAGE RETRIEVAL
    Cao, Yudong
    Zhang, Honggang
    Guo, Jun
    [J]. 2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011,