Weakly-Supervised Image Hashing through Masked Visual-Semantic Graph-based Reasoning

被引:23
|
作者
Jin, Lu [1 ]
Li, Zechao [1 ]
Pan, Yonghua [1 ]
Tang, Jinhui [1 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing, Peoples R China
基金
中国国家自然科学基金;
关键词
Noisy tags; graph attention network; image hashing; NEURAL-NETWORKS; RETRIEVAL;
D O I
10.1145/3394171.3414022
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the popularization of social websites, many methods have been proposed to explore the noisy tags for weakly-supervised image hashing.The main challenge lies in learning appropriate and sufficient information from those noisy tags. To address this issue, this work proposes a novel Masked visual-semantic Graph-based Reasoning Network, termed as MGRN, to learn joint visual-semantic representations for image hashing. Specifically, for each image, MGRN constructs a relation graph to capture the interactions among its associated tags and performs reasoning with Graph Attention Networks (GAT). MGRN randomly masks out one tag and then make GAT to predict this masked tag. This forces the GAT model to capture the dependence between the image and its associated tags, which can well address the problem of noisy tags. Thus it can capture key tags and visual structures from images to learn well-aligned visual-semantic representations. Finally, the auto-encoders is leveraged to learn hash codes that can preserve the local structure of the joint space. Meanwhile, the joint visual-semantic representations are reconstructed from those hash codes by using a decoder. Experimental results on two widely-used benchmark datasets demonstrate the superiority of the proposed method for image retrieval compared with several state-of-the-art methods.
引用
收藏
页码:916 / 924
页数:9
相关论文
共 50 条
  • [1] Weakly-supervised Semantic Guided Hashing for Social Image Retrieval
    Zechao Li
    Jinhui Tang
    Liyan Zhang
    Jian Yang
    [J]. International Journal of Computer Vision, 2020, 128 : 2265 - 2278
  • [2] Weakly-supervised Semantic Guided Hashing for Social Image Retrieval
    Li, Zechao
    Tang, Jinhui
    Zhang, Liyan
    Yang, Jian
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2020, 128 (8-9) : 2265 - 2278
  • [3] Semantic Graph Construction for Weakly-Supervised Image Parsing
    Xie, Wenxuan
    Peng, Yuxin
    Xiao, Jianguo
    [J]. PROCEEDINGS OF THE TWENTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2014, : 2853 - 2859
  • [4] Graph-based supervised discrete image hashing
    Guan, Jian
    Li, Yifan
    Sun, Jianguo
    Wang, Xuan
    Zhao, Hainan
    Zhang, Jiajia
    Liu, Zechao
    Qi, Shuhan
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 58 : 675 - 687
  • [5] Tag-based Weakly-supervised Hashing for Image Retrieval
    Guan, Ziyu
    Xie, Fei
    Zhao, Wanqing
    Wang, Xiaopeng
    Chen, Long
    Zhao, Wei
    Peng, Jinye
    [J]. PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 3776 - 3782
  • [6] Graph-Based Visual-Semantic Entanglement Network for Zero-Shot Image Recognition
    Hu, Yang
    Wen, Guihua
    Chapman, Adriane
    Yang, Pei
    Luo, Mingnan
    Xu, Yingxue
    Dai, Dan
    Hall, Wendy
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 2473 - 2487
  • [7] Visual-Semantic Graph Reasoning for Pedestrian Attribute Recognition
    Li, Qiaozhe
    Zhao, Xin
    He, Ran
    Huang, Kaiqi
    [J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 8634 - 8641
  • [8] Weakly-Supervised Deep Image Hashing based on Cross-Modal Transformer
    Yang, Ching-Ching
    Chu, Wei-Ta
    Dubey, Shiv Ram
    [J]. 2023 18TH INTERNATIONAL CONFERENCE ON MACHINE VISION AND APPLICATIONS, MVA, 2023,
  • [9] Learning Visual Words for Weakly-Supervised Semantic Segmentation
    Ru, Lixiang
    Du, Bo
    Wu, Chen
    [J]. PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 982 - 988
  • [10] Weakly-Supervised Image Semantic Segmentation Based on Superpixel Region Merging
    Jiang, Quanchun
    Tawose, Olamide Timothy
    Pei, Songwen
    Chen, Xiaodong
    Jiang, Linhua
    Wang, Jiayao
    Zhao, Dongfang
    [J]. BIG DATA AND COGNITIVE COMPUTING, 2019, 3 (02) : 1 - 20