Deep hashing image retrieval based on hybrid neural network and optimized metric learning

被引:3
|
作者
Xiao, Xingming [1 ]
Cao, Shu [2 ]
Wang, Liejun [1 ]
Cheng, Shuli [1 ]
Yuan, Erdong [1 ]
机构
[1] Xinjiang Univ, Sch Comp Sci & Technol, Urumqi 830046, Peoples R China
[2] State Grid Xinjiang Elect Power Co, Informat & Commun Co, Urumqi 830063, Peoples R China
基金
美国国家科学基金会;
关键词
Image retrieval; Deep hashing; Vision transformer; New strengthened external attention; New loss;
D O I
10.1016/j.knosys.2023.111336
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
While transformers have indeed improved image retrieval accuracy in computer vision, challenges persist, including insufficient and imbalanced feature extraction and the inability to create compact binary codes. This study introduces a novel approach for image retrieval called Vision Transformer with Deep Hashing (VTDH), combining a hybrid neural network and optimized metric learning. Our work offers significant contributions, summarized as follows: We introduce an innovative Strengthened External Attention (NEA) module capable of simultaneous multi-scale feature focus and comprehensive global context assimilation. This enriches the model's comprehension of both overarching structure and semantics. Additionally, we propose a fresh balanced loss function to tackle the issue of imbalanced positive and negative samples within labels. Notably, this function employs sample labels as input, utilizing the mean value of all sample labels to quantify the frequency gap between positive and negative samples. This approach, combined with a customized balance weight, effectively addresses the challenge of label imbalance. Concurrently, we enhance the quantization loss function, intensifying its penalty for instances where the model's binary code output surpasses +/- 1. This reinforcement results in a more robust and stable hash code output. The proposed method is assessed on prominent datasets, including CIFAR-10, NUS-WIDE, and ImageNet. Experimental outcomes reveal superior retrieval accuracy compared to current state-of-the-art techniques. Notably, the VTDH model achieves an exceptional mean average precision (mAP) of 97.3% on the CIFAR-10 dataset.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] A Deep Neural Network Based Hashing for Efficient Image Retrieval
    Zhu, Siying
    Kang, Bong-Nam
    Kim, Daijin
    2016 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2016, : 2483 - 2488
  • [2] IMAGE RETRIEVAL BASED ON DEEP CONVOLUTIONAL NEURAL NETWORKS AND BINARY HASHING LEARNING
    Peng Tian-qiang
    Li Fang
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 1742 - 1746
  • [3] Image retrieval method based on metric learning for convolutional neural network
    Wang, Jieyuan
    Qian, Ying
    Ye, Qingqing
    Wang, Biao
    2017 2ND INTERNATIONAL SEMINAR ON ADVANCES IN MATERIALS SCIENCE AND ENGINEERING, 2017, 231
  • [4] Deep Hashing Network With Hybrid Attention and Adaptive Weighting for Image Retrieval
    Pei, Yingjiao
    Wang, Zhongyuan
    Li, Na
    Chen, Heling
    Huang, Baojin
    Tu, Weiping
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 4961 - 4973
  • [5] Ranking-Based Deep Hashing Network for Image Retrieval
    Zhang, Zhisheng
    Qu, Huaijing
    Xie, Ming
    Xu, Jia
    Wang, Jiwei
    Wei, Yanan
    IEEE ACCESS, 2022, 10 : 125334 - 125352
  • [6] DEEP LEARNING BASED SUPERVISED HASHING FOR EFFICIENT IMAGE RETRIEVAL
    Viet-Anh Nguyen
    Do, Minh N.
    2016 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO (ICME), 2016,
  • [7] Deep Residual Hashing Network for Image Retrieval
    Jimenez-Lepe, Edwin
    Mendez-Vazquez, Andres
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, PT II, 2017, 10614 : 780 - 781
  • [8] Binary Neural Network Hashing for Image Retrieval
    Zhang, Wanqian
    Wu, Dayan
    Zhou, Yu
    Li, Bo
    Wang, Weiping
    Meng, Dan
    SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2021, : 1318 - 1327
  • [9] Metric-Learning-Based Deep Hashing Network for Content-Based Retrieval of Remote Sensing Images
    Roy, Subhankar
    Sangineto, Enver
    Demir, Begum
    Sebe, Nicu
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2021, 18 (02) : 226 - 230
  • [10] Deep Learning Image Classification Based on Neural Network Optimized SVM
    Chen, Na
    Xiao, Aiping
    Zheng, Gang
    2018 7TH INTERNATIONAL CONFERENCE ON ADVANCED MATERIALS AND COMPUTER SCIENCE (ICAMCS 2018), 2019, : 329 - 333