DEEP PRODUCT QUANTIZATION MODULE FOR EFFICIENT IMAGE RETRIEVAL

Authors
Liu, Meihan [1, 2, 3]
Dai, Yongxing [1, 2]
Bai, Yan [1, 2]
Duan, Ling-Yu [1, 2]
Affiliations
[1] Peking Univ, Inst Digital Media, Beijing, Peoples R China
[2] Peng Cheng Lab, Shenzhen, Peoples R China
[3] Peking Univ, SECE Shenzhen Grad Sch, Shenzhen, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Product Quantization; Hashing; Deep Learning;
DOI
10.1109/icassp40776.2020.9054175
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
Product Quantization (PQ) is one of the most popular Approximate Nearest Neighbor (ANN) methods for large-scale image retrieval, offering better performance than hashing-based methods. In recent years, several works have extended hard quantization to soft quantization with specially designed deep neural architectures. We propose a simple but effective deep Product Quantization Module (PQM) that jointly learns a discriminative codebook and precise hard assignments in an end-to-end manner. We use the straight-through estimator to make it feasible to directly optimize the discrete binary representations in deep neural networks with stochastic gradient descent. Unlike previous deep vector quantization methods, PQM is a plug-and-play module that can be adapted to various base networks for image search or compression. In addition, we propose a reconstruction loss to minimize the domain gap between the original embedding features and the codebook. Experimental results show that PQM outperforms state-of-the-art deep supervised hashing and quantization methods on several image retrieval benchmarks.
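The following is a minimal sketch of the ideas named in the abstract (hard assignment against per-subspace codebooks, a reconstruction loss, and a straight-through gradient), not the authors' implementation. All function names, the toy codebooks, and the choice of mean squared error as the reconstruction loss are assumptions made for illustration; the abstract does not specify them.

```python
from typing import List, Sequence, Tuple

Vector = List[float]
Codebook = List[Vector]  # the codewords of one subspace


def split_subvectors(x: Sequence[float], num_subspaces: int) -> List[Vector]:
    """Split x into contiguous, equal-length subvectors, one per subspace."""
    if len(x) % num_subspaces != 0:
        raise ValueError("dimension must be divisible by num_subspaces")
    step = len(x) // num_subspaces
    return [list(x[i * step:(i + 1) * step]) for i in range(num_subspaces)]


def squared_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((ai - bi) ** 2 for ai, bi in zip(a, b))


def pq_encode(x: Sequence[float], codebooks: Sequence[Codebook]) -> List[int]:
    """Hard assignment: index of the nearest codeword in each subspace."""
    subs = split_subvectors(x, len(codebooks))
    return [
        min(range(len(cb)), key=lambda k: squared_distance(sub, cb[k]))
        for sub, cb in zip(subs, codebooks)
    ]


def pq_decode(codes: Sequence[int], codebooks: Sequence[Codebook]) -> Vector:
    """Reconstruct a vector by concatenating the selected codewords."""
    out: Vector = []
    for k, cb in zip(codes, codebooks):
        out.extend(cb[k])
    return out


def reconstruction_loss(x: Sequence[float], x_hat: Sequence[float]) -> float:
    """Mean squared error between a feature and its quantized version
    (an assumed form; the abstract does not define the loss)."""
    return squared_distance(x, x_hat) / len(x)


def quantize_with_ste(
    x: Sequence[float],
    codebooks: Sequence[Codebook],
    grad_output: Sequence[float],
) -> Tuple[Vector, Vector]:
    """Forward pass returns the hard-quantized vector. The backward pass
    follows the straight-through idea: the non-differentiable argmin is
    treated as the identity, so the upstream gradient reaches x unchanged."""
    x_hat = pq_decode(pq_encode(x, codebooks), codebooks)
    grad_input = list(grad_output)
    return x_hat, grad_input


# Toy example: a 4-dimensional feature, 2 subspaces, 2 codewords each.
toy_codebooks = [
    [[0.0, 0.0], [1.0, 1.0]],
    [[0.0, 1.0], [1.0, 0.0]],
]
feature = [0.9, 0.8, 0.2, 0.7]
codes = pq_encode(feature, toy_codebooks)
quantized = pq_decode(codes, toy_codebooks)
print(codes, quantized, reconstruction_loss(feature, quantized))
```

In an automatic-differentiation framework the same straight-through behaviour is usually written as `x + stop_gradient(x_hat - x)`, which evaluates to `x_hat` in the forward pass while letting gradients flow to `x`.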
Pages: 4382-4386
Page count: 5