Efficient end-to-end learning for quantizable representations

被引:0
|
作者
Jeong, Yeonwoo [1 ]
Song, Hyun Oh [1 ]
机构
[1] Seoul Natl Univ, Dept Comp Sci & Engn, Seoul, South Korea
基金
新加坡国家研究基金会;
关键词
IMPLEMENTATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Embedding representation learning via neural networks is at the core foundation of modern similarity based search. While much effort has been put in developing algorithms for learning binary hamming code representations for search efficiency, this still requires a linear scan of the entire dataset per each query and trades off the search accuracy through binarization. To this end, we consider the problem of directly learning a quantizable embedding representation and the sparse binary hash code end-to-end which can be used to construct an efficient hash table not only providing significant search reduction in the number of data but also achieving the state of the art search accuracy outperforming previous state of the art deep metric learning methods. We also show that finding the optimal sparse binary hash code in a mini-batch can be computed exactly in polynomial time by solving a minimum cost flow problem. Our results on Cifar-100 and on ImageNet datasets show the state of the art search accuracy in precision@k and NMI metrics while providing up to 98x and 478x search speedup respectively over exhaustive linear search. The source code is available at https://github.com/maestrojeong/Deep-Hash-Table-ICML 18.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Putting An End to End-to-End: Gradient-Isolated Learning of Representations
    Lowe, Sindy
    O'Connor, Peter
    Veeling, Bastiaan S.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [2] End-to-End Learning of Deep Visual Representations for Image Retrieval
    Albert Gordo
    Jon Almazán
    Jerome Revaud
    Diane Larlus
    International Journal of Computer Vision, 2017, 124 : 237 - 254
  • [3] End-to-End Learning of Deep Visual Representations for Image Retrieval
    Gordo, Albert
    Almazan, Jon
    Revaud, Jerome
    Larlus, Diane
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2017, 124 (02) : 237 - 254
  • [4] End-to-End Learning of Representations for Asynchronous Event-Based Data
    Gehrig, Daniel
    Loquercio, Antonio
    Derpanis, Konstantinos G.
    Scaramuzza, Davide
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 5632 - 5642
  • [5] An End-to-End Learning Architecture for Efficient Image Encoding and Deep Learning
    Chamain, Lahiru D.
    Qi, Siyu
    Ding, Zhi
    29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 691 - 695
  • [6] An efficient end-to-end deep learning architecture for activity classification
    Amel Ben Mahjoub
    Mohamed Atri
    Analog Integrated Circuits and Signal Processing, 2019, 99 : 23 - 32
  • [7] An efficient end-to-end deep learning architecture for activity classification
    Ben Mahjoub, Amel
    Atri, Mohamed
    ANALOG INTEGRATED CIRCUITS AND SIGNAL PROCESSING, 2019, 99 (01) : 23 - 32
  • [8] LEARNING HIERARCHICAL REPRESENTATIONS FOR EXPRESSIVE SPEAKING STYLE IN END-TO-END SPEECH SYNTHESIS
    An, Xiaochun
    Wang, Yuxuan
    Yang, Shan
    Ma, Zejun
    Xie, Lei
    2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), 2019, : 184 - 191
  • [9] End-to-end learning of representations for instance-level document image retrieval
    Liu, Li
    Lu, Yue
    Suen, Ching Y.
    APPLIED SOFT COMPUTING, 2023, 136
  • [10] Soft-to-Hard Vector Quantization for End-to-End Learning Compressible Representations
    Agustsson, Eirikur
    Mentzer, Fabian
    Tschannen, Michael
    Cavigelli, Lukas
    Timofte, Radu
    Benini, Luca
    Van Gool, Luc
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30