Efficient end-to-end learning for quantizable representations

被引：0

作者：

Jeong, Yeonwoo ^{[1
]}

Song, Hyun Oh ^{[1
]}

机构：

[1] Seoul Natl Univ, Dept Comp Sci & Engn, Seoul, South Korea

来源：

INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80 | 2018年 / 80卷

基金：

新加坡国家研究基金会;

关键词：

IMPLEMENTATION;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Embedding representation learning via neural networks is at the core foundation of modern similarity based search. While much effort has been put in developing algorithms for learning binary hamming code representations for search efficiency, this still requires a linear scan of the entire dataset per each query and trades off the search accuracy through binarization. To this end, we consider the problem of directly learning a quantizable embedding representation and the sparse binary hash code end-to-end which can be used to construct an efficient hash table not only providing significant search reduction in the number of data but also achieving the state of the art search accuracy outperforming previous state of the art deep metric learning methods. We also show that finding the optimal sparse binary hash code in a mini-batch can be computed exactly in polynomial time by solving a minimum cost flow problem. Our results on Cifar-100 and on ImageNet datasets show the state of the art search accuracy in precision@k and NMI metrics while providing up to 98x and 478x search speedup respectively over exhaustive linear search. The source code is available at https://github.com/maestrojeong/Deep-Hash-Table-ICML 18.

引用

页数：10

共 50 条

[1] Putting An End to End-to-End: Gradient-Isolated Learning of Representations
Lowe, Sindy
O'Connor, Peter
Veeling, Bastiaan S.
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
[2] End-to-End Learning of Deep Visual Representations for Image Retrieval
Albert Gordo
Jon Almazán
Jerome Revaud
Diane Larlus
International Journal of Computer Vision, 2017, 124 : 237 - 254
[3] End-to-End Learning of Deep Visual Representations for Image Retrieval
Gordo, Albert
Almazan, Jon
Revaud, Jerome
Larlus, Diane
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2017, 124 (02) : 237 - 254
[4] End-to-End Learning of Representations for Asynchronous Event-Based Data
Gehrig, Daniel
Loquercio, Antonio
Derpanis, Konstantinos G.
Scaramuzza, Davide
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 5632 - 5642
[5] An End-to-End Learning Architecture for Efficient Image Encoding and Deep Learning
Chamain, Lahiru D.
Qi, Siyu
Ding, Zhi
29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 691 - 695
[6] An efficient end-to-end deep learning architecture for activity classification
Amel Ben Mahjoub
Mohamed Atri
Analog Integrated Circuits and Signal Processing, 2019, 99 : 23 - 32
[7] An efficient end-to-end deep learning architecture for activity classification
Ben Mahjoub, Amel
Atri, Mohamed
ANALOG INTEGRATED CIRCUITS AND SIGNAL PROCESSING, 2019, 99 (01) : 23 - 32
[8] LEARNING HIERARCHICAL REPRESENTATIONS FOR EXPRESSIVE SPEAKING STYLE IN END-TO-END SPEECH SYNTHESIS
An, Xiaochun
Wang, Yuxuan
Yang, Shan
Ma, Zejun
Xie, Lei
2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), 2019, : 184 - 191
[9] End-to-end learning of representations for instance-level document image retrieval
Liu, Li
Lu, Yue
Suen, Ching Y.
APPLIED SOFT COMPUTING, 2023, 136
[10] Soft-to-Hard Vector Quantization for End-to-End Learning Compressible Representations
Agustsson, Eirikur
Mentzer, Fabian
Tschannen, Michael
Cavigelli, Lukas
Timofte, Radu
Benini, Luca
Van Gool, Luc
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30

← 1 2 3 4 5 →