Long-Tail Hashing

Citations: 6

Authors:

Chen, Yong [1,2]

Hou, Yuqing [3]

Leng, Shu [4]

Zhang, Qing [3]

Lin, Zhouchen [1,2]

Zhang, Dell [5,6]

Affiliations:

[1] Peking University, School of EECS, Key Laboratory of Machine Perception (MoE), Beijing, People's Republic of China

[2] Pazhou Lab, Guangzhou, People's Republic of China

[3] Meituan, Beijing, People's Republic of China

[4] Tsinghua University, Department of Automation, Beijing, People's Republic of China

[5] Blue Prism AI Labs, London, England

[6] Birkbeck, University of London, London, England

Source:

SIGIR '21: Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval | 2021

Funding:

National Natural Science Foundation of China; China Postdoctoral Science Foundation

Keywords:

learning to hash; long-tail datasets; memory network; large-scale; multimedia retrieval; iterative quantization; Procrustean approach; distributions; Pareto; codes; SMOTE

DOI:

10.1145/3404835.3462888

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology]

Discipline classification code:

0812

Abstract:

Hashing, which represents data items as compact binary codes, has become an increasingly popular technique, e.g., for large-scale image retrieval, owing to its very fast search speed and its extremely economical memory consumption. However, existing hashing methods all try to learn binary codes from artificially balanced datasets, which are not commonly available in real-world scenarios. In this paper, we propose Long-Tail Hashing Network (LTHNet), a novel two-stage deep hashing approach that addresses the problem of learning to hash for more realistic datasets where the data labels roughly exhibit a long-tail distribution. Specifically, the first stage learns relaxed embeddings of the given dataset, taking its long-tail characteristic into account, via an end-to-end deep neural network; the second stage binarizes the obtained embeddings. A critical part of LTHNet is its dynamic meta-embedding module, extended with a determinantal point process, which can adaptively realize visual knowledge transfer between head and tail classes and thus enrich image representations for hashing. Our experiments show that LTHNet achieves dramatic performance improvements over all state-of-the-art competitors on long-tail datasets, with little or no sacrifice on balanced datasets. Further analyses reveal that, to our surprise, directly manipulating class weights in the loss function has little effect, whereas the extended dynamic meta-embedding module, the use of cross-entropy loss instead of square loss, and a relatively small training batch size all contribute to LTHNet's success.
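
As a rough illustration of the second stage described in the abstract (turning relaxed, real-valued embeddings into binary codes) and of why such codes make search cheap, the sketch below thresholds each embedding dimension at zero and ranks database items by Hamming distance. The sign threshold, the function names, and the toy 4-dimensional vectors are all assumptions made for illustration; the abstract does not say how LTHNet performs binarization, and this is not the authors' implementation.

```python
from typing import List, Sequence


def binarize(embedding: Sequence[float]) -> int:
    """Pack sign bits into an integer: bit i is 1 when embedding[i] > 0.

    Assumption: plain sign thresholding, used here only as a stand-in
    for whatever binarization step the paper actually uses.
    """
    code = 0
    for i, value in enumerate(embedding):
        if value > 0:
            code |= 1 << i
    return code


def hamming(a: int, b: int) -> int:
    """Number of bit positions in which two packed codes differ."""
    return bin(a ^ b).count("1")


def rank_by_hamming(query_code: int, db_codes: List[int]) -> List[int]:
    """Database indices sorted by increasing Hamming distance to the query."""
    return sorted(range(len(db_codes)),
                  key=lambda i: hamming(query_code, db_codes[i]))


# Hypothetical relaxed embeddings (stage-one outputs) for three items.
db_embeddings = [
    [0.9, -0.2, 0.4, -0.7],
    [-0.5, 0.3, -0.1, 0.8],
    [0.6, -0.4, -0.3, -0.9],
]
query_embedding = [0.8, -0.1, 0.5, -0.6]

db_codes = [binarize(e) for e in db_embeddings]
query_code = binarize(query_embedding)
ranking = rank_by_hamming(query_code, db_codes)
print(db_codes, query_code, ranking)
```

Each comparison reduces to an XOR and a bit count on a small integer, which is the source of the speed and memory savings the abstract mentions.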

Pages: 1328-1338

Number of pages: 11

50 items in total

[21] Long-tail liabilities and claims management in the NHS
Fenn, P
Hodges, R
LAW AND UNCERTAINTY: RISKS AND LEGAL PROCESSES, 1997, : 241 - 253
[22] Logit Normalization for Long-Tail Object Detection
Zhao, Liang
Teng, Yao
Wang, Limin
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (06) : 2114 - 2134
[23] Complementary Product Recommendation for Long-tail Products
Papso, Rastislav
PROCEEDINGS OF THE 17TH ACM CONFERENCE ON RECOMMENDER SYSTEMS, RECSYS 2023, 2023, : 1305 - 1311
[24] A CONSIDERATION OF APPEARANCE OF LONG-TAIL TRICHEL PULSES
SAWA, G
SHINOHARA, U
IEDA, M
JOURNAL OF APPLIED PHYSICS, 1967, 38 (13) : 5352 - +
[25] Capturing long-tail distributions of object subcategories
Zhu, Xiangxin
Anguelov, Dragomir
Ramanan, Deva
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 915 - 922
[26] Long-tail behavior in locomotion of Caenorhabditis elegans
Ohkubo, Jun
Yoshida, Kazushi
Iino, Yuichi
Masuda, Naoki
JOURNAL OF THEORETICAL BIOLOGY, 2010, 267 (02) : 213 - 222
[27] Editorial: LONG-TAIL LIABILITY LAW REFORM
Freckelton, Ian
JOURNAL OF LAW AND MEDICINE, 2007, 15 (02) : 171 - 175
[28] A Survey of Long-Tail Item Recommendation Methods
Qin, Jing
WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2021, 2021
[29] Distributional Robustness Loss for Long-tail Learning
Samuel, Dvir
Chechik, Gal
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 9475 - 9484
[30] Managing Long-Tail Processes Using FormSys
Weber, Ingo
Paik, Rye-Young
Benatallah, Boualem
Vorwerk, Corren
Gong, Zifei
Zheng, Liangliang
Kim, Sung Wook
SERVICE-ORIENTED COMPUTING - ICSOC 2010, PROCEEDINGS, 2010, 6470 : 702 - 703