Long-Tail Hashing

被引:6
|
作者
Chen, Yong [1 ,2 ]
Hou, Yuqing [3 ]
Leng, Shu [4 ]
Zhang, Qing [3 ]
Lin, Zhouchen [1 ,2 ]
Zhang, Dell [5 ,6 ]
机构
[1] Peking Univ, Sch EECS, Key Lab Machine Percept MoE, Beijing, Peoples R China
[2] Pazhou Lab, Guangzhou, Peoples R China
[3] Meituan, Beijing, Peoples R China
[4] Tsinghua Univ, Dept Automat, Beijing, Peoples R China
[5] Blue Prism AI Labs, London, England
[6] Birkbeck Univ London, London, England
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
learning to hash; long-tail datasets; memory network; large-scale; multimedia retrieval; ITERATIVE QUANTIZATION; PROCRUSTEAN APPROACH; DISTRIBUTIONS; PARETO; CODES; SMOTE;
D O I
10.1145/3404835.3462888
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Hashing, which represents data items as compact binary codes, has been becoming a more and more popular technique, e.g., for large-scale image retrieval, owing to its super fast search speed as well as its extremely economical memory consumption. However, existing hashing methods all try to learn binary codes from artificially balanced datasets which are not commonly available in real-world scenarios. In this paper, we propose Long-Tail Hashing Network (LTHNet), a novel two-stage deep hashing approach that addresses the problem of learning to hash for more realistic datasets where the data labels roughly exhibit a long-tail distribution. Specifically, the first stage is to learn relaxed embeddings of the given dataset with its long-tail characteristic taken into account via an end-to-end deep neural network; the second stage is to binarize those obtained embeddings. A critical part of LTHNet is its dynamic meta-embedding module extended with a determinantal point process which can adaptively realize visual knowledge transfer between head and tail classes, and thus enrich image representations for hashing. Our experiments have shown that LTHNet achieves dramatic performance improvements over all state-of-the-art competitors on long-tail datasets, with no or little sacrifice on balanced datasets. Further analyses reveal that while to our surprise directly manipulating class weights in the loss function has little effect, the extended dynamic meta-embedding module, the usage of cross-entropy loss instead of square loss, and the relatively small batch-size for training all contribute to LTHNet's success.
引用
收藏
页码:1328 / 1338
页数:11
相关论文
共 50 条
  • [21] Long-tail liabilities and claims management in the NHS
    Fenn, P
    Hodges, R
    LAW AND UNCERTAINTY: RISKS AND LEGAL PROCESSES, 1997, : 241 - 253
  • [22] Logit Normalization for Long-Tail Object Detection
    Zhao, Liang
    Teng, Yao
    Wang, Limin
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (06) : 2114 - 2134
  • [23] Complementary Product Recommendation for Long-tail Products
    Papso, Rastislav
    PROCEEDINGS OF THE 17TH ACM CONFERENCE ON RECOMMENDER SYSTEMS, RECSYS 2023, 2023, : 1305 - 1311
  • [24] A CONSIDERATION OF APPEARANCE OF LONG-TAIL TRICHEL PULSES
    SAWA, G
    SHINOHARA, U
    IEDA, M
    JOURNAL OF APPLIED PHYSICS, 1967, 38 (13) : 5352 - +
  • [25] Capturing long-tail distributions of object subcategories
    Zhu, Xiangxin
    Anguelov, Dragomir
    Ramanan, Deva
    2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 915 - 922
  • [26] Long-tail behavior in locomotion of Caenorhabditis elegans
    Ohkubo, Jun
    Yoshida, Kazushi
    Iino, Yuichi
    Masuda, Naoki
    JOURNAL OF THEORETICAL BIOLOGY, 2010, 267 (02) : 213 - 222
  • [27] Editorial: LONG-TAIL LIABILITY LAW REFORM
    Freckelton, Ian
    JOURNAL OF LAW AND MEDICINE, 2007, 15 (02) : 171 - 175
  • [28] A Survey of Long-Tail Item Recommendation Methods
    Qin, Jing
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2021, 2021
  • [29] Distributional Robustness Loss for Long-tail Learning
    Samuel, Dvir
    Chechik, Gal
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 9475 - 9484
  • [30] Managing Long-Tail Processes Using FormSys
    Weber, Ingo
    Paik, Rye-Young
    Benatallah, Boualem
    Vorwerk, Corren
    Gong, Zifei
    Zheng, Liangliang
    Kim, Sung Wook
    SERVICE-ORIENTED COMPUTING - ICSOC 2010, PROCEEDINGS, 2010, 6470 : 702 - 703