AutoDPQ: Automated Differentiable Product Quantization for Embedding Compression

Authors
Gan, Xin [1]
Wang, Yuhao [1]
Zhao, Xiangyu [1]
Wang, Wanyu [1]
Wang, Yiqi [2]
Liu, Zitao [3]
Affiliations
[1] City Univ Hong Kong, Hong Kong, Peoples R China
[2] Natl Univ Def Technol, Changsha, Peoples R China
[3] Jinan Univ, Guangdong Inst Smart Educ, Guangzhou, Peoples R China
Keywords
Recommender Systems; AutoML; Compact Embedding
DOI
10.1145/3539618.3591953
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Deep recommender systems typically involve numerous feature fields for users and items, many of which contain a large number of low-frequency features. Because these features are so numerous and inadequately trained, they consume substantial storage space while reducing prediction accuracy. Some pioneering studies have explored embedding compression techniques to address this trade-off between storage space and predictive performance. However, these methods struggle to compact the embeddings of low-frequency features across different feature fields, because hyper-parameter search demands considerable human expertise and computing resources. In this paper, we propose the AutoDPQ framework, which automatically compacts the low-frequency feature embeddings of each feature field to an adaptive magnitude. Experimental results indicate that AutoDPQ can significantly reduce the parameter space while improving recommendation accuracy. Moreover, AutoDPQ is compatible with various deep CTR models and improves their performance significantly with high efficiency.
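The abstract does not spell out the compression mechanism, so the following is only a minimal sketch of plain product quantization, the general technique named in the title, and not the AutoDPQ method itself. It assumes randomly initialized codebooks (real systems learn them), and every name and size here (`encode`, `decode`, `pq_param_count`, `emb_dim`, `num_subspaces`, `num_centroids`) is hypothetical. It shows how one embedding is replaced by a few small integer codes, and how the stored parameter count changes.

```python
import random


def encode(vec, codebooks):
    """Return one centroid index per subspace (nearest by squared L2 distance)."""
    sub_dim = len(vec) // len(codebooks)
    codes = []
    for d, codebook in enumerate(codebooks):
        sub = vec[d * sub_dim:(d + 1) * sub_dim]
        dists = [sum((a - b) ** 2 for a, b in zip(sub, c)) for c in codebook]
        codes.append(dists.index(min(dists)))
    return codes


def decode(codes, codebooks):
    """Rebuild an approximate vector by concatenating the chosen centroids."""
    out = []
    for code, codebook in zip(codes, codebooks):
        out.extend(codebook[code])
    return out


def pq_param_count(num_features, emb_dim, num_subspaces, num_centroids):
    """Return (floats stored in codebooks, integer codes stored for features)."""
    # Each subspace holds num_centroids centroids of emb_dim / num_subspaces
    # floats, so the codebooks hold num_centroids * emb_dim floats in total.
    codebook_floats = num_centroids * emb_dim
    code_ints = num_features * num_subspaces
    return codebook_floats, code_ints


# Hypothetical sizes chosen only for illustration.
rng = random.Random(0)
emb_dim, num_subspaces, num_centroids = 8, 4, 16
sub_dim = emb_dim // num_subspaces

# Assumption: random codebooks stand in for learned ones.
codebooks = [
    [[rng.uniform(-1.0, 1.0) for _ in range(sub_dim)] for _ in range(num_centroids)]
    for _ in range(num_subspaces)
]

vec = [rng.uniform(-1.0, 1.0) for _ in range(emb_dim)]
codes = encode(vec, codebooks)      # num_subspaces small integers
approx = decode(codes, codebooks)   # emb_dim floats approximating vec
```

With these illustrative sizes, a dense table for one million features would store 8,000,000 floats, while the quantized form stores 128 codebook floats plus 4,000,000 four-bit codes. Choosing the number of subspaces and centroids per feature field is the kind of hyper-parameter search the abstract describes as costly.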
Pages: 1833-1837
Number of pages: 5