AutoDPQ: Automated Differentiable Product Quantization for Embedding Compression

被引:0
|
作者
Gan, Xin [1 ]
Wang, Yuhao [1 ]
Zhao, Xiangyu [1 ]
Wang, Wanyu [1 ]
Wang, Yiqi [2 ]
Liu, Zitao [3 ]
机构
[1] City Univ Hong Kong, Hong Kong, Peoples R China
[2] Natl Univ Def Technol, Changsha, Peoples R China
[3] Jinan Univ, Guangdong Inst Smart Educ, Guangzhou, Peoples R China
关键词
Recommender Systems; AutoML; Compact Embedding;
D O I
10.1145/3539618.3591953
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep recommender systems typically involve numerous feature fields for users and items, with a large number of low-frequency features. These low-frequency features would reduce the prediction accuracy with large storage space due to their vast quantity and inadequate training. Some pioneering studies have explored embedding compression techniques to address this issue of the trade-off between storage space and model predictability. However, these methods have difficulty compacting the embedding of low-frequency features in various feature fields due to the high demand for human experience and computing resources during hyper-parameter searching. In this paper, we propose the AutoDPQ framework, which automatically compacts low-frequency feature embeddings for each feature field to an adaptive magnitude. Experimental results indicate that AutoDPQ can significantly reduce the parameter space while improving recommendation accuracy. Moreover, AutoDPQ is compatible with various deep CTR models by improving their performance significantly with high efficiency.
引用
收藏
页码:1833 / 1837
页数:5
相关论文
共 50 条
  • [1] Differentiable Product Quantization for End-to-End Embedding Compression
    Chen, Ting
    Li, Lala
    Sun, Yizhou
    25TH AMERICAS CONFERENCE ON INFORMATION SYSTEMS (AMCIS 2019), 2019,
  • [2] Differentiable Product Quantization for End-to-End Embedding Compression
    Chen, Ting
    Li, Lala
    Sun, Yizhou
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
  • [3] Semisupervised Network Embedding With Differentiable Deep Quantization
    He, Tao
    Gao, Lianli
    Song, Jingkuan
    Li, Yuan-Fang
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (08) : 4791 - 4802
  • [4] Improved embedding product quantization
    The-Anh Pham
    Machine Vision and Applications, 2019, 30 : 447 - 459
  • [5] Improved embedding product quantization
    The-Anh Pham
    MACHINE VISION AND APPLICATIONS, 2019, 30 (03) : 447 - 459
  • [6] Embedding Compression with Isotropic Iterative Quantization
    Liao, Siyu
    Chen, Jie
    Wang, Yanzhi
    Qiu, Qinru
    Yuan, Bo
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 8336 - 8343
  • [7] Differentiable Product Quantization for Memory Efficient Camera Relocalization
    Laskar, Zakaria
    Melekhov, Iaroslav
    Benbihi, Assia
    Wang, Shuzhe
    Kannala, Juho
    COMPUTER VISION - ECCV 2024, PT LXXXV, 2025, 15143 : 470 - 489
  • [8] Embedding hierarchical clustering in product quantization for feature indexing
    The-Anh Pham
    Nang-Toan Do
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (08) : 9991 - 10012
  • [9] Embedding hierarchical clustering in product quantization for feature indexing
    The-Anh Pham
    Nang-Toan Do
    Multimedia Tools and Applications, 2019, 78 : 9991 - 10012
  • [10] JsonGrinder.jl: automated differentiable neural architecture for embedding arbitrary JSON data
    Mandlík, Šimon
    Račinský, Matěj
    Lisý, Viliam
    Pevný, Tomáš
    Journal of Machine Learning Research, 2022, 23