AutoDPQ: Automated Differentiable Product Quantization for Embedding Compression

被引：0

作者：

Gan, Xin ^{[1
]}

Wang, Yuhao ^{[1
]}

Zhao, Xiangyu ^{[1
]}

Wang, Wanyu ^{[1
]}

Wang, Yiqi ^{[2
]}

Liu, Zitao ^{[3
]}

机构：

[1] City Univ Hong Kong, Hong Kong, Peoples R China

[2] Natl Univ Def Technol, Changsha, Peoples R China

[3] Jinan Univ, Guangdong Inst Smart Educ, Guangzhou, Peoples R China

来源：

PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023 | 2023年

关键词：

Recommender Systems; AutoML; Compact Embedding;

D O I：

10.1145/3539618.3591953

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Deep recommender systems typically involve numerous feature fields for users and items, with a large number of low-frequency features. These low-frequency features would reduce the prediction accuracy with large storage space due to their vast quantity and inadequate training. Some pioneering studies have explored embedding compression techniques to address this issue of the trade-off between storage space and model predictability. However, these methods have difficulty compacting the embedding of low-frequency features in various feature fields due to the high demand for human experience and computing resources during hyper-parameter searching. In this paper, we propose the AutoDPQ framework, which automatically compacts low-frequency feature embeddings for each feature field to an adaptive magnitude. Experimental results indicate that AutoDPQ can significantly reduce the parameter space while improving recommendation accuracy. Moreover, AutoDPQ is compatible with various deep CTR models by improving their performance significantly with high efficiency.

引用

页码：1833 / 1837

页数：5

共 50 条

[41] Data, compression by geometric quantization
Khumbah, NA
Wegman, EJ
RECENT ADVANCES AND TRENDS IN NONPARAMETRIC STATISTICS, 2003, : 35 - 46
[42] Transform Quantization for CNN Compression
Young, Sean, I
Zhe, Wang
Taubman, David
Girod, Bernd
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (09) : 5700 - 5714
[43] Quantization of the Sobolev Space of Half-Differentiable Functions, II
Sergeev, A. G.
RUSSIAN JOURNAL OF MATHEMATICAL PHYSICS, 2019, 26 (03) : 401 - 405
[44] A Differentiable Entropy Model for Learned Image Compression
Presta, Alberto
Fiandrotti, Attilio
Tartaglione, Enzo
Grangetto, Marco
IMAGE ANALYSIS AND PROCESSING, ICIAP 2023, PT I, 2023, 14233 : 328 - 339
[45] Quantization of the Sobolev Space of Half-Differentiable Functions, II
A. G. Sergeev
Russian Journal of Mathematical Physics, 2019, 26 : 401 - 405
[46] Reversible data embedding for vector quantization indices
Chang, Chin-Chen
Kieu, The Duc
2007 THIRD INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION HIDING AND MULTIMEDIA SIGNAL PROCESSING, VOL 1, PROCEEDINGS, 2007, : 481 - 484
[47] SUPERVISED HASHING WITH JOINTLY LEARNING EMBEDDING AND QUANTIZATION
Zhu, Hao
Wang, Feng
Xiang, Xiang
Tran, Trac D.
2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 3715 - 3719
[48] DOUBLE EMBEDDING IN THE QUANTIZATION INDEX MODULATION FRAMEWORK
Sarkar, A.
Manjunath, B. S.
2009 16TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-6, 2009, : 3653 - 3656
[49] Joint security & robustness enhancement for quantization embedding
Wu, M
2003 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL 2, PROCEEDINGS, 2003, : 483 - 486
[50] Embedding gray images using multiple quantization
Wang, Guoxi
Ma, Lihong
Lu, Hanqing
Guo, Weiqiang
Yu, Yingling
WMSCI 2006: 10TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL V, PROCEEDINGS, 2006, : 197 - +

← 1 2 3 4 5 →