Kraken: Memory-Efficient Continual Learning for Large-Scale Real-Time Recommendations

Cited by: 16
Authors
Xie, Minhui [1]
Ren, Kai [2]
Lu, Youyou [1]
Yang, Guangxu [2]
Xu, Qingxing [2]
Wu, Bihai [2]
Lin, Jiazhen [1]
Ao, Hongbo [2]
Xu, Wanhong [2]
Shu, Jiwu [1]
Affiliations
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China
[2] Kuaishou Technol, Beijing, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Systems for Machine Learning; Continual Learning; Recommendation System
DOI
10.1109/SC41405.2020.00025
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Modern recommendation systems in industry often use deep learning (DL) models that achieve better model accuracy with more data and model parameters. However, current open-source DL frameworks, such as TensorFlow and PyTorch, show relatively low scalability when training recommendation models with terabytes of parameters. To efficiently learn large-scale recommendation models from data streams that generate hundreds of terabytes of training data daily, we introduce a continual learning system called Kraken. Kraken contains a special parameter server implementation that dynamically adapts to the rapidly changing set of sparse features for the continual training and serving of recommendation models. Kraken provides a sparsity-aware training system that uses different learning optimizers for dense and sparse parameters to reduce memory overhead. Extensive experiments using real-world datasets confirm the effectiveness and scalability of Kraken. Kraken can improve the accuracy of recommendation tasks with the same memory resources, or cut memory usage to one third while maintaining model performance.
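
As a rough illustration of why using different optimizers for dense and sparse parameters can reduce memory, the sketch below pairs Adam (two state values per parameter) for a small dense vector with a row-wise Adagrad (one state value per embedding row, allocated only for rows that are actually updated) for a sparse embedding table. The abstract does not say which optimizers Kraken uses, so this pairing, the class names `DenseAdam` and `RowwiseAdagrad`, and all sizes and learning rates are assumptions made for illustration only.

```python
import math
from collections import defaultdict


class DenseAdam:
    """Adam for a dense vector: keeps two state floats (m, v) per parameter."""

    def __init__(self, size, lr=0.01, b1=0.9, b2=0.999, eps=1e-8):
        self.lr, self.b1, self.b2, self.eps = lr, b1, b2, eps
        self.m = [0.0] * size
        self.v = [0.0] * size
        self.t = 0

    def step(self, params, grads):
        self.t += 1
        for i, g in enumerate(grads):
            self.m[i] = self.b1 * self.m[i] + (1 - self.b1) * g
            self.v[i] = self.b2 * self.v[i] + (1 - self.b2) * g * g
            m_hat = self.m[i] / (1 - self.b1 ** self.t)  # bias correction
            v_hat = self.v[i] / (1 - self.b2 ** self.t)
            params[i] -= self.lr * m_hat / (math.sqrt(v_hat) + self.eps)

    def state_floats(self):
        return len(self.m) + len(self.v)


class RowwiseAdagrad:
    """Adagrad variant with one scalar accumulator per embedding row.

    State is created lazily, so rows that never receive a gradient
    cost no optimizer memory at all.
    """

    def __init__(self, lr=0.1, eps=1e-8):
        self.lr, self.eps = lr, eps
        self.acc = defaultdict(float)  # feature id -> scalar accumulator

    def step(self, table, row_grads):
        for fid, grad in row_grads.items():
            # Accumulate the mean squared gradient of the whole row.
            self.acc[fid] += sum(g * g for g in grad) / len(grad)
            scale = self.lr / (math.sqrt(self.acc[fid]) + self.eps)
            row = table[fid]
            for j, g in enumerate(grad):
                row[j] -= scale * g

    def state_floats(self):
        return len(self.acc)


# Hypothetical sizes: 1000 embedding rows of width 8, 16 dense weights.
dim = 8
table = {fid: [0.0] * dim for fid in range(1000)}
dense = [0.0] * 16

dense_opt = DenseAdam(len(dense))
sparse_opt = RowwiseAdagrad()

dense_opt.step(dense, [1.0] * len(dense))
# Only two feature ids appear in this mini-batch.
sparse_opt.step(table, {3: [1.0] * dim, 42: [0.5] * dim})

adam_on_table = 2 * len(table) * dim  # state Adam would need for the table
print(dense_opt.state_floats(), sparse_opt.state_floats(), adam_on_table)
```

In this toy setup the sparse optimizer holds state only for the two rows that were touched, whereas applying Adam to the full table would require two values for every one of its entries.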
Pages: 17
    Victor, T
    2001 IEEE INTELLIGENT TRANSPORTATION SYSTEMS - PROCEEDINGS, 2001, : 1018 - 1023