Practical Design of Performant Recommender Systems using Large-scale Linear Programming-based Global Inference

被引:1
|
作者
Gupta, Aman [1 ]
Keerthi, S. Sathiya [1 ]
Acharya, Ayan [1 ]
Cheng, Miao [1 ]
Elizondo, Borja Ocejo [1 ]
Ramanath, Rohan [2 ]
Mazumder, Rahul [1 ]
Basu, Kinjal [3 ]
Tay, J. Kenneth [1 ]
Gupta, Rupesh [1 ]
机构
[1] LinkedIn, Sunnyvale, CA 94085 USA
[2] Chico AI, Chico, CA USA
[3] Aliveo AI, Middletown, CT USA
来源
PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023 | 2023年
关键词
D O I
10.1145/3580305.3599183
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Several key problems in web-scale recommender systems, such as optimal matching and allocation, can be formulated as large-scale linear programs (LPs) [4, 1]. These LPs take predictions from ML models such as probabilities of click, like, etc. as inputs and optimize recommendations made to users. In recent years, there has been an explosion in the research and development of large-scale recommender systems, but effective optimization of business objectives using the output of those systems remains a challenge. Although LPs can help optimize such business objectives, and algorithms for solving LPs have existed since the 1950s [5, 8], generic LP solvers cannot handle the scale of these problems. At LinkedIn, we have developed algorithms that can solve LPs of various forms with trillions of variables in a Spark-based library called "DuaLip" [7], a novel distributed solver that solves a perturbation of the LP problem at scale via gradient-based algorithms on the smooth dual of the perturbed LP. DuaLip has been deployed in production at LinkedIn and powers several very large-scale recommender systems. DuaLip is open-sourced and extensible in terms of features and algorithms. In this first-of-its-kind tutorial, we will motivate the application of LPs to improve recommender systems, cover the theory of key LP algorithms [8, 6], and introduce DuaLip (https://github.com / linkedin/DuaLip), a highly performant Spark-based library that solves extreme-scale LPs for a large variety of recommender system problems. We will describe practical successes of large-scale LP in the industry [3, 2, 9], followed by a hands-on exercise to run DuaLip.
引用
收藏
页码:5781 / 5782
页数:2
相关论文
共 50 条
  • [1] A Quadratic Programming-Based Detector for Large-Scale MIMO Systems
    Elghariani, Ali
    Zoltowski, Michael
    2015 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2015, : 387 - 392
  • [2] A parallel, linear programming-based heuristic for large-scale set partitioning problems
    Linderoth, JT
    Lee, EKL
    Savelsbergh, MWP
    INFORMS JOURNAL ON COMPUTING, 2001, 13 (03) : 191 - 209
  • [3] Inference for Large-Scale Linear Systems With Known Coefficients
    Fang, Zheng
    Santos, Andres
    Shaikh, Azeem M.
    Torgovitsky, Alexander
    ECONOMETRICA, 2023, 91 (01) : 299 - 327
  • [4] Practical Large-Scale Linear Programming using Primal-Dual Hybrid Gradient
    Applegate, David
    Diaz, Mateo
    Hinder, Oliver
    Lu, Haihao
    Lubin, Miles
    O'Donoghue, Brendan
    Schudy, Warren
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [5] Towards More Practical Linear Programming-Based Techniques for Algorithmic Mechanism Design
    Elbassioni, Khaled
    Mehlhorn, Kurt
    Ramezani, Fahimeh
    ALGORITHMIC GAME THEORY, SAGT 2015, 2015, 9347 : 98 - 109
  • [6] Towards More Practical Linear Programming-based Techniques for Algorithmic Mechanism Design
    Elbassioni, Khaled
    Mehlhorn, Kurt
    Ramezani, Fahimeh
    THEORY OF COMPUTING SYSTEMS, 2016, 59 (04) : 641 - 663
  • [7] Towards More Practical Linear Programming-based Techniques for Algorithmic Mechanism Design
    Khaled Elbassioni
    Kurt Mehlhorn
    Fahimeh Ramezani
    Theory of Computing Systems, 2016, 59 : 641 - 663
  • [8] Adaptive dynamic programming-based decentralized event-triggered control of large-scale nonlinear systems
    Hu, Chuanhao
    Zou, Yuanyuan
    Li, Shaoyuan
    ASIAN JOURNAL OF CONTROL, 2022, 24 (04) : 1542 - 1556
  • [9] Data-driven mixed-Integer linear programming-based optimisation for efficient failure detection in large-scale distributed systems
    Er-Rahmadi, Btissam
    Ma, Tiejun
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2022, 303 (01) : 337 - 353
  • [10] Design optimization of large-scale structures with sequential linear programming
    Lamberti, L
    Pappalettere, C
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART C-JOURNAL OF MECHANICAL ENGINEERING SCIENCE, 2002, 216 (08) : 799 - 811