Contextual User Browsing Bandits for Large-Scale Online Mobile Recommendation

被引:3
|
作者
He, Xu [1 ]
An, Bo [1 ]
Li, Yanghua [2 ]
Chen, Haikai [2 ]
Guo, Qingyu [1 ]
Li, Xin [2 ]
Wang, Zhirong [2 ]
机构
[1] Nanyang Technol Univ, Singapore, Singapore
[2] Alibaba Grp, Hangzhou, Peoples R China
关键词
Contextual bandit; Combinatorial bandit; Position bias;
D O I
10.1145/3383313.3412234
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Online recommendation services recommend multiple commodities to users. Nowadays, a considerable proportion of users visit e-commerce platforms by mobile devices. Due to the limited screen size of mobile devices, positions of items have a significant influence on clicks: 1) Higher positions lead to more clicks for one commodity. 2) The `pseudo-exposure' issue: Only a few recommended items are shown at first glance and users need to slide the screen to browse other items. Therefore, some recommended items ranked behind are not viewed by users and it is not proper to treat this kind of items as negative samples. While many works model the online recommendation as contextual bandit problems, they rarely take the influence of positions into consideration and thus the estimation of the reward function may be biased. In this paper, we aim at addressing these two issues to improve the performance of online mobile recommendation. Our contributions are four-fold. First, since we concern the reward of a set of recommended items, we model the online recommendation as a contextual combinatorial bandit problem and define the reward of a recommended set. Second, we propose a novel contextual combinatorial bandit method called UBM-LinUCB to address two issues related to positions by adopting the User Browsing Model (UBM), a click model for web search. Third, we provide a formal regret analysis and prove that our algorithm achieves sublinear regret independent of the number of items. Finally, we evaluate our algorithm on two real-world datasets by a novel unbiased estimator. An online experiment is also implemented in Taobao, one of the most popular e-commerce platforms in the world. Results on two CTR metrics show that our algorithm outperforms the other contextual bandit algorithms.
引用
收藏
页码:63 / 72
页数:10
相关论文
共 50 条
  • [1] Contextual and Sequential User Embeddings for Large-Scale Music Recommendation
    Hansen, Casper
    Hansen, Christian
    Maystre, Lucas
    Mehrotra, Rishabh
    Brost, Brian
    Tomasi, Federico
    Lalmas, Mounia
    [J]. RECSYS 2020: 14TH ACM CONFERENCE ON RECOMMENDER SYSTEMS, 2020, : 53 - 62
  • [2] Online Learning in Large-Scale Contextual Recommender Systems
    Song, Linqi
    Tekin, Cem
    van der Schaar, Mihaela
    [J]. IEEE TRANSACTIONS ON SERVICES COMPUTING, 2016, 9 (03) : 433 - 445
  • [3] LsRec: Large-scale social recommendation with online update
    Zhou, Wang
    Zhou, Yongluan
    Li, Jianping
    Memon, Muhammad Hammad
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2020, 162
  • [4] MobileRec: A Large-Scale Dataset for Mobile Apps Recommendation
    Maqbool, M. H.
    Farooq, Umar
    Mosharrof, Adib
    Siddique, A. B.
    Foroosh, Hassan
    [J]. PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 3007 - 3016
  • [5] Contextual Bandits With Hidden Features to Online Recommendation via Sparse Interactions
    Yang, Shangdong
    Zhang, Chenyu
    Gao, Yang
    Wang, Hao
    [J]. IEEE INTELLIGENT SYSTEMS, 2020, 35 (05) : 62 - 71
  • [6] Using Contextual Bandits with Behavioral Constraints for Constrained Online Movie Recommendation
    Balakrishnan, Avinash
    Bouneffouf, Djallel
    Mattei, Nicholas
    Rossi, Francesca
    [J]. PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 5802 - 5804
  • [7] Building Discriminative User Profiles for Large-scale Content Recommendation
    Zhong, Erheng
    Liu, Nathan
    Shi, Yue
    Rajan, Suju
    [J]. KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2015, : 2277 - 2286
  • [8] A Recommendation System of Personalized Resource Reliability for Online Teaching System under Large-scale User Access
    Chen, Wenqing
    Yang, Ting
    [J]. MOBILE NETWORKS & APPLICATIONS, 2023, 28 (03): : 983 - 994
  • [9] A Recommendation System of Personalized Resource Reliability for Online Teaching System under Large-scale User Access
    Wenqing Chen
    Ting Yang
    [J]. Mobile Networks and Applications, 2023, 28 : 983 - 994
  • [10] Who to follow recommendation in large-scale online development communities
    Schall, Daniel
    [J]. INFORMATION AND SOFTWARE TECHNOLOGY, 2014, 56 (12) : 1543 - 1555