RLPer: A Reinforcement Learning Model for Personalized Search

被引:22
|
作者
Yao, Jing [2 ]
Dou, Zhicheng [1 ]
Xu, Jun [1 ]
Wen, Ji-Rong [3 ,4 ]
机构
[1] Renmin Univ China, Gaoling Sch Artificial Intelligence, Beijing, Peoples R China
[2] Renmin Univ China, Sch Informat, Beijing, Peoples R China
[3] Beijing Key Lab Big Data Management & Anal Method, Beijing, Peoples R China
[4] Key Lab Data Engn & Knowledge Engn, MOE, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Personalized Search; Reinforcement Learning; MDP;
D O I
10.1145/3366423.3380294
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Personalized search improves generic ranking models by taking user interests into consideration and returning more accurate search results to individual users. In recent years, machine learning and deep learning techniques have been successfully applied in personalized search. Most existing personalization models simply regard the search history as a static set of user behaviours and learn fixed ranking strategies based on the recorded data. Though improvements have been observed, it is obvious that these methods ignore the dynamic nature of the search process: search is a sequence of interactions between the search engine and the user. During the search process, the user interests may dynamically change. It would be more helpful if a personalized search model could track the whole interaction process and update its ranking strategy continuously. In this paper, we propose a reinforcement learning based personalization model, referred to as RLPer, to track the sequential interactions between the users and search engine with a hierarchical Markov Decision Process (MDP). In RLPer, the search engine interacts with the user to update the underlying ranking model continuously with real-time feedback. And we design a feedback-aware personalized ranking component to catch the user's feedback which has impacts on the user interest profile for the next query. Experimental results on the publicly available AOL search log verify that our proposed model can significantly outperform state-of-the-art personalized search models.
引用
收藏
页码:2298 / 2308
页数:11
相关论文
共 50 条
  • [1] RLPS: A Reinforcement Learning-Based Framework for Personalized Search
    Yao, Jing
    Dou, Zhicheng
    Xu, Jun
    Wen, Ji-Rong
    [J]. ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2021, 39 (03)
  • [2] A personalized ranking method based on inverse reinforcement learning in search engines
    Karamiyan, Fatemeh
    Mahootchi, Masoud
    Mohebi, Azadeh
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 136
  • [3] Personalized and automatic model repairing using reinforcement learning
    Barriga, Angela
    Rutle, Adrian
    Heldal, Rogardt
    [J]. 2019 ACM/IEEE 22ND INTERNATIONAL CONFERENCE ON MODEL DRIVEN ENGINEERING LANGUAGES AND SYSTEMS COMPANION (MODELS-C 2019), 2019, : 175 - 181
  • [4] Federated Model Search via Reinforcement Learning
    Yao, Dixi
    Wang, Lingdong
    Xu, Jiayu
    Xiang, Liyao
    Shao, Shuo
    Chen, Yingqi
    Tong, Yanjun
    [J]. 2021 IEEE 41ST INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2021), 2021, : 830 - 840
  • [5] Learning a Hierarchical Embedding Model for Personalized Product Search
    Ai, Qingyao
    Zhang, Yongfeng
    Bi, Keping
    Chen, Xu
    Croft, W. Bruce
    [J]. SIGIR'17: PROCEEDINGS OF THE 40TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2017, : 645 - 654
  • [6] Reinforcement Learning for Personalized Dialogue Management
    den Hengst, Floris
    Hoogendoorn, Mark
    van Harmelen, Frank
    Bosman, Joost
    [J]. 2019 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2019), 2019, : 59 - 67
  • [7] Personalized Reinforcement Learning with a Budget of Policies
    Ivanov, Dmitry
    Ben-Porat, Omer
    [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 11, 2024, : 12735 - 12743
  • [8] Networked Personalized Federated Learning Using Reinforcement Learning
    Gauthier, Francois
    Gogineni, Vinay Chakravarthi
    Werner, Stefan
    [J]. ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 4397 - 4402
  • [9] A reinforcement learning approach to personalized learning recommendation systems
    Tang, Xueying
    Chen, Yunxiao
    Li, Xiaoou
    Liu, Jingchen
    Ying, Zhiliang
    [J]. BRITISH JOURNAL OF MATHEMATICAL & STATISTICAL PSYCHOLOGY, 2019, 72 (01): : 108 - 135
  • [10] On the Search for Feedback in Reinforcement Learning
    Wang, Ran
    Parunandi, Karthikeya S.
    Sharma, Aayushman
    Goyal, Raman
    Chakravorty, Suman
    [J]. 2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 1560 - 1567