RLPer: A Reinforcement Learning Model for Personalized Search

被引：22

作者：

Yao, Jing ^{[2
]}

Dou, Zhicheng ^{[1
]}

Xu, Jun ^{[1
]}

Wen, Ji-Rong ^{[3
,4
]}

机构：

[1] Renmin Univ China, Gaoling Sch Artificial Intelligence, Beijing, Peoples R China

[2] Renmin Univ China, Sch Informat, Beijing, Peoples R China

[3] Beijing Key Lab Big Data Management & Anal Method, Beijing, Peoples R China

[4] Key Lab Data Engn & Knowledge Engn, MOE, Beijing, Peoples R China

来源：

WEB CONFERENCE 2020: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2020) | 2020年

基金：

中国国家自然科学基金;

关键词：

Personalized Search; Reinforcement Learning; MDP;

D O I：

10.1145/3366423.3380294

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Personalized search improves generic ranking models by taking user interests into consideration and returning more accurate search results to individual users. In recent years, machine learning and deep learning techniques have been successfully applied in personalized search. Most existing personalization models simply regard the search history as a static set of user behaviours and learn fixed ranking strategies based on the recorded data. Though improvements have been observed, it is obvious that these methods ignore the dynamic nature of the search process: search is a sequence of interactions between the search engine and the user. During the search process, the user interests may dynamically change. It would be more helpful if a personalized search model could track the whole interaction process and update its ranking strategy continuously. In this paper, we propose a reinforcement learning based personalization model, referred to as RLPer, to track the sequential interactions between the users and search engine with a hierarchical Markov Decision Process (MDP). In RLPer, the search engine interacts with the user to update the underlying ranking model continuously with real-time feedback. And we design a feedback-aware personalized ranking component to catch the user's feedback which has impacts on the user interest profile for the next query. Experimental results on the publicly available AOL search log verify that our proposed model can significantly outperform state-of-the-art personalized search models.

引用

页码：2298 / 2308

页数：11

共 50 条

[1] RLPS: A Reinforcement Learning-Based Framework for Personalized Search
Yao, Jing
Dou, Zhicheng
Xu, Jun
Wen, Ji-Rong
[J]. ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2021, 39 (03)
[2] A personalized ranking method based on inverse reinforcement learning in search engines
Karamiyan, Fatemeh
Mahootchi, Masoud
Mohebi, Azadeh
[J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 136
[3] Personalized and automatic model repairing using reinforcement learning
Barriga, Angela
Rutle, Adrian
Heldal, Rogardt
[J]. 2019 ACM/IEEE 22ND INTERNATIONAL CONFERENCE ON MODEL DRIVEN ENGINEERING LANGUAGES AND SYSTEMS COMPANION (MODELS-C 2019), 2019, : 175 - 181
[4] Federated Model Search via Reinforcement Learning
Yao, Dixi
Wang, Lingdong
Xu, Jiayu
Xiang, Liyao
Shao, Shuo
Chen, Yingqi
Tong, Yanjun
[J]. 2021 IEEE 41ST INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2021), 2021, : 830 - 840
[5] Learning a Hierarchical Embedding Model for Personalized Product Search
Ai, Qingyao
Zhang, Yongfeng
Bi, Keping
Chen, Xu
Croft, W. Bruce
[J]. SIGIR'17: PROCEEDINGS OF THE 40TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2017, : 645 - 654
[6] Reinforcement Learning for Personalized Dialogue Management
den Hengst, Floris
Hoogendoorn, Mark
van Harmelen, Frank
Bosman, Joost
[J]. 2019 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2019), 2019, : 59 - 67
[7] Personalized Reinforcement Learning with a Budget of Policies
Ivanov, Dmitry
Ben-Porat, Omer
[J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 11, 2024, : 12735 - 12743
[8] Networked Personalized Federated Learning Using Reinforcement Learning
Gauthier, Francois
Gogineni, Vinay Chakravarthi
Werner, Stefan
[J]. ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 4397 - 4402
[9] A reinforcement learning approach to personalized learning recommendation systems
Tang, Xueying
Chen, Yunxiao
Li, Xiaoou
Liu, Jingchen
Ying, Zhiliang
[J]. BRITISH JOURNAL OF MATHEMATICAL & STATISTICAL PSYCHOLOGY, 2019, 72 (01): : 108 - 135
[10] On the Search for Feedback in Reinforcement Learning
Wang, Ran
Parunandi, Karthikeya S.
Sharma, Aayushman
Goyal, Raman
Chakravorty, Suman
[J]. 2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 1560 - 1567

← 1 2 3 4 5 →