RLPS: A Reinforcement Learning-Based Framework for Personalized Search

被引：5

作者：

Yao, Jing ^{[1
]}

Dou, Zhicheng ^{[2
]}

Xu, Jun ^{[2
]}

Wen, Ji-Rong ^{[3
]}

机构：

[1] Renmin Univ China, Sch Informat, 59 Zhongguancun St, Beijing 100872, Peoples R China

[2] Renmin Univ China, Sch Artificial Intelligence, 59 Zhongguancun St, Beijing 100872, Peoples R China

[3] MOE, Beijing Key Lab Big Data Management & Anal Method, Key Lab Data Engn & Knowledge Engn, 59 Zhongguancun St, Beijing 100872, Peoples R China

来源：

ACM TRANSACTIONS ON INFORMATION SYSTEMS | 2021年 / 39卷 / 03期

基金：

中国国家自然科学基金; 国家重点研发计划;

关键词：

Personalized search; reinforcement learning; Markov decision process (MDP);

D O I：

10.1145/3446617

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Personalized search is a promising way to improve search qualities by taking user interests into consideration. Recently, machine learning and deep learning techniques have been successfully applied to search result personalization. Most existing models simply regard the personal search history as a static set of user behaviors and learn fixed ranking strategies based on all the recorded data. Though improvements have been achieved, the essence that the search process is a sequence of interactions between the search engine and user is ignored. The user's interests may dynamically change during the search process, therefore, it would be more helpful if a personalized search model could track the whole interaction process and adjust its ranking strategy continuously. In this article, we adapt reinforcement learning to personalized search and propose a framework, referred to as RLPS. It utilizes a Markov Decision Process (MDP) to track sequential interactions between the user and search engine, and continuously update the underlying personalized ranking model with the user's real-time feedback to learn the user's dynamic interests. Within this framework, we implement two models: the listwise RLPS-L and the hierarchical RLPS-H. RLPS-L interacts with users and trains the ranking model with document lists, while RLPS-H improves model training by designing a layered structure and introducing document pairs. In addition, we also design a feedback-aware personalized ranking component to capture the user's feedback, which impacts the user interest profile for the next query. Significant improvements over existing personalized search models are observed in the experiments on the public AOL search log and a commercial log.

引用

页数：29

共 50 条

[1] Reinforcement Learning-Based Interactive Video Search
Ma, Zhixin
Wu, Jiaxin
Hou, Zhijian
Ngo, Chong-Wah
[J]. MULTIMEDIA MODELING, MMM 2022, PT II, 2022, 13142 : 549 - 555
[2] A Reinforcement Learning-Based Follow-up Framework
Astudillo, Javiera
Protopapas, Pavlos
Pichara, Karim
Becker, Ignacio
[J]. ASTRONOMICAL JOURNAL, 2023, 165 (03):
[3] Gamification Framework for Reinforcement Learning-based Neuropsychology Experiments
Chetitah, Mounsif
Mueller, Julian
Deserno, Lorenz
Waltmann, Maria
von Mammen, Sebastian
[J]. PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON THE FOUNDATIONS OF DIGITAL GAMES, FDG 2023, 2023,
[4] A Deep Reinforcement Learning-Based Framework for Content Caching
Zhong, Chen
Gursoy, M. Cenk
Velipasalar, Senem
[J]. 2018 52ND ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2018,
[5] Evolutionary Framework With Reinforcement Learning-Based Mutation Adaptation
Sallam, Karam M.
Elsayed, Saber M.
Chakrabortty, Ripon K.
Ryan, Michael J.
[J]. IEEE ACCESS, 2020, 8 : 194045 - 194071
[6] RLPer: A Reinforcement Learning Model for Personalized Search
Yao, Jing
Dou, Zhicheng
Xu, Jun
Wen, Ji-Rong
[J]. WEB CONFERENCE 2020: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2020), 2020, : 2298 - 2308
[7] A personalized ranking method based on inverse reinforcement learning in search engines
Karamiyan, Fatemeh
Mahootchi, Masoud
Mohebi, Azadeh
[J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 136
[8] Deep Reinforcement Learning-Based Control Framework for Multilateral Telesurgery
Bacha, Sarah Chams
Bai, Weibang
Wang, Ziwei
Xiao, Bo
Yeatman, Eric M.
[J]. IEEE TRANSACTIONS ON MEDICAL ROBOTICS AND BIONICS, 2022, 4 (02): : 352 - 355
[9] A Reinforcement Learning-Based Framework for the Exploitation of Multiple Rats in the IoT
Sandoval, Ruben M.
Canovas-Carrasco, Sebastian
Garcia-Sanchez, Antonio-Javier
Garcia-Haro, Joan
[J]. IEEE ACCESS, 2019, 7 : 123341 - 123354
[10] A Green DDPG Reinforcement Learning-Based Framework for Content Caching
Li, Qing
Sun, Yanhua
Wang, Qianwen
Meng, Li
Zhang, Yanhua
[J]. 2020 12TH INTERNATIONAL CONFERENCE ON COMMUNICATION SOFTWARE AND NETWORKS (ICCSN 2020), 2020, : 223 - 227

← 1 2 3 4 5 →