RLPS: A Reinforcement Learning-Based Framework for Personalized Search

被引:5
|
作者
Yao, Jing [1 ]
Dou, Zhicheng [2 ]
Xu, Jun [2 ]
Wen, Ji-Rong [3 ]
机构
[1] Renmin Univ China, Sch Informat, 59 Zhongguancun St, Beijing 100872, Peoples R China
[2] Renmin Univ China, Sch Artificial Intelligence, 59 Zhongguancun St, Beijing 100872, Peoples R China
[3] MOE, Beijing Key Lab Big Data Management & Anal Method, Key Lab Data Engn & Knowledge Engn, 59 Zhongguancun St, Beijing 100872, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Personalized search; reinforcement learning; Markov decision process (MDP);
D O I
10.1145/3446617
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Personalized search is a promising way to improve search qualities by taking user interests into consideration. Recently, machine learning and deep learning techniques have been successfully applied to search result personalization. Most existing models simply regard the personal search history as a static set of user behaviors and learn fixed ranking strategies based on all the recorded data. Though improvements have been achieved, the essence that the search process is a sequence of interactions between the search engine and user is ignored. The user's interests may dynamically change during the search process, therefore, it would be more helpful if a personalized search model could track the whole interaction process and adjust its ranking strategy continuously. In this article, we adapt reinforcement learning to personalized search and propose a framework, referred to as RLPS. It utilizes a Markov Decision Process (MDP) to track sequential interactions between the user and search engine, and continuously update the underlying personalized ranking model with the user's real-time feedback to learn the user's dynamic interests. Within this framework, we implement two models: the listwise RLPS-L and the hierarchical RLPS-H. RLPS-L interacts with users and trains the ranking model with document lists, while RLPS-H improves model training by designing a layered structure and introducing document pairs. In addition, we also design a feedback-aware personalized ranking component to capture the user's feedback, which impacts the user interest profile for the next query. Significant improvements over existing personalized search models are observed in the experiments on the public AOL search log and a commercial log.
引用
收藏
页数:29
相关论文
共 50 条
  • [1] Reinforcement Learning-Based Interactive Video Search
    Ma, Zhixin
    Wu, Jiaxin
    Hou, Zhijian
    Ngo, Chong-Wah
    [J]. MULTIMEDIA MODELING, MMM 2022, PT II, 2022, 13142 : 549 - 555
  • [2] A Reinforcement Learning-Based Follow-up Framework
    Astudillo, Javiera
    Protopapas, Pavlos
    Pichara, Karim
    Becker, Ignacio
    [J]. ASTRONOMICAL JOURNAL, 2023, 165 (03):
  • [3] Gamification Framework for Reinforcement Learning-based Neuropsychology Experiments
    Chetitah, Mounsif
    Mueller, Julian
    Deserno, Lorenz
    Waltmann, Maria
    von Mammen, Sebastian
    [J]. PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON THE FOUNDATIONS OF DIGITAL GAMES, FDG 2023, 2023,
  • [4] A Deep Reinforcement Learning-Based Framework for Content Caching
    Zhong, Chen
    Gursoy, M. Cenk
    Velipasalar, Senem
    [J]. 2018 52ND ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2018,
  • [5] Evolutionary Framework With Reinforcement Learning-Based Mutation Adaptation
    Sallam, Karam M.
    Elsayed, Saber M.
    Chakrabortty, Ripon K.
    Ryan, Michael J.
    [J]. IEEE ACCESS, 2020, 8 : 194045 - 194071
  • [6] RLPer: A Reinforcement Learning Model for Personalized Search
    Yao, Jing
    Dou, Zhicheng
    Xu, Jun
    Wen, Ji-Rong
    [J]. WEB CONFERENCE 2020: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2020), 2020, : 2298 - 2308
  • [7] A personalized ranking method based on inverse reinforcement learning in search engines
    Karamiyan, Fatemeh
    Mahootchi, Masoud
    Mohebi, Azadeh
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 136
  • [8] Deep Reinforcement Learning-Based Control Framework for Multilateral Telesurgery
    Bacha, Sarah Chams
    Bai, Weibang
    Wang, Ziwei
    Xiao, Bo
    Yeatman, Eric M.
    [J]. IEEE TRANSACTIONS ON MEDICAL ROBOTICS AND BIONICS, 2022, 4 (02): : 352 - 355
  • [9] A Reinforcement Learning-Based Framework for the Exploitation of Multiple Rats in the IoT
    Sandoval, Ruben M.
    Canovas-Carrasco, Sebastian
    Garcia-Sanchez, Antonio-Javier
    Garcia-Haro, Joan
    [J]. IEEE ACCESS, 2019, 7 : 123341 - 123354
  • [10] A Green DDPG Reinforcement Learning-Based Framework for Content Caching
    Li, Qing
    Sun, Yanhua
    Wang, Qianwen
    Meng, Li
    Zhang, Yanhua
    [J]. 2020 12TH INTERNATIONAL CONFERENCE ON COMMUNICATION SOFTWARE AND NETWORKS (ICCSN 2020), 2020, : 223 - 227