Learning to rank with click-through features in a reinforcement learning framework

被引:6
|
作者
Keyhanipour, Amir Hosein [1 ]
Moshiri, Behzad [2 ]
Piroozmand, Maryam [3 ]
Oroumchian, Farhad [4 ]
Moeini, Ali [5 ]
机构
[1] Univ Tehran, Sch Elect & Comp Engn, Coll Engn, Ctr Excellence, Tehran, Iran
[2] Univ Tehran, Sch ECE, Ctr Excellence, Control & Intelligent Proc, Tehran, Iran
[3] Amirkabir Univ Technol, Dept Comp Engn, Tehran, Iran
[4] Univ Wollongong, Knowledge Village, Dubai, U Arab Emirates
[5] Univ Tehran, Tehran, Iran
关键词
Click-through data; Learning to rank; Reinforcement learning;
D O I
10.1108/IJWIS-12-2015-0046
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Purpose - Learning to rank algorithms inherently faces many challenges. The most important challenges could be listed as high-dimensionality of the training data, the dynamic nature of Web information resources and lack of click-through data. High dimensionality of the training data affects effectiveness and efficiency of learning algorithms. Besides, most of learning to rank benchmark datasets do not include click-through data as a very rich source of information about the search behavior of users while dealing with the ranked lists of search results. To deal with these limitations, this paper aims to introduce a novel learning to rank algorithm by using a set of complex click-through features in a reinforcement learning (RL) model. These features are calculated from the existing click-through information in the data set or even from data sets without any explicit click-through information. Design/methodology/approach - The proposed ranking algorithm (QRC-Rank) applies RL techniques on a set of calculated click-through features. QRC-Rank is as a two-steps process. In the first step, Transformation phase, a compact benchmark data set is created which contains a set of click-through features. These feature are calculated from the original click-through information available in the data set and constitute a compact representation of click-through information. To find most effective click-through feature, a number of scenarios are investigated. The second phase is Model-Generation, in which a RL model is built to rank the documents. This model is created by applying temporal difference learning methods such as Q-Learning and SARSA. Findings - The proposed learning to rank method, QRC-rank, is evaluated on WCL2R and LETOR4.0 data sets. Experimental results demonstrate that QRC-Rank outperforms the state-of-the-art learning to rank methods such as SVMRank, RankBoost, ListNet and AdaRank based on the precision and normalized discount cumulative gain evaluation criteria. The use of the click-through features calculated from the training data set is a major contributor to the performance of the system. Originality/value - In this paper, we have demonstrated the viability of the proposed features that provide a compact representation for the click through data in a learning to rank application. These compact click-through features are calculated from the original features of the learning to rank benchmark data set. In addition, a Markov Decision Process model is proposed for the learning to rank problem using RL, including the sets of states, actions, rewarding strategy and the transition function.
引用
收藏
页码:448 / 476
页数:29
相关论文
共 50 条
  • [1] An ensemble learning framework for click-through rate prediction based on a reinforcement learning algorithm with parameterized actions
    Liu, Mengjuan
    Zheng, Daiwei
    Li, Jiaxing
    Hu, Zhengning
    Liu, Liu
    Ding, Yi
    [J]. KNOWLEDGE-BASED SYSTEMS, 2024, 283
  • [2] CF-Rank: Learning to rank by classifier fusion on click-through data
    Keyhanipour, Amir Hosein
    Moshiri, Behzad
    Rahgozar, Maseud
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (22) : 8597 - 8608
  • [3] Learning to rank: new approach with the layered multi-population genetic programming on click-through features
    Keyhanipour, Amir Hosein
    Moshiri, Behzad
    Oroumchian, Farhad
    Rahgozar, Maseud
    Badie, Kambiz
    [J]. GENETIC PROGRAMMING AND EVOLVABLE MACHINES, 2016, 17 (03) : 203 - 230
  • [4] Learning to rank: new approach with the layered multi-population genetic programming on click-through features
    Amir Hosein Keyhanipour
    Behzad Moshiri
    Farhad Oroumchian
    Maseud Rahgozar
    Kambiz Badie
    [J]. Genetic Programming and Evolvable Machines, 2016, 17 : 203 - 230
  • [5] A Novel Use of Reinforcement Learning for Elevated Click-Through Rate in Online Advertising
    Haider, Umair
    Yildiz, Beytullah
    [J]. 2023 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE, CSCI 2023, 2023, : 64 - 70
  • [6] RLNF: Reinforcement Learning based Noise Filtering for Click-Through Rate Prediction
    Zhao, Pu
    Luo, Chuan
    Zhou, Cheng
    Qiao, Bo
    He, Jiale
    Zhang, Liangjie
    Lin, Qingwei
    [J]. SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2021, : 2268 - 2272
  • [7] Click fraud resistant methods for learning click-through rates
    Immorlica, N
    Jain, K
    Mahdian, M
    Talwar, K
    [J]. INTERNET AND NETWORK ECONOMICS, PROCEEDINGS, 2005, 3828 : 34 - 45
  • [8] Deep Learning for Click-Through Rate Estimation
    Zhang, Weinan
    Qin, Jiarui
    Guo, Wei
    Tang, Ruiming
    He, Xiuqiang
    [J]. PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 4695 - 4703
  • [9] A joint learning model for click-through prediction in display advertising
    Liu, Mengjuan
    Cai, Shijia
    Lai, Zhi
    Qiu, Lizhou
    Hu, Zhengning
    Ding, Yi
    [J]. NEUROCOMPUTING, 2021, 445 : 206 - 219
  • [10] Learning to Retrieve User Behaviors for Click-through Rate Estimation
    Qin, Jiarui
    Zhang, Weinan
    Su, Rong
    Liu, Zhirong
    Liu, Weiwen
    Zhao, Guangpeng
    Li, Hao
    Tang, Ruiming
    He, Xiuqiang
    Yu, Yong
    [J]. ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2023, 41 (04)