Learning to rank with click-through features in a reinforcement learning framework

被引：6

作者：

Keyhanipour, Amir Hosein ^{[1
]}

Moshiri, Behzad ^{[2
]}

Piroozmand, Maryam ^{[3
]}

Oroumchian, Farhad ^{[4
]}

Moeini, Ali ^{[5
]}

机构：

[1] Univ Tehran, Sch Elect & Comp Engn, Coll Engn, Ctr Excellence, Tehran, Iran

[2] Univ Tehran, Sch ECE, Ctr Excellence, Control & Intelligent Proc, Tehran, Iran

[3] Amirkabir Univ Technol, Dept Comp Engn, Tehran, Iran

[4] Univ Wollongong, Knowledge Village, Dubai, U Arab Emirates

[5] Univ Tehran, Tehran, Iran

来源：

INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS | 2016年 / 12卷 / 04期

关键词：

Click-through data; Learning to rank; Reinforcement learning;

D O I：

10.1108/IJWIS-12-2015-0046

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Purpose - Learning to rank algorithms inherently faces many challenges. The most important challenges could be listed as high-dimensionality of the training data, the dynamic nature of Web information resources and lack of click-through data. High dimensionality of the training data affects effectiveness and efficiency of learning algorithms. Besides, most of learning to rank benchmark datasets do not include click-through data as a very rich source of information about the search behavior of users while dealing with the ranked lists of search results. To deal with these limitations, this paper aims to introduce a novel learning to rank algorithm by using a set of complex click-through features in a reinforcement learning (RL) model. These features are calculated from the existing click-through information in the data set or even from data sets without any explicit click-through information. Design/methodology/approach - The proposed ranking algorithm (QRC-Rank) applies RL techniques on a set of calculated click-through features. QRC-Rank is as a two-steps process. In the first step, Transformation phase, a compact benchmark data set is created which contains a set of click-through features. These feature are calculated from the original click-through information available in the data set and constitute a compact representation of click-through information. To find most effective click-through feature, a number of scenarios are investigated. The second phase is Model-Generation, in which a RL model is built to rank the documents. This model is created by applying temporal difference learning methods such as Q-Learning and SARSA. Findings - The proposed learning to rank method, QRC-rank, is evaluated on WCL2R and LETOR4.0 data sets. Experimental results demonstrate that QRC-Rank outperforms the state-of-the-art learning to rank methods such as SVMRank, RankBoost, ListNet and AdaRank based on the precision and normalized discount cumulative gain evaluation criteria. The use of the click-through features calculated from the training data set is a major contributor to the performance of the system. Originality/value - In this paper, we have demonstrated the viability of the proposed features that provide a compact representation for the click through data in a learning to rank application. These compact click-through features are calculated from the original features of the learning to rank benchmark data set. In addition, a Markov Decision Process model is proposed for the learning to rank problem using RL, including the sets of states, actions, rewarding strategy and the transition function.

引用

页码：448 / 476

页数：29

共 50 条

[1] An ensemble learning framework for click-through rate prediction based on a reinforcement learning algorithm with parameterized actions
Liu, Mengjuan
Zheng, Daiwei
Li, Jiaxing
Hu, Zhengning
Liu, Liu
Ding, Yi
[J]. KNOWLEDGE-BASED SYSTEMS, 2024, 283
[2] CF-Rank: Learning to rank by classifier fusion on click-through data
Keyhanipour, Amir Hosein
Moshiri, Behzad
Rahgozar, Maseud
[J]. EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (22) : 8597 - 8608
[3] Learning to rank: new approach with the layered multi-population genetic programming on click-through features
Keyhanipour, Amir Hosein
Moshiri, Behzad
Oroumchian, Farhad
Rahgozar, Maseud
Badie, Kambiz
[J]. GENETIC PROGRAMMING AND EVOLVABLE MACHINES, 2016, 17 (03) : 203 - 230
[4] Learning to rank: new approach with the layered multi-population genetic programming on click-through features
Amir Hosein Keyhanipour
Behzad Moshiri
Farhad Oroumchian
Maseud Rahgozar
Kambiz Badie
[J]. Genetic Programming and Evolvable Machines, 2016, 17 : 203 - 230
[5] A Novel Use of Reinforcement Learning for Elevated Click-Through Rate in Online Advertising
Haider, Umair
Yildiz, Beytullah
[J]. 2023 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE, CSCI 2023, 2023, : 64 - 70
[6] RLNF: Reinforcement Learning based Noise Filtering for Click-Through Rate Prediction
Zhao, Pu
Luo, Chuan
Zhou, Cheng
Qiao, Bo
He, Jiale
Zhang, Liangjie
Lin, Qingwei
[J]. SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2021, : 2268 - 2272
[7] Click fraud resistant methods for learning click-through rates
Immorlica, N
Jain, K
Mahdian, M
Talwar, K
[J]. INTERNET AND NETWORK ECONOMICS, PROCEEDINGS, 2005, 3828 : 34 - 45
[8] Deep Learning for Click-Through Rate Estimation
Zhang, Weinan
Qin, Jiarui
Guo, Wei
Tang, Ruiming
He, Xiuqiang
[J]. PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 4695 - 4703
[9] A joint learning model for click-through prediction in display advertising
Liu, Mengjuan
Cai, Shijia
Lai, Zhi
Qiu, Lizhou
Hu, Zhengning
Ding, Yi
[J]. NEUROCOMPUTING, 2021, 445 : 206 - 219
[10] Learning to Retrieve User Behaviors for Click-through Rate Estimation
Qin, Jiarui
Zhang, Weinan
Su, Rong
Liu, Zhirong
Liu, Weiwen
Zhao, Guangpeng
Li, Hao
Tang, Ruiming
He, Xiuqiang
Yu, Yong
[J]. ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2023, 41 (04)

← 1 2 3 4 5 →