Reinforcement Learning Method for Ad Networks Ordering in Real-Time Bidding

被引:2
|
作者
Afshar, Reza Refaei [1 ]
Zhang, Yingqian [1 ]
Firat, Murat [1 ]
Kaymak, Uzay [1 ]
机构
[1] Eindhoven Univ Technol, NL-5600 MB Eindhoven, Netherlands
关键词
Reinforcement learning; Real time bidding; Waterfall strategy;
D O I
10.1007/978-3-030-37494-5_2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
High turnover of online advertising and especially real time bidding makes this ad market very attractive to beneficiary stakeholders. For publishers, it is as easy as placing some slots in their webpages and sell these slots in the available online auctions. It is important to determine which online auction market to send their slots to. Based on the traditional Waterfall Strategy, publishers have a fixed ordering of preferred online auction markets, and sell the ad slots by trying these markets sequentially. This fixed-order strategy replies heavily on the experience of publishers, and often it does not provide highest revenue. In this paper, we propose a method for dynamically deciding on the ordering of auction markets for each available ad slot. This method is based on reinforcement learning (RL) and learns the state-action through a tabular method. Since the state-action space is sparse, a prediction model is used to solve this sparsity. We analyze a real-time bidding dataset, and then show that the proposed RL method on this dataset leads to higher revenues. In addition, a sensitivity analysis is performed on the parameters of the method.
引用
收藏
页码:16 / 36
页数:21
相关论文
共 50 条
  • [1] Real-Time Bidding by Reinforcement Learning in Display Advertising
    Cai, Han
    Ren, Kan
    Zhang, Weinan
    Malialis, Kleanthis
    Wang, Jun
    Yu, Yong
    Guo, Defeng
    [J]. WSDM'17: PROCEEDINGS OF THE TENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2017, : 661 - 670
  • [2] Deep Reinforcement Learning for Sponsored Search Real-time Bidding
    Zhao, Jun
    Qiu, Guang
    Guan, Ziyu
    Zhao, Wei
    He, Xiaofei
    [J]. KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, : 1021 - 1030
  • [3] Reinforcement Learning with Sequential Information Clustering in Real-Time Bidding
    Lu, Junwei
    Yang, Chaoqi
    Gao, Xiaofeng
    Wang, Liubin
    Li, Changcheng
    Chen, Guihai
    [J]. PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 1633 - 1641
  • [4] Real-Time Bidding with Multi-Agent Reinforcement Learning in Display Advertising
    Jin, Junqi
    Song, Chengru
    Li, Han
    Gai, Kun
    Wang, Jun
    Zhang, Weinan
    [J]. CIKM'18: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2018, : 2193 - 2201
  • [5] An extensible approach for real-time bidding with model-free reinforcement learning
    Cheng, Yin
    Zou, Luobao
    Zhuang, Zhiwei
    Liu, Jingwei
    Xu, Bin
    Zhang, Weidong
    [J]. NEUROCOMPUTING, 2019, 360 : 97 - 106
  • [6] Evolving population method for real-time reinforcement learning
    Kim, Man-Je
    Kim, Jun Suk
    Ahn, Chang Wook
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2023, 229
  • [7] An Intelligent Bidding Strategy Based on Model-Free Reinforcement Learning for Real-Time Bidding in Display Advertising
    Liu, Mengjuan
    Li, Jiaxing
    Yue, Wei
    Qiu, Lizhou
    Liu, Jinyu
    Qin, Zhiguang
    [J]. 2019 SEVENTH INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD), 2019, : 240 - 245
  • [8] Real-Time Bidding with Soft Actor-Critic Reinforcement Learning in Display Advertising
    Yakovleva, Dania
    Popov, Artem
    Filchenkov, Andrey
    [J]. PROCEEDINGS OF THE 2019 25TH CONFERENCE OF OPEN INNOVATIONS ASSOCIATION (FRUCT), 2019, : 373 - 382
  • [9] Deep Reinforcement Learning Based Real-Time Renewable Energy Bidding with Battery Control
    Jeong, Jaeik
    Kim, Seung Wan
    Kim, Hongseok
    [J]. IEEE Transactions on Energy Markets, Policy and Regulation, 2023, 1 (02): : 85 - 96
  • [10] Real-time routing algorithm for mobile ad hoc networks using reinforcement learning and heuristic algorithms
    Ghaffari, Ali
    [J]. WIRELESS NETWORKS, 2017, 23 (03) : 703 - 714