Reinforcement Learning Method for Ad Networks Ordering in Real-Time Bidding

Cited by: 2
Authors
Afshar, Reza Refaei [1]
Zhang, Yingqian [1]
Firat, Murat [1]
Kaymak, Uzay [1]
Affiliations
[1] Eindhoven Univ Technol, NL-5600 MB Eindhoven, Netherlands
Keywords
Reinforcement learning; Real-time bidding; Waterfall strategy
DOI
10.1007/978-3-030-37494-5_2
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The high turnover of online advertising, and especially of real-time bidding, makes this ad market very attractive to its stakeholders. For publishers, participating is as easy as placing ad slots on their webpages and selling these slots in the available online auctions. It is therefore important to determine which online auction market to send each slot to. Under the traditional Waterfall Strategy, publishers keep a fixed ordering of preferred online auction markets and sell ad slots by trying these markets sequentially. This fixed-order strategy relies heavily on the experience of publishers and often does not provide the highest revenue. In this paper, we propose a method for dynamically deciding on the ordering of auction markets for each available ad slot. The method is based on reinforcement learning (RL) and learns state-action values through a tabular method. Since the state-action space is sparse, a prediction model is used to address this sparsity. We analyze a real-time bidding dataset and show that the proposed RL method leads to higher revenues on this dataset. In addition, a sensitivity analysis is performed on the parameters of the method.
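The idea in the abstract, choosing the next auction market for each ad slot with a tabular RL method, can be illustrated with a minimal Q-learning sketch. This is not the authors' implementation: the state encoding (slot type plus the set of networks already tried), the reward (revenue of a sale, 0.0 for no sale), the hyperparameter defaults, and the names `train_ordering`, `greedy_order`, and `simulate` are all assumptions made for illustration. The prediction model the paper uses for sparse state-action pairs is not reproduced; unvisited entries simply default to 0.0 here.

```python
import random
from collections import defaultdict


def train_ordering(simulate, networks, slot_types, episodes=5000,
                   alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning sketch for ordering ad networks (hypothetical setup).

    simulate(slot, network, rng) is an assumed callback that returns the
    revenue of offering `slot` to `network`; 0.0 means the slot was not sold.
    """
    rng = random.Random(seed)
    q = defaultdict(float)  # (state, action) -> estimated discounted revenue
    for _ in range(episodes):
        slot = rng.choice(slot_types)
        tried = frozenset()
        while len(tried) < len(networks):
            state = (slot, tried)
            options = [n for n in networks if n not in tried]
            # Epsilon-greedy choice of the next network to try.
            if rng.random() < epsilon:
                action = rng.choice(options)
            else:
                action = max(options, key=lambda n: q[(state, n)])
            revenue = simulate(slot, action, rng)
            next_tried = tried | {action}
            # The episode ends on a sale or when every network was tried.
            done = revenue > 0 or len(next_tried) == len(networks)
            if done:
                target = revenue
            else:
                next_state = (slot, next_tried)
                target = revenue + gamma * max(
                    q[(next_state, n)] for n in networks if n not in next_tried)
            # Standard Q-learning update toward the bootstrapped target.
            q[(state, action)] += alpha * (target - q[(state, action)])
            if done:
                break
            tried = next_tried
    return q


def greedy_order(q, slot, networks):
    """Read a full ordering off the table, assuming every attempt fails."""
    tried, order = frozenset(), []
    while len(tried) < len(networks):
        state = (slot, tried)
        best = max((n for n in networks if n not in tried),
                   key=lambda n: q[(state, n)])
        order.append(best)
        tried = tried | {best}
    return order
```

Calling `train_ordering` with a simulator built from logged auction outcomes and then `greedy_order(q, slot, networks)` would give a per-slot ordering to compare against a fixed waterfall. States that were never visited keep their default value, which is exactly the sparsity problem the abstract says a prediction model is meant to address.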
Pages: 16-36
Number of pages: 21