Reinforcement Learning Method for Ad Networks Ordering in Real-Time Bidding

被引：2

作者：

Afshar, Reza Refaei ^{[1
]}

Zhang, Yingqian ^{[1
]}

Firat, Murat ^{[1
]}

Kaymak, Uzay ^{[1
]}

机构：

[1] Eindhoven Univ Technol, NL-5600 MB Eindhoven, Netherlands

来源：

AGENTS AND ARTIFICIAL INTELLIGENCE, ICAART 2019 | 2019年 / 11978卷

关键词：

Reinforcement learning; Real time bidding; Waterfall strategy;

D O I：

10.1007/978-3-030-37494-5_2

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

High turnover of online advertising and especially real time bidding makes this ad market very attractive to beneficiary stakeholders. For publishers, it is as easy as placing some slots in their webpages and sell these slots in the available online auctions. It is important to determine which online auction market to send their slots to. Based on the traditional Waterfall Strategy, publishers have a fixed ordering of preferred online auction markets, and sell the ad slots by trying these markets sequentially. This fixed-order strategy replies heavily on the experience of publishers, and often it does not provide highest revenue. In this paper, we propose a method for dynamically deciding on the ordering of auction markets for each available ad slot. This method is based on reinforcement learning (RL) and learns the state-action through a tabular method. Since the state-action space is sparse, a prediction model is used to solve this sparsity. We analyze a real-time bidding dataset, and then show that the proposed RL method on this dataset leads to higher revenues. In addition, a sensitivity analysis is performed on the parameters of the method.

引用

页码：16 / 36

页数：21

共 50 条

[41] Real-Time Virtual Machine Scheduling in Industry IoT Network: A Reinforcement Learning Method
Ma, Xiaojin
Xu, Huahu
Gao, Honghao
Bian, Minjie
Hussain, Walayat
[J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (02) : 2129 - 2139
[42] A Real-Time and Optimal Hypersonic Entry Guidance Method Using Inverse Reinforcement Learning
Su, Linfeng
Wang, Jinbo
Chen, Hongbo
Pezzella, Giuseppe
[J]. AEROSPACE, 2023, 10 (11)
[43] Real-time Stochastic Dispatch Method for Incremental Distribution Network Based on Reinforcement Learning
Li J.
Yu T.
Pan Z.
[J]. Dianwang Jishu/Power System Technology, 2020, 44 (09): : 3321 - 3330
[44] Real-time planning and collision avoidance control method based on deep reinforcement learning
Xu, Xinli
Cai, Peng
Cao, Yunlong
Chu, Zhenzhong
Zhu, Wenbo
Zhang, Weidong
[J]. OCEAN ENGINEERING, 2023, 281
[45] Learning to Calibrate Battery Models in Real-Time with Deep Reinforcement Learning
Unagar, Ajaykumar
Tian, Yuan
Chao, Manuel Arias
Fink, Olga
[J]. ENERGIES, 2021, 14 (05)
[46] Integration of Adaptive Control and Reinforcement Learning for Real-Time Control and Learning
Annaswamy, Anuradha M.
Guha, Anubhav
Cui, Yingnan
Tang, Sunbochen
Fisher, Peter A.
Gaudio, Joseph E.
[J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (12) : 7740 - 7755
[47] Real-time outage management in active distribution networks using reinforcement learning over graphs
Jacob, Roshni Anna
Paul, Steve
Chowdhury, Souma
Gel, Yulia R.
Zhang, Jie
[J]. NATURE COMMUNICATIONS, 2024, 15 (01)
[48] Deep reinforcement learning and enhanced optimization for real-time energy management in wireless sensor networks
[J]. Sachithanandam, Vidhya (vidhya@saveetha.ac.in), 2025, 45
[49] Real-time learning capability of neural networks
Huang, Guang-Bin
Zhu, Qin-Yu
Siew, Chee-Kheong
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 2006, 17 (04): : 863 - 878
[50] A Framework for Mobile Ad hoc Networks in Real-Time Maude
Liu, Si
Olveczky, Peter Csaba
Meseguer, Jose
[J]. REWRITING LOGIC AND ITS APPLICATIONS, WRLA 2014, 2014, 8663 : 162 - 177

← 1 2 3 4 5 →