A Dynamic Bidding Strategy Based on Model-Free Reinforcement Learning in Display Advertising

被引:6
|
作者
Liu, Mengjuan [1 ]
Jiaxing, Li [1 ]
Hu, Zhengning [1 ]
Liu, Jinyu [1 ]
Nie, Xuyun [1 ]
机构
[1] Univ Elect Sci & Technol China, Network & Data Secur Key Lab Sichuan Prov, Chengdu 610054, Peoples R China
来源
IEEE ACCESS | 2020年 / 8卷
基金
中国国家自然科学基金;
关键词
Real-time bidding; bid optimization; model-free reinforcement learning;
D O I
10.1109/ACCESS.2020.3037940
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Real-time bidding (RTB) is one of the most striking advances in online advertising, where the websites can sell each ad impression through a public auction, and the advertisers can participate in bidding the impression based on its estimated value. In RTB, the bidding strategy is an essential component for advertisers to maximize their revenues (e.g., clicks and conversions). However, most existing bidding strategies may not work well when the RTB environment changes dramatically between the historical and the new ad delivery periods since they regard the bidding decision as a static optimization problem and derive the bidding function only based on historical data. Thus, the latest research suggests using the reinforcement learning (RL) framework to learn the optimal bidding strategy suitable for the highly dynamic RTB environment. In this paper, we focus on using model-free reinforcement learning to optimize the bidding strategy. Specifically, we divide an ad delivery period into several time slots. The bidding agent decides each impression's bidding price depending on its estimated value and the bidding factor of its arriving time slot. Therefore, the bidding strategy is simplified to solve each time slot's optimal bidding factor, which can adapt dynamically to the RTB environment. We exploit the Twin Delayed Deep Deterministic policy gradient (TD3) algorithm to learn each time slot's optimal bidding factor. Finally, the empirical study on a public dataset demonstrates the superior performance and high efficiency of the proposed bidding strategy compared with other state-of-the-art baselines.
引用
收藏
页码:213587 / 213601
页数:15
相关论文
共 50 条
  • [1] An Intelligent Bidding Strategy Based on Model-Free Reinforcement Learning for Real-Time Bidding in Display Advertising
    Liu, Mengjuan
    Li, Jiaxing
    Yue, Wei
    Qiu, Lizhou
    Liu, Jinyu
    Qin, Zhiguang
    [J]. 2019 SEVENTH INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD), 2019, : 240 - 245
  • [2] Budget Constrained Bidding by Model-free Reinforcement Learning in Display Advertising
    Wu, Di
    Chen, Xiujun
    Yang, Xun
    Wang, Hao
    Tan, Qing
    Zhang, Xiaoxun
    Xu, Jian
    Gai, Kun
    [J]. CIKM'18: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2018, : 1443 - 1451
  • [3] Dynamic Bidding Strategy Based on Probabilistic Feedback in Display Advertising
    Wu, Yuzhu
    Pan, Shumin
    Zhang, Qianwen
    Xie, Jinkui
    [J]. NEURAL INFORMATION PROCESSING (ICONIP 2017), PT II, 2017, 10635 : 845 - 853
  • [4] An Expected Win Rate-Based Real Time Bidding Strategy for Branding Campaign by the Model-Free Reinforcement Learning Model
    Shih, Wen-Yueh
    Lu, Yi-Shu
    Tsai, Hsiao-Ping
    Huang, Jiun-Long
    [J]. IEEE ACCESS, 2020, 8 : 151952 - 151967
  • [5] Real-Time Bidding by Reinforcement Learning in Display Advertising
    Cai, Han
    Ren, Kan
    Zhang, Weinan
    Malialis, Kleanthis
    Wang, Jun
    Yu, Yong
    Guo, Defeng
    [J]. WSDM'17: PROCEEDINGS OF THE TENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2017, : 661 - 670
  • [6] An Actor-critic Reinforcement Learning Model for Optimal Bidding in Online Display Advertising
    Yuan, Congde
    Guo, Mengzhuo
    Xiang, Chaoneng
    Wang, Shuangyang
    Song, Guoqing
    Zhang, Qingpeng
    [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 3604 - 3613
  • [7] An extensible approach for real-time bidding with model-free reinforcement learning
    Cheng, Yin
    Zou, Luobao
    Zhuang, Zhiwei
    Liu, Jingwei
    Xu, Bin
    Zhang, Weidong
    [J]. NEUROCOMPUTING, 2019, 360 : 97 - 106
  • [8] Model-free Based Reinforcement Learning Control Strategy of Aircraft Attitude Systems
    Huang, Dingcui
    Hu, Jiangping
    Peng, Zhinan
    Chen, Bo
    Hao, Mingrui
    Ghosh, Bijoy Kumar
    [J]. 2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 743 - 748
  • [9] Model-Free Preference-Based Reinforcement Learning
    Wirth, Christian
    Fuernkranz, Johannes
    Neumann, Gerhard
    [J]. THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 2222 - 2228
  • [10] Reinforcement learning based model-free optimized trajectory tracking strategy design for an AUV
    Duan, Kairong
    Fong, Simon
    Chen, C. L. Philip
    [J]. NEUROCOMPUTING, 2022, 469 : 289 - 297