Real-Time Bidding with Soft Actor-Critic Reinforcement Learning in Display Advertising

被引:0
|
作者
Yakovleva, Dania [1 ]
Popov, Artem [1 ]
Filchenkov, Andrey [1 ]
机构
[1] ITMO Univ, St Petersburg, Russia
关键词
D O I
10.23919/fruct48121.2019.8981496
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The main task of advertising companies is to sell goods and services interesting to the user. Online auctions are the main mechanism for selecting ads to the user. Dynamic bidding allows advertiser to automatically calculate the bid that is profitable to set to maximize goals (for example, the number of clicks on an ad), depending on the user who sees the ad. In this case the advertiser must specify the budget of the ad and the optimization goal. During the advertising campaign the bid for each impression will be calculated by a special algorithm. In this paper, we propose a novel algorithm for calculating the dynamic bid for each impression of the ad in order to maximize the advertiser's goals, which takes into account settings of the advertising campaign, budget, the ad lifetime and other parameters. This task is formulated as reinforcement learning problem, where states are the status of auction and parameters of the advertising campaign, the actions are bidding for each ad based on the input state. Every ad has an agent who observes the states all the time and calculates the bid for the impression. We evaluated the proposed model on real advertising campaigns in a large social network. Our method achieved average 26% improvement in comparison with the state-of-the-art approach.
引用
收藏
页码:373 / 382
页数:10
相关论文
共 50 条
  • [1] An Actor-critic Reinforcement Learning Model for Optimal Bidding in Online Display Advertising
    Yuan, Congde
    Guo, Mengzhuo
    Xiang, Chaoneng
    Wang, Shuangyang
    Song, Guoqing
    Zhang, Qingpeng
    [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 3604 - 3613
  • [2] Real-Time Bidding by Reinforcement Learning in Display Advertising
    Cai, Han
    Ren, Kan
    Zhang, Weinan
    Malialis, Kleanthis
    Wang, Jun
    Yu, Yong
    Guo, Defeng
    [J]. WSDM'17: PROCEEDINGS OF THE TENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2017, : 661 - 670
  • [3] Actor-critic reinforcement learning for bidding in bilateral negotiation
    Arslan, Furkan
    Aydogan, Reyhan
    [J]. TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2022, 30 (05) : 1695 - 1714
  • [4] Real-Time 'Actor-Critic' Tracking
    Chen, Boyu
    Wang, Dong
    Li, Peixia
    Wang, Shuang
    Lu, Huchuan
    [J]. COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 : 328 - 345
  • [5] Real-Time Bidding with Multi-Agent Reinforcement Learning in Display Advertising
    Jin, Junqi
    Song, Chengru
    Li, Han
    Gai, Kun
    Wang, Jun
    Zhang, Weinan
    [J]. CIKM'18: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2018, : 2193 - 2201
  • [6] Averaged Soft Actor-Critic for Deep Reinforcement Learning
    Ding, Feng
    Ma, Guanfeng
    Chen, Zhikui
    Gao, Jing
    Li, Peng
    [J]. COMPLEXITY, 2021, 2021
  • [7] An Intelligent Bidding Strategy Based on Model-Free Reinforcement Learning for Real-Time Bidding in Display Advertising
    Liu, Mengjuan
    Li, Jiaxing
    Yue, Wei
    Qiu, Lizhou
    Liu, Jinyu
    Qin, Zhiguang
    [J]. 2019 SEVENTH INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD), 2019, : 240 - 245
  • [8] A soft actor-critic reinforcement learning algorithm for network intrusion detection
    Li, Zhengfa
    Huang, Chuanhe
    Deng, Shuhua
    Qiu, Wanyu
    Gao, Xieping
    [J]. COMPUTERS & SECURITY, 2023, 135
  • [9] Improving Generalization of Reinforcement Learning with Minimax Distributional Soft Actor-Critic
    Ren, Yangang
    Duan, Jingliang
    Li, Shengbo Eben
    Guan, Yang
    Sun, Qi
    [J]. 2020 IEEE 23RD INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2020,
  • [10] A Novel Actor-Critic Motor Reinforcement Learning for Continuum Soft Robots
    Pantoja-Garcia, Luis
    Parra-Vega, Vicente
    Garcia-Rodriguez, Rodolfo
    Vazquez-Garcia, Carlos Ernesto
    [J]. ROBOTICS, 2023, 12 (05)