An extensible approach for real-time bidding with model-free reinforcement learning

被引:4
|
作者
Cheng, Yin [1 ]
Zou, Luobao [1 ]
Zhuang, Zhiwei [1 ]
Liu, Jingwei [2 ]
Xu, Bin [3 ]
Zhang, Weidong [1 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Automat, Shanghai 200240, Peoples R China
[2] Qihoo 360 Technol Co Ltd, Beijing 100088, Peoples R China
[3] Northwestern Polytech Univ, Sch Automat, Xian 710072, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep reinforcement learning; Model-free; Extensible approach; Real-time bidding;
D O I
10.1016/j.neucom.2019.06.009
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose an extensible framework for model-free reinforcement learning (RL) for real-time bidding (RTB) in display advertising. This framework can be applied into both simple environments and extend to the comprehensive environment that the DSP bids for multiple advertisers at the same time. To process new information that is collected via real-time interaction with the environment, an extensible model is first introduced, which is based on the distribution of the recharging probability. Substantial effort is expended to alleviate the problem of the sparsity of the click signal with the reward function. The proposed scheme has high feasibility and can address dynamic environments in contrast to prior works, which assumed that the distribution of the feature vectors and the dealing price were already known. Furthermore, a fund-recharging mechanism is introduced for transforming the RTB model into an endless task, which allows the policy to be optimized in a farsighted rather than a myopic manner. Illustrative experiments on both the small- and large-scale real datasets demonstrate the state-of-the-art performance of the proposed framework for the issue of interest. (C) 2019 Published by Elsevier B.V.
引用
收藏
页码:97 / 106
页数:10
相关论文
共 50 条
  • [1] An Intelligent Bidding Strategy Based on Model-Free Reinforcement Learning for Real-Time Bidding in Display Advertising
    Liu, Mengjuan
    Li, Jiaxing
    Yue, Wei
    Qiu, Lizhou
    Liu, Jinyu
    Qin, Zhiguang
    [J]. 2019 SEVENTH INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD), 2019, : 240 - 245
  • [2] Model-Free Real-Time EV Charging Scheduling Based on Deep Reinforcement Learning
    Wan, Zhiqiang
    Li, Hepeng
    He, Haibo
    Prokhorov, Danil
    [J]. IEEE TRANSACTIONS ON SMART GRID, 2019, 10 (05) : 5246 - 5257
  • [3] A Model-Free Deep Reinforcement Learning-Based Approach for Assessment of Real-Time PV Hosting Capacity
    Suchithra, Jude
    Robinson, Duane A.
    Rajabi, Amin
    [J]. ENERGIES, 2024, 17 (09)
  • [4] An Expected Win Rate-Based Real Time Bidding Strategy for Branding Campaign by the Model-Free Reinforcement Learning Model
    Shih, Wen-Yueh
    Lu, Yi-Shu
    Tsai, Hsiao-Ping
    Huang, Jiun-Long
    [J]. IEEE ACCESS, 2020, 8 : 151952 - 151967
  • [5] Real-Time Bidding by Reinforcement Learning in Display Advertising
    Cai, Han
    Ren, Kan
    Zhang, Weinan
    Malialis, Kleanthis
    Wang, Jun
    Yu, Yong
    Guo, Defeng
    [J]. WSDM'17: PROCEEDINGS OF THE TENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2017, : 661 - 670
  • [6] Real-Time Model-Free Deep Reinforcement Learning for Force Control of a Series Elastic Actuator
    Sambhus, Ruturaj
    Gokce, Aydin
    Welch, Stephen
    Herron, Connor W.
    Leonessa, Alexander
    [J]. 2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 5645 - 5652
  • [7] Budget Constrained Bidding by Model-free Reinforcement Learning in Display Advertising
    Wu, Di
    Chen, Xiujun
    Yang, Xun
    Wang, Hao
    Tan, Qing
    Zhang, Xiaoxun
    Xu, Jian
    Gai, Kun
    [J]. CIKM'18: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2018, : 1443 - 1451
  • [8] Real-time dynamic pricing in a non-stationary environment using model-free reinforcement learning
    Rana, Rupal
    Oliveira, Fernando S.
    [J]. OMEGA-INTERNATIONAL JOURNAL OF MANAGEMENT SCIENCE, 2014, 47 : 116 - 126
  • [9] Deep Reinforcement Learning for Sponsored Search Real-time Bidding
    Zhao, Jun
    Qiu, Guang
    Guan, Ziyu
    Zhao, Wei
    He, Xiaofei
    [J]. KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, : 1021 - 1030
  • [10] Reinforcement Learning with Sequential Information Clustering in Real-Time Bidding
    Lu, Junwei
    Yang, Chaoqi
    Gao, Xiaofeng
    Wang, Liubin
    Li, Changcheng
    Chen, Guihai
    [J]. PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 1633 - 1641