Real-time dynamic pricing in a non-stationary environment using model-free reinforcement learning

被引:47
|
作者
Rana, Rupal [1 ]
Oliveira, Fernando S. [2 ]
机构
[1] Univ Loughborough, Sch Business & Econ, Loughborough, Leics, England
[2] ESSEC Univ, ESSEC Business Sch, Singapore, Singapore
关键词
Revenue management; Dynamic pricing; Reinforcement learning; Simulation; REVENUE MANAGEMENT; YIELD MANAGEMENT; SUPPLY CHAIN; UNCERTAINTY;
D O I
10.1016/j.omega.2013.10.004
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
This paper examines the problem of establishing a pricing policy that maximizes the revenue for selling a given inventory by a fixed deadline. This problem is faced by a variety of industries, including airlines, hotels and fashion. Reinforcement learning algorithms are used to analyze how firms can both learn and optimize their pricing strategies while interacting with their customers. We show that by using reinforcement learning we can model the problem with inter-dependent demands. This type of model can be useful in producing a more accurate pricing scheme of services or products when important events affect consumer preferences. This paper proposes a methodology to optimize revenue in a model-free environment in which demand is learned and pricing decisions are updated in real-time. We compare the performance of the learning algorithms using Monte-Carlo simulation. Crown Copyright (C) 2013 Published by Elsevier Ltd. All rights reserved.
引用
收藏
页码:116 / 126
页数:11
相关论文
共 50 条
  • [41] Real-Time Detection of Non-Stationary Objects Using Intensity Data in Automotive LiDAR SLAM
    Nowak, Tomasz
    Cwian, Krzysztof
    Skrzypczynski, Piotr
    [J]. SENSORS, 2021, 21 (20)
  • [42] Model-Free Non-Stationarity Detection and Adaptation in Reinforcement Learning
    Canonaco, Giuseppe
    Restelli, Marcello
    Roveri, Manuel
    [J]. ECAI 2020: 24TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, 325 : 1047 - 1054
  • [43] Mungojerrie: Linear-Time Objectives in Model-Free Reinforcement Learning
    Hahn, Ernst Moritz
    Perez, Mateo
    Schewe, Sven
    Somenzi, Fabio
    Trivedi, Ashutosh
    Wojtczak, Dominik
    [J]. TOOLS AND ALGORITHMS FOR THE CONSTRUCTION AND ANALYSIS OF SYSTEMS, PT I, TACAS 2023, 2023, 13993 : 527 - 545
  • [44] Predictive reinforcement learning in non-stationary environments using weighted mixture policy
    Pourshamsaei, Hossein
    Nobakhti, Amin
    [J]. APPLIED SOFT COMPUTING, 2024, 153
  • [45] Real-time model calibration with deep reinforcement learning
    Tian, Yuan
    Chao, Manuel Arias
    Kulkarni, Chetan
    Goebel, Kai
    Fink, Olga
    [J]. MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2022, 165
  • [46] Model-Free Real-Time Autonomous Energy Management for a Residential Multi-Carrier Energy System: A Deep Reinforcement Learning Approach
    Ye, Yujian
    Qiu, Dawei
    Ward, Jonathan
    Abram, Marcin
    [J]. PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 339 - 346
  • [47] Model-free real-time optimization of process systems using safe Bayesian optimization
    Krishnamoorthy, Dinesh
    Doyle, Francis J.
    [J]. AICHE JOURNAL, 2023, 69 (04)
  • [48] Accelerating Model-Free Reinforcement Learning With Imperfect Model Knowledge in Dynamic Spectrum Access
    Li, Lianjun
    Liu, Lingjia
    Bai, Jianan
    Chang, Hao-Hsuan
    Chen, Hao
    Ashdown, Jonathan D.
    Zhang, Jianzhong
    Yi, Yang
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (08): : 7517 - 7528
  • [49] Model-free aperiodic tracking for discrete-time systems using hierarchical reinforcement learning
    Tian, Yingqiang
    Wan, Haiying
    Karimi, Hamid Reza
    Luan, Xiaoli
    Liu, Fei
    [J]. NEUROCOMPUTING, 2024, 609
  • [50] REINFORCEMENT LEARNING ON MODULAR ROBOTS USING REAL-TIME DECENTRALIZED DECONFLICTION IN A WAREHOUSE ENVIRONMENT
    Sayed, Andres
    Scales, Sara
    Fox, Austin
    Stone, Jafar
    Crews, Steven
    [J]. PROCEEDINGS OF ASME 2021 INTERNATIONAL MECHANICAL ENGINEERING CONGRESS AND EXPOSITION (IMECE2021), VOL 6, 2021,