Deep Reinforcement Learning for Dynamic Pricing of Perishable Products

Cited by: 2
Authors
Burman, Vibhati [1]
Vashishtha, Rajesh Kumar [1]
Kumar, Rajan
Ramanan, Sharadha [1]
Affiliations
[1] TCS Res, Chennai, India
Source
Keywords
Dynamic pricing; Deep reinforcement learning; Perishable items; Retail; Grocery; Fashion industry; Deep Q-network; Revenue management; Yield management; Policies; Model; Revenue; Demand; Foods
DOI
10.1007/978-3-030-85672-4_10
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Dynamic pricing is a strategy for setting flexible prices for products based on current market demand. In this paper, we address the dynamic pricing of perishable products using a deep Q-network (DQN) as the value function approximator. A model-free reinforcement learning approach is used to maximize revenue for a perishable item with a fixed initial inventory and selling horizon. Demand is influenced by the price and freshness of the product. The conventional tabular Q-learning method stores the Q-value of each state-action pair in a lookup table, which is not suitable for control problems with large state spaces. Hence, we use function approximation to address the limitations of tabular Q-learning. The DQN function approximator generalizes from seen states to unseen states, which reduces the space required to store the value function for each state-action combination. We show that the problem of pricing perishable products can be modeled using DQN. Our results demonstrate that the DQN-based dynamic pricing algorithm generates higher revenue than conventional one-step price optimization and a constant pricing strategy.
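The tabular baseline that the abstract contrasts with DQN can be sketched as follows. This is a minimal illustration, not the authors' implementation: the price levels, horizon, inventory, demand function, and hyperparameters are all hypothetical values chosen for the example. It shows why the lookup table needs one entry per (period, stock, price) combination, which is the storage cost that a DQN would avoid by replacing the table with a network that maps a state to one Q-value per price.

```python
import random

# All constants below are hypothetical, chosen only for illustration.
PRICES = [4.0, 6.0, 8.0]   # discrete price levels (the actions)
HORIZON = 5                # number of selling periods
INVENTORY = 10             # fixed initial stock


def expected_demand(price, period):
    """Assumed demand model: falls with price and as freshness decays."""
    freshness = 1.0 - period / HORIZON
    return max(0.0, (10.0 - price) * freshness)


def step(period, stock, action, rng):
    """Sell for one period; return (revenue, next_period, next_stock)."""
    price = PRICES[action]
    demand = round(expected_demand(price, period) + rng.uniform(-1.0, 1.0))
    sold = max(0, min(stock, demand))
    return price * sold, period + 1, stock - sold


def train(episodes=5000, alpha=0.1, gamma=1.0, epsilon=0.1, seed=0):
    """Tabular Q-learning with an epsilon-greedy behaviour policy."""
    rng = random.Random(seed)
    actions = range(len(PRICES))
    # The lookup table: one entry per (period, stock, action) combination.
    q = {(t, s, a): 0.0
         for t in range(HORIZON + 1)
         for s in range(INVENTORY + 1)
         for a in actions}
    for _ in range(episodes):
        t, s = 0, INVENTORY
        while t < HORIZON and s > 0:
            if rng.random() < epsilon:
                a = rng.randrange(len(PRICES))
            else:
                a = max(actions, key=lambda i: q[(t, s, i)])
            reward, t2, s2 = step(t, s, a, rng)
            terminal = t2 >= HORIZON or s2 == 0
            best_next = 0.0 if terminal else max(q[(t2, s2, i)] for i in actions)
            # Standard Q-learning update toward the one-step bootstrapped target.
            q[(t, s, a)] += alpha * (reward + gamma * best_next - q[(t, s, a)])
            t, s = t2, s2
    return q


def greedy_price(q, period, stock):
    """Price with the highest learned Q-value in the given state."""
    best = max(range(len(PRICES)), key=lambda i: q[(period, stock, i)])
    return PRICES[best]


q_table = train()
print(len(q_table))  # 198 entries = 6 periods * 11 stock levels * 3 prices
print(greedy_price(q_table, 0, INVENTORY))
```

Even in this toy setting the table has 198 entries, and its size grows multiplicatively with every additional state variable or finer discretization, which is the limitation the abstract attributes to tabular Q-learning.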
Pages: 132-143
Number of pages: 12