Deep Reinforcement Learning for Dynamic Pricing of Perishable Products

被引:2
|
作者
Burman, Vibhati [1 ]
Vashishtha, Rajesh Kumar [1 ]
Kumar, Rajan
Ramanan, Sharadha [1 ]
机构
[1] TCS Res, Chennai, India
来源
关键词
Dynamic pricing; Deep reinforcement learning; Perishable items; Retail; Grocery; Fashion industry; Deep Q-network; Revenue management; YIELD MANAGEMENT; POLICIES; MODEL; REVENUE; DEMAND; FOODS;
D O I
10.1007/978-3-030-85672-4_10
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dynamic pricing is a strategy for setting flexible prices for products based on existing market demand. In this paper, we address the problem of dynamic pricing of perishable products using DQN value function approximator. A model-free reinforcement learning approach is used to maximize revenue for a perishable item with fixed initial inventory and selling horizon. The demand is influenced by the price and freshness of the product. The conventional tabular Q-learning method involves storing the Q-values for each state-action pair in a lookup table. This approach is not suitable for control problems with large state spaces. Hence, we use function approximation approach to address the limitations of a tabular Q-learning method. Using DQN function approximator we generalize the unseen states from the seen states, which reduces the space requirements for storing value function for each state-action combination. We show that using DQN we can model the problem of pricing perishable products. Our results demonstrate that the DQN based dynamic pricing algorithm generates higher revenue when compared with conventional one-step price optimization and constant pricing strategy.
引用
收藏
页码:132 / 143
页数:12
相关论文
共 50 条
  • [1] Deep reinforcement learning algorithms for dynamic pricing and inventory management of perishable products
    Yavuz, Tugce
    Kaya, Onur
    [J]. APPLIED SOFT COMPUTING, 2024, 163
  • [3] Dynamic pricing policies for interdependent perishable products or services using reinforcement learning
    Rana, Rupal
    Oliveira, Fernando S.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (01) : 426 - 436
  • [4] Learning customer preferences and dynamic pricing for perishable products
    Yang, Yang
    Chu, Wan -Ling
    Wu, Cheng-Hung
    [J]. COMPUTERS & INDUSTRIAL ENGINEERING, 2022, 171
  • [5] Distributed dynamic pricing of multiple perishable products using multi-agent reinforcement learning
    Qiao, Wenchuan
    Huang, Min
    Gao, Zheming
    Wang, Xingwei
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 237
  • [6] Pricing to accelerate demand learning in dynamic assortment planning for perishable products
    Talebian, Masoud
    Boland, Natashia
    Savelsbergh, Martin
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2014, 237 (02) : 555 - 565
  • [7] Dynamic Pricing for Perishable Products by Fuzzy Decision
    Luo, Chunlin
    Liu, Jian
    [J]. IEEE/SOLI'2008: PROCEEDINGS OF 2008 IEEE INTERNATIONAL CONFERENCE ON SERVICE OPERATIONS AND LOGISTICS, AND INFORMATICS, VOLS 1 AND 2, 2008, : 2849 - 2852
  • [8] An Automated Deep Reinforcement Learning Pipeline for Dynamic Pricing
    Afshar, Reza Refaei
    Rhuggenaath, Jason
    Zhang, Yingqian
    Kaymak, Uzay
    [J]. IEEE Transactions on Artificial Intelligence, 2023, 4 (03): : 428 - 437
  • [9] Dynamic pricing for perishable products with hybrid uncertainty in demand
    Li, Gendao
    Xiong, Zhongkai
    Zhou, Yu
    Xiong, Yu
    [J]. APPLIED MATHEMATICS AND COMPUTATION, 2013, 219 (20) : 10366 - 10377
  • [10] Dynamic joint pricing and production policy for perishable products
    Feng, Lin
    Zhang, Jianxiong
    Tang, Wansheng
    [J]. INTERNATIONAL TRANSACTIONS IN OPERATIONAL RESEARCH, 2018, 25 (06) : 2031 - 2051