Deep reinforcement learning for demand fulfillment in online retail

被引:2
|
作者
Wang, Yihua [1 ]
Minner, Stefan [1 ,2 ]
机构
[1] Tech Univ Munich, Sch Management, Logist & Supply Chain Management, D-80333 Munich, Germany
[2] Tech Univ Munich, Munich Data Sci Inst MDSI, D-85478 Garching, Germany
关键词
Demand fulfillment; Semi-markov decision processes; Deep reinforcement learning; STOCHASTIC INVENTORY CONTROL; LATERAL TRANSSHIPMENTS; APPROXIMATION SCHEME; MANAGEMENT; POLICY; ALGORITHMS;
D O I
10.1016/j.ijpe.2023.109133
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
A distinctive feature of online retail is the flexibility to ship items to customers from different distribution centers (DCs). This creates interdependence between DCs and poses new challenges in demand fulfillment to decide from which DC to satisfy each customer demand. This paper addresses a demand fulfillment problem in a multi -DC online retail environment where demand and replenishment lead time are stochastic. The objective of the problem is to minimize the long-term operational costs by determining the source DC for each customer demand. We formulate the problem as a semi-Markov decision process and develop a deep reinforcement learning (DRL) algorithm to solve the problem. To evaluate the performance of the DRL algorithm, we compare it with a set of heuristic rules and exact solutions obtained by linear programming. Numerical results show that the DRL policy performs equally well with the most competitive heuristic on complete pooling DC networks and outperforms all the heuristics on partial pooling DC networks. Additionally, by analyzing the transshipment ratio of the best -observed policies, we provide managerial insights regarding the circumstances in which transshipment is more favorable.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Online Optimization for Home Integrated Demand Response Based on Deep Reinforcement Learning
    Su, Yongxin
    Wu, Zexuan
    Tan, Mao
    Duan, Bin
    [J]. Zhongguo Dianji Gongcheng Xuebao/Proceedings of the Chinese Society of Electrical Engineering, 2021, 41 (16): : 5581 - 5592
  • [2] Go Wide or Go Deep? Assortment Strategy and Order Fulfillment in Online Retail
    Gopalakrishnan, Sanjith
    Matta, Moksh
    Yourdshahy, Mona Imanpoor
    Choudhary, Vivek
    [J]. M&SOM-MANUFACTURING & SERVICE OPERATIONS MANAGEMENT, 2023, 25 (03) : 846 - 861
  • [3] Application of Deep Reinforcement Learning in Demand Response
    Sun, Yi
    Liu, Di
    Li, Bin
    Xu, Yonghai
    [J]. Dianli Xitong Zidonghua/Automation of Electric Power Systems, 2019, 43 (05): : 183 - 191
  • [4] Residential demand response online optimization based on multi-agent deep reinforcement learning
    Yuan, Quan
    [J]. ELECTRIC POWER SYSTEMS RESEARCH, 2024, 237
  • [5] Online Adaptation of Deep Architectures with Reinforcement Learning
    Ganegedara, Thushan
    Ott, Lionel
    Ramos, Fabio
    [J]. ECAI 2016: 22ND EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, 285 : 577 - 585
  • [6] Deep Learning for Demand Forecasting in the Fashion and Apparel Retail Industry
    Giri, Chandadevi
    Chen, Yan
    [J]. FORECASTING, 2022, 4 (02): : 565 - 581
  • [7] Online Demand Fulfillment Under Limited Flexibility
    Xu, Zhen
    Zhang, Hailun
    Zhang, Jiheng
    Zhang, Rachel Q.
    [J]. MANAGEMENT SCIENCE, 2020, 66 (10) : 4667 - 4685
  • [8] Deep Reinforcement Learning for Demand Response in Distribution Networks
    Bahrami, Shahab
    Chen, Yu Christine
    Wong, Vincent W. S.
    [J]. IEEE TRANSACTIONS ON SMART GRID, 2021, 12 (02) : 1496 - 1506
  • [9] Making Better Fulfillment Decisions on the Fly in an Online Retail Environment
    Acimovic, Jason
    Graves, Stephen C.
    [J]. M&SOM-MANUFACTURING & SERVICE OPERATIONS MANAGEMENT, 2015, 17 (01) : 34 - 51
  • [10] Adaptive Eligibility Traces for Online Deep Reinforcement Learning
    Kobayashi, Taisuke
    [J]. INTELLIGENT AUTONOMOUS SYSTEMS 16, IAS-16, 2022, 412 : 417 - 428