The use of continuous action representations to scale deep reinforcement learning for inventory control

Cited by: 0
Authors
Vanvuchelen, Nathalie [1]
De Moor, Bram J. [2]
Boute, Robert N. [3, 4, 5]
Affiliations
[1] OMP, B-2160 Wommelgem, Belgium
[2] Eindhoven Univ Technol, Dept Ind Engn & Innovat Sci, NL-5600 MB Eindhoven, Netherlands
[3] Katholieke Univ Leuven, Res Ctr Operat Management, B-3000 Leuven, Belgium
[4] Vlerick Business Sch, Technol & Operat Management Area, B-3000 Leuven, Belgium
[5] Katholieke Univ Leuven, Flanders Make, B-3000 Leuven, Belgium
Funding
Research Foundation Flanders (FWO), Belgium
Keywords
deep reinforcement learning; continuous actions; neural networks; inventory management; S POLICIES;
DOI
10.1093/imaman/dpae031
Chinese Library Classification number
C93 [Management Science]
Discipline classification codes
12 ; 1201 ; 1202 ; 120202 ;
Abstract
Deep reinforcement learning (DRL) can solve complex inventory problems with a multi-dimensional state space. However, most approaches use a discrete action representation and do not scale well to problems with multi-dimensional action spaces. We use DRL with a continuous action representation for inventory problems with a large (multi-dimensional) discrete action space. To obtain feasible discrete actions from a continuous action representation, we add a tailored mapping function to the policy network that maps the continuous outputs of the policy network to a feasible integer solution. We demonstrate our approach to multi-product inventory control. We show how a continuous action representation solves larger problem instances and requires much less training time than a discrete action representation. Moreover, we show its performance matches state-of-the-art heuristic replenishment policies. This promising research avenue might pave the way for applying DRL in inventory control at scale and in practice.
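The abstract does not specify the mapping function, so the following is only a minimal sketch of the general idea of turning continuous policy outputs into a feasible integer action. The function name `map_to_feasible_orders`, the per-product bound `max_order`, and the joint `capacity` constraint are all assumptions made for illustration; they are not taken from the paper.

```python
import numpy as np


def map_to_feasible_orders(raw_action, max_order, capacity):
    """Map continuous policy outputs to integer order quantities.

    Hypothetical mapping for illustration only; not the paper's function.
    raw_action: values in [-1, 1], one per product (e.g. tanh outputs).
    max_order:  assumed per-product upper bound on the order size.
    capacity:   assumed joint cap on the total number of units ordered.
    """
    raw = np.clip(np.asarray(raw_action, dtype=float), -1.0, 1.0)
    # Rescale [-1, 1] to [0, max_order] for each product.
    scaled = (raw + 1.0) / 2.0 * max_order
    total = scaled.sum()
    if total > capacity:
        # Shrink proportionally so the continuous total meets the capacity.
        scaled = scaled * (capacity / total)
    # Rounding down keeps each quantity in bounds and the sum within capacity.
    return np.floor(scaled).astype(int)


if __name__ == "__main__":
    # Three products, at most 10 units each, at most 12 units in total.
    print(map_to_feasible_orders([1.0, 1.0, -1.0], max_order=10, capacity=12))
```

Rounding down after proportional shrinking is one simple way to guarantee feasibility under these assumed constraints; it can leave some capacity unused, which a tailored mapping could avoid.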
Pages: 51-66
Page count: 16
Related papers
50 in total
  • [21] Deep reinforcement learning in continuous action space for autonomous robotic surgery
    Amin Abbasi Shahkoo
    Ahmad Ali Abin
    International Journal of Computer Assisted Radiology and Surgery, 2023, 18 : 423 - 431
  • [22] Continuous Control of an Underground Loader Using Deep Reinforcement Learning
    Backman, Sofi
    Lindmark, Daniel
    Bodin, Kenneth
    Servin, Martin
    Mork, Joakim
    Lofgren, Hakan
    MACHINES, 2021, 9 (10)
  • [23] Continuous Control with Deep Reinforcement Learning for Mobile Robot Navigation
    Xiang, Jiaqi
    Li, Qingdong
    Dong, Xiwang
    Ren, Zhang
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 1501 - 1506
  • [24] Deep Reinforcement Learning for Large-Scale Epidemic Control
    Libin, Pieter J. K.
    Moonens, Arno
    Verstraeten, Timothy
    Perez-Sanjines, Fabian
    Hens, Niel
    Lemey, Philippe
    Nowe, Ann
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: APPLIED DATA SCIENCE AND DEMO TRACK, ECML PKDD 2020, PT V, 2021, 12461 : 155 - 170
  • [25] Inventory Pooling using Deep Reinforcement Learning
    Sampath, Kameshwaran
    Nishad, Sandeep
    Danda, Sai Koti Reddy
    Dayama, Pankaj
    Sankagiri, Suryanarayana
    2022 IEEE INTERNATIONAL CONFERENCE ON SERVICES COMPUTING (IEEE SCC 2022), 2022, : 259 - 267
  • [26] A deep reinforcement learning approach to seat inventory control for airline revenue management
    Shihab, Syed A. M.
    Wei, Peng
    JOURNAL OF REVENUE AND PRICING MANAGEMENT, 2022, 21 (02) : 183 - 199
  • [27] A deep reinforcement learning approach to seat inventory control for airline revenue management
    Syed A. M. Shihab
    Peng Wei
    Journal of Revenue and Pricing Management, 2022, 21 : 183 - 199
  • [28] Reinforcement learning in continuous action spaces
    van Hasselt, Hado
    Wiering, Marco A.
    2007 IEEE INTERNATIONAL SYMPOSIUM ON APPROXIMATE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING, 2007, : 272 - +
  • [29] Convergent Reinforcement Learning Control with Neural Networks and Continuous Action Search
    Lee, Minwoo
    Anderson, Charles W.
    2014 IEEE SYMPOSIUM ON ADAPTIVE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING (ADPRL), 2014, : 33 - 40
  • [30] Adversarial Attacks on Multiagent Deep Reinforcement Learning Models in Continuous Action Space
    Zhou, Ziyuan
    Liu, Guanjun
    Guo, Weiran
    Zhou, MengChu
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, 54 (12) : 7633 - 7646