Hybrid algorithm based on reinforcement learning for smart inventory management

被引:15
|
作者
Cuartas, Carlos [1 ]
Aguilar, Jose [1 ,2 ,3 ]
机构
[1] Univ EAFIT, GIDITIC, Medellin, Colombia
[2] Univ Alcala, Dept Automat, Alcala De Henares, Spain
[3] Univ Los Andes, CEMISID, Merida, Venezuela
关键词
Smart inventory; DDMRP model; Inventory management system; Reinforcement learning; Q-Learning; MRP;
D O I
10.1007/s10845-022-01982-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This article proposes a hybrid algorithm based on reinforcement learning and the inventory management methodology called DDMRP (Demand Driven Material Requirement Planning) to determine the optimal time to buy a certain product, and how much quantity should be requested. For this, the inventory management problem is formulated as a Markov Decision Process where the environment with which the system interacts is designed from the concepts raised in the DDMRP methodology, and through the reinforcement learning algorithm-specifically, Q-Learning. The optimal policy is determined for making decisions about when and how much to buy. To determine the optimal policy, three approaches are proposed for the reward function: the first one is based on inventory levels; the second is an optimization function based on the distance of the inventory to its optimal level, and the third is a shaping function based on levels and distances to the optimal inventory. The results show that the proposed algorithm has promising results in scenarios with different characteristics, performing adequately in difficult case studies, with a diversity of situations such as scenarios with discontinuous or continuous demand, seasonal and non-seasonal behavior, and with high demand peaks, among others.
引用
收藏
页码:123 / 149
页数:27
相关论文
共 50 条
  • [31] A Hybrid Algorithm for Energy Management in Smart Grid
    Shaheen, N.
    Javaid, N.
    Iqbal, Z.
    Muhammad, K.
    Azad, K.
    Chaudhry, F. A.
    PROCEEDINGS 2015 18TH INTERNATIONAL CONFERENCE ON NETWORK-BASED INFORMATION SYSTEMS (NBIS 2015), 2015, : 58 - 63
  • [32] Project and Development of a Reinforcement Learning Based Control Algorithm for Hybrid Electric Vehicles
    Maino, Claudio
    Mastropietro, Antonio
    Sorrentino, Luca
    Busto, Enrico
    Misul, Daniela
    Spessa, Ezio
    APPLIED SCIENCES-BASEL, 2022, 12 (02):
  • [33] A Hybrid ACO Algorithm Based on Bayesian Factorizations and Reinforcement Learning for Continuous Optimization
    Liu, Qishuai
    Hui, Qing
    2016 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2016, : 4236 - 4243
  • [34] Batch Reinforcement Learning for Smart Home Energy Management
    Berlink, Heider
    Reali Costa, Anna Helena
    PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), 2015, : 2561 - 2567
  • [35] Hybrid Particle Swarm Optimization Algorithm Based on the Theory of Reinforcement Learning in Psychology
    Huang, Wenya
    Liu, Youjin
    Zhang, Xizheng
    SYSTEMS, 2023, 11 (02):
  • [36] Hybrid Control Algorithm for Humanoid Robots Walking Based on Episodic Reinforcement Learning
    Katic, Dusko
    Rodic, Aleksandar
    Jose Bayro-Corrochano, Eduardo
    2012 WORLD AUTOMATION CONGRESS (WAC), 2012,
  • [37] Solving Inventory Management Problems through Deep Reinforcement Learning
    Qinghao Wang
    Yijie Peng
    Yaodong Yang
    Journal of Systems Science and Systems Engineering, 2022, 31 : 677 - 689
  • [38] Automated market maker inventory management with deep reinforcement learning
    Óscar Fernández Vicente
    Fernando Fernández
    Javier García
    Applied Intelligence, 2023, 53 : 22249 - 22266
  • [39] Solving Inventory Management Problems through Deep Reinforcement Learning
    Wang, Qinghao
    Peng, Yijie
    Yang, Yaodong
    JOURNAL OF SYSTEMS SCIENCE AND SYSTEMS ENGINEERING, 2022, 31 (06) : 677 - 689
  • [40] Cooperative Multi-agent Reinforcement Learning for Inventory Management
    Khirwar, Madhav
    Gurumoorthy, Karthik S.
    Jain, Ankit Ajit
    Manchenahally, Shantala
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: APPLIED DATA SCIENCE AND DEMO TRACK, ECML PKDD 2023, PT VI, 2023, 14174 : 619 - 634