Constrained Reinforcement Learning for Dynamic Material Handling

被引:0
|
作者
Hu, Chengpeng [1 ,2 ]
Wang, Ziming [1 ,2 ]
Liu, Jialin [1 ,2 ]
Wen, Junyi [3 ]
Mao, Bifei [3 ]
Yao, Xin [1 ,2 ]
机构
[1] Southern Univ Sci & Technol, RITAS, Shenzhen, Peoples R China
[2] Southern Univ Sci & Technol, Guangdong Key Lab Brain Inspired Intelligent Com, Dept Comp Sci & Engn, Shenzhen, Peoples R China
[3] Huawei Technol Co Ltd, Trustworthiness Theory Res Ctr, Shenzhen, Peoples R China
基金
中国国家自然科学基金;
关键词
Dynamic material handling; constrained reinforcement learning; automated guided vehicle; manufacturing system; benchmark; SYSTEMS; RULES;
D O I
10.1109/IJCNN54540.2023.10191999
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As one of the core parts of flexible manufacturing systems, material handling involves storage and transportation of materials between workstations with automated vehicles. The improvement in material handling can impulse the overall efficiency of the manufacturing system. However, the occurrence of dynamic events during the optimisation of task arrangements poses a challenge that requires adaptability and effectiveness. In this paper, we aim at the scheduling of automated guided vehicles for dynamic material handling. Motivated by some real-world scenarios, unknown new tasks and unexpected vehicle breakdowns are regarded as dynamic events in our problem. We formulate the problem as a constrained Markov decision process which takes into account tardiness and available vehicles as cumulative and instantaneous constraints, respectively. An adaptive constrained reinforcement learning algorithm that combines Lagrangian relaxation and invalid action masking, named RCPOM, is proposed to address the problem with two hybrid constraints. Moreover, a gym-like dynamic material handling simulator, named DMH-GYM, is developed and equipped with diverse problem instances, which can be used as benchmarks for dynamic material handling. Experimental results on the problem instances demonstrate the outstanding performance of our proposed approach compared with eight state-of-the-art constrained and non-constrained reinforcement learning algorithms, and widely used dispatching rules for material handling.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Applying reinforcement learning to plan manufacturing material handling
    Govindaiah, Swetha
    Petty, Mikel D.
    [J]. Discover Artificial Intelligence, 2021, 1 (01):
  • [2] Dynamic Vehicle Traffic Control Using Deep Reinforcement Learning in Automated Material Handling System
    Kang, Younkook
    Lyu, Sungwon
    Kim, Jeeyung
    Park, Bongjoon
    Cho, Sungzoon
    [J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 9949 - 9950
  • [3] Constrained Reinforcement Learning for Dynamic Optimization under Uncertainty
    Petsagkourakis, P.
    Sandoval, I. O.
    Bradford, E.
    Zhang, D.
    del Rio-Chanona, E. A.
    [J]. IFAC PAPERSONLINE, 2020, 53 (02): : 11264 - 11270
  • [4] Dynamic Action Space Handling Method for Reinforcement Learning models
    Woo, Sangchul
    Sung, Yunsick
    [J]. JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2020, 16 (05): : 1223 - 1230
  • [5] A dynamic penalty approach to state constraint handling in deep reinforcement learning
    Yoo, Haeun
    Zavala, Victor M.
    Lee, Jay H.
    [J]. JOURNAL OF PROCESS CONTROL, 2022, 115 : 157 - 166
  • [6] A Dynamic Penalty Function Approach for Constraint-Handling in Reinforcement Learning
    Yoo, Haeun
    Zavala, Victor M.
    Lee, Jay H.
    [J]. IFAC PAPERSONLINE, 2021, 54 (03): : 487 - 491
  • [7] Reinforcement Learning in Ship Handling
    Lacki, M.
    [J]. TRANSNAV-INTERNATIONAL JOURNAL ON MARINE NAVIGATION AND SAFETY OF SEA TRANSPORTATION, 2008, 2 (02) : 157 - 160
  • [8] Constrained Reinforcement Learning for Stochastic Dynamic Optimal Power Flow Control
    Wu, Tong
    Scaglione, Anna
    [J]. 2023 IEEE POWER & ENERGY SOCIETY GENERAL MEETING, PESGM, 2023,
  • [9] Reinforcement Learning for Robust Dynamic Event-Driven Constrained Control
    Yang, Xiong
    Wang, Ding
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [10] Batch-Constrained Reinforcement Learning for Dynamic Distribution Network Reconfiguration
    Gao, Yuanqi
    Wang, Wei
    Shi, Jie
    Yu, Nanpeng
    [J]. IEEE TRANSACTIONS ON SMART GRID, 2020, 11 (06) : 5357 - 5369