MARL-Based Cooperative Multi-AGV Control in Warehouse Systems

被引:8
|
作者
Choi, Ho-Bin [1 ]
Kim, Ju-Bong [1 ]
Han, Youn-Hee [1 ]
Oh, Se-Won [2 ]
Kim, Kwihoon [3 ]
机构
[1] Korea Univ Technol & Educ, Future Convergence Engn, Cheonan 31253, South Korea
[2] ETRI, Dept Knowledge Converged Super Brain Convergence, Daejeon 34129, South Korea
[3] Korea Natl Univ Educ, Dept Artificial Intelligence Convergence Educ, Cheongju 28173, South Korea
基金
新加坡国家研究基金会;
关键词
Reinforcement learning; Layout; Path planning; Real-time systems; Productivity; Decision making; Convergence; AGV; fulfillment center; multi-agent reinforcement learning; path planning;
D O I
10.1109/ACCESS.2022.3206537
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Automated guided vehicles (AGVs) are essential components for the automation of fulfillment centers, a type of warehouse, where goods are stored on shelves and carried by AGVs. To increase the productivity in inventory management, a well-organized cooperative path control is required to transport goods to the designated picking stations. In this paper, we propose a QMIX-based scheme for the cooperative path control of multiple AGVs. Although QMIX is the one of popular cooperative multi-agent reinforcement learning algorithms, we find that QMIX alone was not enough to increase productivity in warehouse systems. So, we develop two novel techniques that can be used with QMIX: 1) sequential action masking that eliminates all the collision cases and 2) additional local loss that improves collaboration of individual AGVs. They help to encourage the AGVs to cooperate more for high productivity. By extensive simulations, we present the superiority of the proposed scheme on several layouts in fulfillment centers. The effect of cooperation among AGVs in the proposed scheme is verified through the comparison study with the existing algorithms. Additionally, we show the generalization performance by investigating the reusability of the model trained with the proposed scheme.
引用
收藏
页码:100478 / 100488
页数:11
相关论文
共 50 条
  • [1] A Probabilistic Time-Constrained Based Heuristic Path Planning Algorithm in Warehouse Multi-AGV Systems
    Lian, Yindong
    Xie, Wei
    Zhang, Langwen
    IFAC PAPERSONLINE, 2020, 53 (02): : 2538 - 2543
  • [2] Multi-AGV Tracking System Based on Global Vision and AprilTag in Smart Warehouse
    Yang, Qifan
    Lian, Yindong
    Liu, Yanru
    Xie, Wei
    Yang, Yibin
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2022, 104 (03)
  • [3] Multi-AGV Tracking System Based on Global Vision and AprilTag in Smart Warehouse
    Qifan Yang
    Yindong Lian
    Yanru Liu
    Wei Xie
    Yibin Yang
    Journal of Intelligent & Robotic Systems, 2022, 104
  • [4] An MARL-based Task Scheduling Algorithm for Cooperative Computation in Multi-UAV-Assisted MEC Systems
    Yang, Luyinru
    Zheng, Jun
    Zhang, Baoxian
    2023 INTERNATIONAL CONFERENCE ON FUTURE COMMUNICATIONS AND NETWORKS, FCN, 2023,
  • [5] Dynamic Spare Point Application Based Coordination Strategy for Multi-AGV Systems in a WIP Warehouse Environment
    Xu, Rui
    Feng, Haojun
    Liu, Jia
    Hong, Wenjing
    IEEE ACCESS, 2022, 10 : 80249 - 80263
  • [6] A parallel algorithm for multi-AGV systems
    Yu, Dingding
    Hu, Xianliang
    Liang, Kewei
    Ying, Jun
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2022, 13 (04) : 2309 - 2323
  • [7] Decentralized Control of Multi-AGV Systems in Autonomous Warehousing Applications
    Draganjac, Ivica
    Miklic, Damjan
    Kovaci, Zdenko
    Vasiljevic, Goran
    Bogdan, Stjepan
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2016, 13 (04) : 1433 - 1447
  • [8] Collaborative Control Method of Intelligent Warehouse Traffic Signal and Multi-AGV Path Planning
    Si, Ming
    Wu, Bofan
    Hu, Can
    Xing, Weiqiang
    Computer Engineering and Applications, 2024, 60 (11) : 290 - 297
  • [9] Cooperative cloud robotics architecture for the coordination of multi-AGV systems in industrial warehouses
    Cardarelli, Elena
    Digani, Valerio
    Sabattini, Lorenzo
    Secchi, Cristian
    Fantuzzi, Cesare
    MECHATRONICS, 2017, 45 : 1 - 13
  • [10] A parallel algorithm for multi-AGV systems
    Dingding Yu
    Xianliang Hu
    Kewei Liang
    Jun Ying
    Journal of Ambient Intelligence and Humanized Computing, 2022, 13 : 2309 - 2323