Towards reliable robot packing system based on deep reinforcement learning

被引:7
|
作者
Xiong, Heng [1 ]
Ding, Kai [2 ]
Ding, Wan [2 ]
Peng, Jian [1 ]
Xu, Jianfeng [1 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Mech Sci & Engn, State Key Lab Digital Mfg Equipment & Technol, Wuhan 430074, Peoples R China
[2] BOSCH Corp Res, Shanghai 200335, Peoples R China
关键词
Robotics; Online bin packing; Reinforcement learning; Manipulation; BIN PACKING; ALGORITHM; HEURISTICS; PICKING;
D O I
10.1016/j.aei.2023.102028
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Object packing by a robot has a wide range of applications in the logistics industry. This task requires the variable size items to be picked from piles one by one and then packed into another container immediately without the information about the unpicked items, modeled as an Online 3D Bin Packing Problem (3D-BPP). Due to limited information, it is a challenging problem to obtain an optimal solution for maximizing space utilization. Furthermore, existing studies do not consider practical constraints and assume an ideal perception and robotic packing manipulation. In this paper, we present a robot packing system with high performance and reliability. First, the Online 3D-BPP is formulated as a Markov decision process. A deep reinforcement learning (DRL) approach is proposed to tackle the problem utilizing the observations of the container and the current item. Specifically, a candidate map that indicates the potentially feasible placements based on heuristics is introduced to balance the exploration and exploitation in the considerable discrete action space. Second, we develop a physical robotic system to bridge the DRL agent from simulation to practical application. To make the packing manipulation resilient to uncertainties from the physical system, we design a motion primitive by moving the picked item close to its target placement from a collision-free area within the container. Experiments demonstrate that our method delivers superior performance against the baselines on two datasets, improving space utilization by over 2.7% and 3.8%, respectively, and the performance is not limited by the container size. Moreover, our robotic system can facilitate DRL to perform well in the real world.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Deep Reinforcement Learning Based Mobile Robot Navigation:A Review
    Kai Zhu
    Tao Zhang
    [J]. Tsinghua Science and Technology, 2021, 26 (05) : 674 - 691
  • [22] Robot Obstacle Avoidance Controller Based on Deep Reinforcement Learning
    Tang, Yaokun
    Chen, Qingyu
    Wei, Yuxin
    [J]. Journal of Sensors, 2022, 2022
  • [23] Autonomous Mobile Robot with Simple Navigation System Based on Deep Reinforcement Learning and a Monocular Camera
    Yokoyama, Koki
    Morioka, Kazuyuki
    [J]. 2020 IEEE/SICE INTERNATIONAL SYMPOSIUM ON SYSTEM INTEGRATION (SII), 2020, : 525 - 530
  • [24] Robot autonomous grasping and assembly skill learning based on deep reinforcement learning
    Chen, Chengjun
    Zhang, Hao
    Pan, Yong
    Li, Dongnian
    [J]. INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2024, 130 (11-12): : 5233 - 5249
  • [25] Robot autonomous grasping and assembly skill learning based on deep reinforcement learning
    Chengjun Chen
    Hao Zhang
    Yong Pan
    Dongnian Li
    [J]. The International Journal of Advanced Manufacturing Technology, 2024, 130 : 5233 - 5249
  • [26] Towards Reinforcement based Learning of an Assembly Process for Human Robot Collaboration
    Akkaladevi, Sharath Chandra
    Plasch, Matthias
    Pichler, Andreas
    Ikeda, Markus
    [J]. 29TH INTERNATIONAL CONFERENCE ON FLEXIBLE AUTOMATION AND INTELLIGENT MANUFACTURING (FAIM 2019): BEYOND INDUSTRY 4.0: INDUSTRIAL ADVANCES, ENGINEERING EDUCATION AND INTELLIGENT MANUFACTURING, 2019, 38 : 1491 - 1498
  • [27] Research progress of robot motion control based on deep reinforcement learning
    Dong H.
    Yang J.
    Li S.-B.
    Wang J.
    Duan Z.-J.
    [J]. Kongzhi yu Juece/Control and Decision, 2022, 37 (02): : 278 - 292
  • [28] Deep Reinforcement Learning Based Online Area Covering Autonomous Robot
    Saha, Olimpiya
    Ren, Guohua
    Heydari, Javad
    Ganapathy, Viswanath
    Shah, Mohak
    [J]. 2021 7TH INTERNATIONAL CONFERENCE ON AUTOMATION, ROBOTICS AND APPLICATIONS (ICARA 2021), 2021, : 21 - 25
  • [29] Robot-Assisted Pedestrian Regulation Based on Deep Reinforcement Learning
    Wan, Zhiqiang
    Jiang, Chao
    Fahad, Muhammad
    Ni, Zhen
    Guo, Yi
    He, Haibo
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (04) : 1669 - 1682
  • [30] A novel mobile robot navigation method based on deep reinforcement learning
    Quan, Hao
    Li, Yansheng
    Zhang, Yi
    [J]. INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2020, 17 (03):