Towards reliable robot packing system based on deep reinforcement learning

被引：7

作者：

Xiong, Heng ^{[1
]}

Ding, Kai ^{[2
]}

Ding, Wan ^{[2
]}

Peng, Jian ^{[1
]}

Xu, Jianfeng ^{[1
]}

机构：

[1] Huazhong Univ Sci & Technol, Sch Mech Sci & Engn, State Key Lab Digital Mfg Equipment & Technol, Wuhan 430074, Peoples R China

[2] BOSCH Corp Res, Shanghai 200335, Peoples R China

来源：

ADVANCED ENGINEERING INFORMATICS | 2023年 / 57卷

关键词：

Robotics; Online bin packing; Reinforcement learning; Manipulation; BIN PACKING; ALGORITHM; HEURISTICS; PICKING;

D O I：

10.1016/j.aei.2023.102028

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Object packing by a robot has a wide range of applications in the logistics industry. This task requires the variable size items to be picked from piles one by one and then packed into another container immediately without the information about the unpicked items, modeled as an Online 3D Bin Packing Problem (3D-BPP). Due to limited information, it is a challenging problem to obtain an optimal solution for maximizing space utilization. Furthermore, existing studies do not consider practical constraints and assume an ideal perception and robotic packing manipulation. In this paper, we present a robot packing system with high performance and reliability. First, the Online 3D-BPP is formulated as a Markov decision process. A deep reinforcement learning (DRL) approach is proposed to tackle the problem utilizing the observations of the container and the current item. Specifically, a candidate map that indicates the potentially feasible placements based on heuristics is introduced to balance the exploration and exploitation in the considerable discrete action space. Second, we develop a physical robotic system to bridge the DRL agent from simulation to practical application. To make the packing manipulation resilient to uncertainties from the physical system, we design a motion primitive by moving the picked item close to its target placement from a collision-free area within the container. Experiments demonstrate that our method delivers superior performance against the baselines on two datasets, improving space utilization by over 2.7% and 3.8%, respectively, and the performance is not limited by the container size. Moreover, our robotic system can facilitate DRL to perform well in the real world.

引用

页数：10

共 50 条

[21] Deep Reinforcement Learning Based Mobile Robot Navigation:A Review
Kai Zhu
Tao Zhang
[J]. Tsinghua Science and Technology, 2021, 26 (05) : 674 - 691
[22] Robot Obstacle Avoidance Controller Based on Deep Reinforcement Learning
Tang, Yaokun
Chen, Qingyu
Wei, Yuxin
[J]. Journal of Sensors, 2022, 2022
[23] Autonomous Mobile Robot with Simple Navigation System Based on Deep Reinforcement Learning and a Monocular Camera
Yokoyama, Koki
Morioka, Kazuyuki
[J]. 2020 IEEE/SICE INTERNATIONAL SYMPOSIUM ON SYSTEM INTEGRATION (SII), 2020, : 525 - 530
[24] Robot autonomous grasping and assembly skill learning based on deep reinforcement learning
Chen, Chengjun
Zhang, Hao
Pan, Yong
Li, Dongnian
[J]. INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2024, 130 (11-12): : 5233 - 5249
[25] Robot autonomous grasping and assembly skill learning based on deep reinforcement learning
Chengjun Chen
Hao Zhang
Yong Pan
Dongnian Li
[J]. The International Journal of Advanced Manufacturing Technology, 2024, 130 : 5233 - 5249
[26] Towards Reinforcement based Learning of an Assembly Process for Human Robot Collaboration
Akkaladevi, Sharath Chandra
Plasch, Matthias
Pichler, Andreas
Ikeda, Markus
[J]. 29TH INTERNATIONAL CONFERENCE ON FLEXIBLE AUTOMATION AND INTELLIGENT MANUFACTURING (FAIM 2019): BEYOND INDUSTRY 4.0: INDUSTRIAL ADVANCES, ENGINEERING EDUCATION AND INTELLIGENT MANUFACTURING, 2019, 38 : 1491 - 1498
[27] Research progress of robot motion control based on deep reinforcement learning
Dong H.
Yang J.
Li S.-B.
Wang J.
Duan Z.-J.
[J]. Kongzhi yu Juece/Control and Decision, 2022, 37 (02): : 278 - 292
[28] Deep Reinforcement Learning Based Online Area Covering Autonomous Robot
Saha, Olimpiya
Ren, Guohua
Heydari, Javad
Ganapathy, Viswanath
Shah, Mohak
[J]. 2021 7TH INTERNATIONAL CONFERENCE ON AUTOMATION, ROBOTICS AND APPLICATIONS (ICARA 2021), 2021, : 21 - 25
[29] Robot-Assisted Pedestrian Regulation Based on Deep Reinforcement Learning
Wan, Zhiqiang
Jiang, Chao
Fahad, Muhammad
Ni, Zhen
Guo, Yi
He, Haibo
[J]. IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (04) : 1669 - 1682
[30] A novel mobile robot navigation method based on deep reinforcement learning
Quan, Hao
Li, Yansheng
Zhang, Yi
[J]. INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2020, 17 (03):

← 1 2 3 4 5 →