Learn multi-step object sorting tasks through deep reinforcement learning

被引：7

作者：

Bao, Jiatong ^{[1
,2
]}

Zhang, Guoqing ^{[1
]}

Peng, Yi ^{[1
]}

Shao, Zhiyu ^{[1
]}

Song, Aiguo ^{[2
]}

机构：

[1] Yangzhou Univ, Sch Elect Energy & Power Engn, Yangzhou 225000, Jiangsu, Peoples R China

[2] Southeast Univ, Sch Instrument Sci & Engn, Nanjing 210000, Peoples R China

来源：

ROBOTICA | 2022年 / 40卷 / 11期

基金：

中国国家自然科学基金;

关键词：

object sorting; deep reinforcement learning; vision-based robotic manipulation;

D O I：

10.1017/S0263574722000650

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

Robotic systems are usually controlled to repetitively perform specific actions for manufacturing tasks. The traditional control methods are domain-dependent and model-dependent with cost of much human efforts. They cannot meet the new requirements of generality and flexibility in many areas such as intelligent manufacturing and customized production. This paper develops a general model-free approach to enable robots to perform multi-step object sorting tasks through deep reinforcement learning. Taking projected heightmap images from different time steps as input without extra high-level image analysis and understanding, critic models are designed to produce a pixel-wise Q value map for each type of action. It is a new trial to apply pixel-wise Q value-based critic networks to solve multi-step sorting tasks that involve many types of actions and complex action constraints. The experimental validations on simulated and realistic object sorting tasks demonstrate the effectiveness of the proposed approach. Qualitative results (videos), code for simulated and realistic experiments, and pre-trained models are available at https://github.com/JiatongBao/DRLSorting

引用

页码：3878 / 3894

页数：17

共 50 条

[1] Learning Efficient Coordination Strategy for Multi-step Tasks in Multi-agent Systems using Deep Reinforcement Learning
Zhu, Zean
Diallo, Elhadji Amadou Oury
Sugawara, Toshiharu
ICAART: PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 1, 2020, : 287 - 294
[2] The Effect of Multi-step Methods on Overestimation in Deep Reinforcement Learning
Meng, Lingheng
Gorbet, Rob
Kulic, Dana
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 347 - 353
[3] Recurrent neural networks that learn multi-step visual routines with reinforcement learning
Mollard, Sami
Wacongne, Catherine
Bohte, Sander M.
Roelfsema, Pieter R.
PLOS COMPUTATIONAL BIOLOGY, 2024, 20 (04)
[4] A Multi-Step Reinforcement Learning Algorithm
Zhang, Zhicong
Hu, Kaishun
Huang, Huiyu
Li, Shuai
Zhao, Shaoyong
FRONTIERS OF MANUFACTURING AND DESIGN SCIENCE, PTS 1-4, 2011, 44-47 : 3611 - 3615
[5] Self-supervised reinforcement learning for multi-step object manipulation skills
Wang, Jiaqi
Chen, Chuxin
Liu, Jingwei
Du, Guanglong
Zhu, Xiaojun
Guan, Quanlong
Qiu, Xiaojian
INDUSTRIAL ROBOT-THE INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH AND APPLICATION, 2025,
[6] Multi-Step Reinforcement Learning: A Unifying Algorithm
De Asis, Kristopher
Hernandez-Garcia, J. Fernando
Holland, G. Zacharias
Sutton, Richard S.
THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 2902 - 2909
[7] Reinforcement Learning for Multi-Step Expert Advice
Philipp, Patrick
Rettinger, Achim
AAMAS'17: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2017, : 962 - 971
[8] Learning Multi-Step Reasoning by Solving Arithmetic Tasks
Wang, Tianduo
Lu, Wei
61ST CONFERENCE OF THE THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 2, 2023, : 1229 - 1238
[9] "Good Robot!": Efficient Reinforcement Learning for Multi-Step Visual Tasks with Sim to Real Transfer
Hundt, Andrew
Killeen, Benjamin
Greene, Nicholas
Wu, Hongtao
Kwon, Heeyeon
Paxton, Chris
Hager, Gregory D.
IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (04) : 6724 - 6731
[10] Multi-step Prediction for Learning Invariant Representations in Reinforcement Learning
Xu, Xinyue
Lv, Kai
Dong, Xingye
Han, Sheng
Lin, Youfang
2021 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE BIG DATA AND INTELLIGENT SYSTEMS (HPBD&IS), 2021, : 202 - 206

← 1 2 3 4 5 →