Learn multi-step object sorting tasks through deep reinforcement learning

被引:7
|
作者
Bao, Jiatong [1 ,2 ]
Zhang, Guoqing [1 ]
Peng, Yi [1 ]
Shao, Zhiyu [1 ]
Song, Aiguo [2 ]
机构
[1] Yangzhou Univ, Sch Elect Energy & Power Engn, Yangzhou 225000, Jiangsu, Peoples R China
[2] Southeast Univ, Sch Instrument Sci & Engn, Nanjing 210000, Peoples R China
基金
中国国家自然科学基金;
关键词
object sorting; deep reinforcement learning; vision-based robotic manipulation;
D O I
10.1017/S0263574722000650
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Robotic systems are usually controlled to repetitively perform specific actions for manufacturing tasks. The traditional control methods are domain-dependent and model-dependent with cost of much human efforts. They cannot meet the new requirements of generality and flexibility in many areas such as intelligent manufacturing and customized production. This paper develops a general model-free approach to enable robots to perform multi-step object sorting tasks through deep reinforcement learning. Taking projected heightmap images from different time steps as input without extra high-level image analysis and understanding, critic models are designed to produce a pixel-wise Q value map for each type of action. It is a new trial to apply pixel-wise Q value-based critic networks to solve multi-step sorting tasks that involve many types of actions and complex action constraints. The experimental validations on simulated and realistic object sorting tasks demonstrate the effectiveness of the proposed approach. Qualitative results (videos), code for simulated and realistic experiments, and pre-trained models are available at https://github.com/JiatongBao/DRLSorting
引用
收藏
页码:3878 / 3894
页数:17
相关论文
共 50 条
  • [1] Learning Efficient Coordination Strategy for Multi-step Tasks in Multi-agent Systems using Deep Reinforcement Learning
    Zhu, Zean
    Diallo, Elhadji Amadou Oury
    Sugawara, Toshiharu
    ICAART: PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 1, 2020, : 287 - 294
  • [2] The Effect of Multi-step Methods on Overestimation in Deep Reinforcement Learning
    Meng, Lingheng
    Gorbet, Rob
    Kulic, Dana
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 347 - 353
  • [3] Recurrent neural networks that learn multi-step visual routines with reinforcement learning
    Mollard, Sami
    Wacongne, Catherine
    Bohte, Sander M.
    Roelfsema, Pieter R.
    PLOS COMPUTATIONAL BIOLOGY, 2024, 20 (04)
  • [4] A Multi-Step Reinforcement Learning Algorithm
    Zhang, Zhicong
    Hu, Kaishun
    Huang, Huiyu
    Li, Shuai
    Zhao, Shaoyong
    FRONTIERS OF MANUFACTURING AND DESIGN SCIENCE, PTS 1-4, 2011, 44-47 : 3611 - 3615
  • [5] Self-supervised reinforcement learning for multi-step object manipulation skills
    Wang, Jiaqi
    Chen, Chuxin
    Liu, Jingwei
    Du, Guanglong
    Zhu, Xiaojun
    Guan, Quanlong
    Qiu, Xiaojian
    INDUSTRIAL ROBOT-THE INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH AND APPLICATION, 2025,
  • [6] Multi-Step Reinforcement Learning: A Unifying Algorithm
    De Asis, Kristopher
    Hernandez-Garcia, J. Fernando
    Holland, G. Zacharias
    Sutton, Richard S.
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 2902 - 2909
  • [7] Reinforcement Learning for Multi-Step Expert Advice
    Philipp, Patrick
    Rettinger, Achim
    AAMAS'17: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2017, : 962 - 971
  • [8] Learning Multi-Step Reasoning by Solving Arithmetic Tasks
    Wang, Tianduo
    Lu, Wei
    61ST CONFERENCE OF THE THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 2, 2023, : 1229 - 1238
  • [9] "Good Robot!": Efficient Reinforcement Learning for Multi-Step Visual Tasks with Sim to Real Transfer
    Hundt, Andrew
    Killeen, Benjamin
    Greene, Nicholas
    Wu, Hongtao
    Kwon, Heeyeon
    Paxton, Chris
    Hager, Gregory D.
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (04) : 6724 - 6731
  • [10] Multi-step Prediction for Learning Invariant Representations in Reinforcement Learning
    Xu, Xinyue
    Lv, Kai
    Dong, Xingye
    Han, Sheng
    Lin, Youfang
    2021 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE BIG DATA AND INTELLIGENT SYSTEMS (HPBD&IS), 2021, : 202 - 206