Application of Reinforcement Learning to the Orientation and Position Control of a 6 Degrees of Freedom Robotic Manipulator

被引:0
|
作者
Campos, Felipe Rigueira [1 ,4 ]
Fidencio, Aline Xavier [2 ]
Domingues, Jaco [1 ,4 ]
Pessin, Gustavo [1 ,3 ,4 ]
Freitas, Gustavo [1 ,5 ]
机构
[1] Univ Fed Ouro Preto & Inst Tecnol Vale, Programa Posgrad Instrumentacao, Controle & Automacao Proc Mineracao, Ouro Preto, MG, Brazil
[2] Ruhr Univ Bochum, Fac Elect Engn & Informat Technol, Bochum, Germany
[3] Univ Fed Ouro Preto, Dept Comp, Ouro Preto, MG, Brazil
[4] Inst Tecnol Vale, Lab Robot Controle & Instrumentacao, Ouro Preto, MG, Brazil
[5] Univ Fed Minas Gerais, Dept Engn Elect, Belo Horizonte, MG, Brazil
来源
2022 LATIN AMERICAN ROBOTICS SYMPOSIUM (LARS), 2022 BRAZILIAN SYMPOSIUM ON ROBOTICS (SBR), AND 2022 WORKSHOP ON ROBOTICS IN EDUCATION (WRE) | 2022年
关键词
Robotics; Machine Learning; Reinforcement Learning; DDPG; PPO;
D O I
10.1109/LARS/SBR/WRE56824.2022.9995835
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Applications with autonomous robots play an important role in the industry and in everyday life. Among them, the activities of manipulating and moving objects are highlighted by the wide variety of possible applications. These activities in static and known environments can be implemented through logic planned by the developer, but this is not feasible in dynamic environments. Machine Learning (ML) techniques such as Reinforcement Learning (RL) algorithms have sought to replace the pre-defined programming by teaching the robot how to act. This paper presents the implementation of two RL algorithms, Deep Deterministic Policy Gradient (DDPG) and Proximal Policy Optimization (PPO), for orientation and position control of a 6-degree-of-freedom (6-DoF) robotic manipulator. The results demonstrated that the DDPG have a faster learning convergence in simpler activities, but if the complexity of the problem increases, it might not obtain a satisfactory behavior. On the other hand, PPO can solve more complex problems but it limits the convergence rate to the best result in order to avoid learning instability.
引用
收藏
页码:187 / 192
页数:6
相关论文
共 50 条
  • [1] Position Domain Synchronization Control of Multi-Degrees of Freedom Robotic Manipulator
    Ouyang, P. R.
    Pano, V.
    JOURNAL OF DYNAMIC SYSTEMS MEASUREMENT AND CONTROL-TRANSACTIONS OF THE ASME, 2014, 136 (02):
  • [2] Decentralized reinforcement learning control of a robotic manipulator
    Busoniu, Lucian
    De Schutter, Bart
    Babuska, Robert
    2006 9TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION, VOLS 1- 5, 2006, : 1121 - +
  • [3] Design and Manual Control of a 3 Degrees of Freedom Social Robotic Manipulator
    Khajehpour, Pouria
    Najafi, Esmaeil
    2020 21ST INTERNATIONAL CONFERENCE ON RESEARCH AND EDUCATION IN MECHATRONICS (REM), 2020,
  • [4] A Reinforcement Learning Neural Network for Robotic Manipulator Control
    Hu, Yazhou
    Si, Bailu
    NEURAL COMPUTATION, 2018, 30 (07) : 1983 - 2004
  • [5] Reinforcement Learning Control for a Robotic Manipulator with Unknown Deadzone
    Li, Yanan
    Xiao, Shengtao
    Ge, Shuzhi Sam
    2014 11TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2014, : 593 - 598
  • [6] Joint torque control for the pneumatically robotic manipulator with 3 degrees of freedom
    SMC Pneumatic Center, Harbin Institute of Technology, Harbin 150001, China
    不详
    Jixie Gongcheng Xuebao, 2008, 12 (254-260): : 254 - 260
  • [7] Learning force control for position controlled robotic manipulator
    Qiao, B
    Zhu, JY
    Wei, ZX
    CIRP ANNALS 1999 - MANUFACTURING TECHNOLOGY, 1999, : 1 - 4
  • [8] Monitoring and Control of Position and Attitude of Flexible Manipulator with Three Degrees of Freedom
    Yang, Bangchu
    Wang, Wenbiao
    Meng, Hailiang
    Wan, Weiwei
    Bao, Guanjun
    2019 IEEE 4TH INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2019), 2019, : 208 - 213
  • [9] Learning force control for position controlled robotic manipulator
    Nanjing Univ. Aero. and Astronaut., Nanjing, China
    CIRP Ann Manuf Technol, 1 (1-4):
  • [10] Active Fault-Tolerant Control Integrated with Reinforcement Learning Application to Robotic Manipulator
    Yan, Zichen
    Tan, Junbo
    Liang, Bin
    Liu, Houde
    Yang, Jun
    2022 AMERICAN CONTROL CONFERENCE, ACC, 2022, : 2656 - 2662