Application of Reinforcement Learning to the Orientation and Position Control of a 6 Degrees of Freedom Robotic Manipulator

被引：0

作者：

Campos, Felipe Rigueira ^{[1
,4
]}

Fidencio, Aline Xavier ^{[2
]}

Domingues, Jaco ^{[1
,4
]}

Pessin, Gustavo ^{[1
,3
,4
]}

Freitas, Gustavo ^{[1
,5
]}

机构：

[1] Univ Fed Ouro Preto & Inst Tecnol Vale, Programa Posgrad Instrumentacao, Controle & Automacao Proc Mineracao, Ouro Preto, MG, Brazil

[2] Ruhr Univ Bochum, Fac Elect Engn & Informat Technol, Bochum, Germany

[3] Univ Fed Ouro Preto, Dept Comp, Ouro Preto, MG, Brazil

[4] Inst Tecnol Vale, Lab Robot Controle & Instrumentacao, Ouro Preto, MG, Brazil

[5] Univ Fed Minas Gerais, Dept Engn Elect, Belo Horizonte, MG, Brazil

来源：

2022 LATIN AMERICAN ROBOTICS SYMPOSIUM (LARS), 2022 BRAZILIAN SYMPOSIUM ON ROBOTICS (SBR), AND 2022 WORKSHOP ON ROBOTICS IN EDUCATION (WRE) | 2022年

关键词：

Robotics; Machine Learning; Reinforcement Learning; DDPG; PPO;

D O I：

10.1109/LARS/SBR/WRE56824.2022.9995835

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

Applications with autonomous robots play an important role in the industry and in everyday life. Among them, the activities of manipulating and moving objects are highlighted by the wide variety of possible applications. These activities in static and known environments can be implemented through logic planned by the developer, but this is not feasible in dynamic environments. Machine Learning (ML) techniques such as Reinforcement Learning (RL) algorithms have sought to replace the pre-defined programming by teaching the robot how to act. This paper presents the implementation of two RL algorithms, Deep Deterministic Policy Gradient (DDPG) and Proximal Policy Optimization (PPO), for orientation and position control of a 6-degree-of-freedom (6-DoF) robotic manipulator. The results demonstrated that the DDPG have a faster learning convergence in simpler activities, but if the complexity of the problem increases, it might not obtain a satisfactory behavior. On the other hand, PPO can solve more complex problems but it limits the convergence rate to the best result in order to avoid learning instability.

引用

页码：187 / 192

页数：6

共 50 条

[1] Position Domain Synchronization Control of Multi-Degrees of Freedom Robotic Manipulator
Ouyang, P. R.
Pano, V.
JOURNAL OF DYNAMIC SYSTEMS MEASUREMENT AND CONTROL-TRANSACTIONS OF THE ASME, 2014, 136 (02):
[2] Decentralized reinforcement learning control of a robotic manipulator
Busoniu, Lucian
De Schutter, Bart
Babuska, Robert
2006 9TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION, VOLS 1- 5, 2006, : 1121 - +
[3] Design and Manual Control of a 3 Degrees of Freedom Social Robotic Manipulator
Khajehpour, Pouria
Najafi, Esmaeil
2020 21ST INTERNATIONAL CONFERENCE ON RESEARCH AND EDUCATION IN MECHATRONICS (REM), 2020,
[4] A Reinforcement Learning Neural Network for Robotic Manipulator Control
Hu, Yazhou
Si, Bailu
NEURAL COMPUTATION, 2018, 30 (07) : 1983 - 2004
[5] Reinforcement Learning Control for a Robotic Manipulator with Unknown Deadzone
Li, Yanan
Xiao, Shengtao
Ge, Shuzhi Sam
2014 11TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2014, : 593 - 598
[6] Joint torque control for the pneumatically robotic manipulator with 3 degrees of freedom
SMC Pneumatic Center, Harbin Institute of Technology, Harbin 150001, China
不详
Jixie Gongcheng Xuebao, 2008, 12 (254-260): : 254 - 260
[7] Learning force control for position controlled robotic manipulator
Qiao, B
Zhu, JY
Wei, ZX
CIRP ANNALS 1999 - MANUFACTURING TECHNOLOGY, 1999, : 1 - 4
[8] Monitoring and Control of Position and Attitude of Flexible Manipulator with Three Degrees of Freedom
Yang, Bangchu
Wang, Wenbiao
Meng, Hailiang
Wan, Weiwei
Bao, Guanjun
2019 IEEE 4TH INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2019), 2019, : 208 - 213
[9] Learning force control for position controlled robotic manipulator
Nanjing Univ. Aero. and Astronaut., Nanjing, China
CIRP Ann Manuf Technol, 1 (1-4):
[10] Active Fault-Tolerant Control Integrated with Reinforcement Learning Application to Robotic Manipulator
Yan, Zichen
Tan, Junbo
Liang, Bin
Liu, Houde
Yang, Jun
2022 AMERICAN CONTROL CONFERENCE, ACC, 2022, : 2656 - 2662

← 1 2 3 4 5 →