Cable SCARA Robot Controlled by a Neural Network Using Reinforcement Learning

被引:0
|
作者
Okabe, Eduardo [1 ]
Paiva, Victor [2 ]
Silva-Teixeira, Luis H. [3 ]
Izuka, Jaime [4 ]
机构
[1] Univ Estadual Campinas, Sch Appl Sci, Rua Pedro Zaccaria 1300, BR-13484350 Limeira, Brazil
[2] Univ Estadual Campinas, Sch Mech Engn, Dept Integrated Syst, Rua Mendeleyev 200, BR-13083860 Campinas, Brazil
[3] Univ Estadual Campinas, Sch Mech Engn, Dept Integrated Syst, R Mendeleyev 200, BR-13083860 Campinas, SP, Brazil
[4] Univ Estadual Campinas, Sch Appl Sci, Rua Pedro Zaccaria 1300, BR-13484350 Limeira, SP, Brazil
来源
JOURNAL OF COMPUTATIONAL AND NONLINEAR DYNAMICS | 2023年 / 18卷 / 10期
关键词
Compendex;
D O I
10.1115/1.4063222
中图分类号
TH [机械、仪表工业];
学科分类号
0802 ;
摘要
In this work, three reinforcement learning algorithms (Proximal Policy Optimization, Soft Actor-Critic, and Twin Delayed Deep Deterministic Policy Gradient) are employed to control a two link selective compliance articulated robot arm (SCARA) robot. This robot has three cables attached to its end-effector, which creates a triangular shaped workspace. Positioning the end-effector in the workspace is a relatively simple kinematic problem, but moving outside this region, although possible, requires a nonlinear dynamic model and a state-of-the-art controller. To solve this problem in a simple manner, reinforcement learning algorithms are used to find possible trajectories for three targets out of the workspace. Additionally, the SCARA mechanism offers two possible configurations for each end-effector position. The algorithm results are compared in terms of displacement error, velocity, and standard deviation among ten trajectories provided by the trained network. The results indicate the Proximal Policy Algorithm as the most consistent in the analyzed situations. Still, the Soft Actor-Critic presented better solutions, and Twin Delayed Deep Deterministic Policy Gradient provided interesting and more unusual trajectories.
引用
收藏
页数:7
相关论文
共 50 条
  • [41] Reinforcement Learning Using the Stochastic Fuzzy Min–Max Neural Network
    Aristidis Likas
    Neural Processing Letters, 2001, 13 : 213 - 220
  • [42] Performance optimization of function localization neural network by using reinforcement learning
    Sasakawa, T
    Hu, JL
    Hirasawa, K
    Proceedings of the International Joint Conference on Neural Networks (IJCNN), Vols 1-5, 2005, : 1314 - 1319
  • [43] Intelligent scheduling using a neural network model in conjunction with reinforcement learning
    Fourie, CJ
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART B-JOURNAL OF ENGINEERING MANUFACTURE, 2005, 219 (02) : 229 - 235
  • [44] Category learning in a recurrent neural network with reinforcement learning
    Zhang, Ying
    Pan, Xiaochuan
    Wang, Yihong
    FRONTIERS IN PSYCHIATRY, 2022, 13
  • [45] Reinforcement Learning Adaptive Control for Upper Limb Rehabilitation Robot Based on Fuzzy Neural Network
    Meng Fan-cheng
    Dai Ya-ping
    PROCEEDINGS OF THE 31ST CHINESE CONTROL CONFERENCE, 2012, : 5157 - 5161
  • [46] Simulation of Mobile Robot Navigation Utilizing Reinforcement and Unsupervised Weightless Neural Network Learning Algorithm
    Yusof, Yusman
    Mansor, H. M. Asri H.
    Baba, H. M. Dani
    2015 IEEE STUDENT CONFERENCE ON RESEARCH AND DEVELOPMENT (SCORED), 2015, : 123 - 128
  • [47] Modular neural network and classical reinforcement learning for autonomous robot navigation:: Inhibiting undesirable behaviors
    Antonelo, Eric A.
    Baerveldt, Albert-Jan
    Rognvaldsson, Thorsteinn
    Figueiredo, Mauricio
    2006 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORK PROCEEDINGS, VOLS 1-10, 2006, : 498 - +
  • [48] Upgrade of a scara robot using Orocos
    Tavares, Dalton Matsuo
    Aroca, Rafael Vidal
    de Paula Caurin, Glauco Augusto
    PROCEEDINGS OF THE 13TH IASTED INTERNATIONAL CONFERENCE ON ROBOTICS AND APPLICATIONS/PROCEEDINGS OF THE IASTED INTERNATIONAL CONFERENCE ON TELEMATICS, 2007, : 252 - 257
  • [49] Simulation of a SCARA robot with PD and learning controllers
    Yamacli, Serhan
    Canbolat, Huseyin
    SIMULATION MODELLING PRACTICE AND THEORY, 2008, 16 (09) : 1477 - 1487
  • [50] A framework for the robot skill learning using reinforcement learning
    Wei, YZ
    Zhao, MY
    FIFTH INTERNATIONAL SYMPOSIUM ON INSTRUMENTATION AND CONTROL TECHNOLOGY, 2003, 5253 : 910 - 914