Combining reinforcement learning and differential inverse kinematics for collision-free motion of multilink manipulators

被引:0
|
作者
Martin, P [1 ]
Millan, JD
机构
[1] Univ Jaume 1, Dept Comp Sci, Castellon 12071, Spain
[2] European Commiss, Joint Res Ctr, Inst Syst Informat & Safety, I-21020 Ispra, VA, Italy
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a class of neural controllers that learn goal-oriented obstacle-avoiding strategies for multilink manipulators. They acquire these strategies on-line through reinforcement learning from local sensory data. These controllers are mainly mode of two neural modules: a reinforcement-based action generator and a module for differential inverse kinematics (DIV). The action generator generates actions with regard to a goal vector in the manipulator joint space. Suitable goal vectors are provided by the DIV module. This module is based on the inversion of a neural network that has been previously trained to approximate the manipulator forward kinematics in polar coordinates. Results for two-and three-link planar manipulators are shown. These controllers achieve a good performance quite rapidly and exhibit good generalization capabilities in the face of new environments.
引用
收藏
页码:1324 / 1333
页数:10
相关论文
共 50 条
  • [21] Collision-Free Path Planning for Multiple Drones Based on Safe Reinforcement Learning
    Chen, Hong
    Huang, Dan
    Wang, Chenggang
    Ding, Lu
    Song, Lei
    Liu, Hongtao
    [J]. DRONES, 2024, 8 (09)
  • [22] Improved reinforcement learning for collision-free local path planning of dynamic obstacle
    Yang, Xiao
    Han, Qilong
    [J]. OCEAN ENGINEERING, 2023, 283
  • [23] COLLISION-FREE UAV NAVIGATION WITH A MONOCULAR CAMERA USING DEEP REINFORCEMENT LEARNING
    Chen, Yun
    Gonzalez-Prelcic, Nuria
    Heath, Robert W., Jr.
    [J]. PROCEEDINGS OF THE 2020 IEEE 30TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2020,
  • [24] Collision avoiding continuation method for the inverse kinematics of redundant manipulators
    Müller, A
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1- 5, PROCEEDINGS, 2004, : 1593 - 1598
  • [25] A Learning Framework to Inverse Kinematics of Redundant Manipulators
    Jiokou, G. K. A.
    Melingui, A.
    Lakhal, O.
    Kom, M.
    Merzouki, R.
    [J]. IFAC PAPERSONLINE, 2020, 53 (02): : 9912 - 9917
  • [26] Collision-free motion coordination of heterogeneous robots
    Nak Yong Ko
    Dong Jin Seo
    Reid G. Simmons
    [J]. Journal of Mechanical Science and Technology, 2008, 22 : 2090 - 2098
  • [27] Generation of adjacent configurations for a collision-free path planning of manipulators
    Ceccarelli, M
    Valero, F
    Mata, V
    Cuadrado, I
    [J]. ROBOTICA, 1996, 14 : 391 - 396
  • [28] Collision-Free Compliance Control for Redundant Manipulators: An Optimization Case
    Zhou, Xuefeng
    Xu, Zhihao
    Lie, Shuai
    [J]. FRONTIERS IN NEUROROBOTICS, 2019, 13
  • [29] Planning of a point to point collision-free trajectory for mobile manipulators
    Pajak, Grzegorz
    Pajak, Iwona
    [J]. 2015 10TH INTERNATIONAL WORKSHOP ON ROBOT MOTION AND CONTROL (ROMOCO), 2015, : 142 - 147
  • [30] Collision-free motion coordination of heterogeneous robots
    Ko, Nak Yong
    Seo, Dong Jin
    Simmons, Reid G.
    [J]. JOURNAL OF MECHANICAL SCIENCE AND TECHNOLOGY, 2008, 22 (11) : 2090 - 2098