Deep Reinforcement Learning Based Self-Configuring Integral Sliding Mode Control Scheme for Robot Manipulators

被引:0
|
作者
Sangiovanni, Bianca [1 ]
Incremona, Gian Paolo [2 ]
Ferrara, Antonella [1 ]
Piastra, Marco [1 ]
机构
[1] Univ Pavia, Dipartimento Ingn Ind & Informaz, Via Ferrata 3-5, I-27100 Pavia, Italy
[2] Politecn Milan, Dipartimento Elettron Informaz & Bioingn, Piazza Leonardo da Vinci 32, I-20133 Milan, Italy
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper deals with the design of an intelligent self-configuring control scheme for robot manipulators. The scheme features two control structures: one of centralized type, implementing the inverse dynamics approach, the other of decentralized type. In both control structures, the controller is based on Integral Sliding Mode (ISM), so that matched disturbances and uncertain terms, due to unmodeled dynamics or couplings effects, are suitably compensated. The use of the ISM control also enables the exploitation of its capability of acting as a "perturbation estimator" which, in the considered case, allows us to design a Deep Reinforcement Learning (DRL) based decision making mechanism. It implements a switching rule, based on an appropriate reward function, in order to choose one of the two control structures present in the scheme, depending on the requested robot performances. The proposed scheme can accommodate a variety of velocity and acceleration requirements, in contrast with the genuine decentralized or centralized control structures taken individually. The assessment of our proposal has been carried out relying on a model of the industrial robot manipulator COMAU SMART3-S2, identified on the basis of real data and with realistic sensor noise.
引用
收藏
页码:5969 / 5974
页数:6
相关论文
共 50 条
  • [31] Sliding mode control of position commanded robot manipulators
    Adhikary, Nabanita
    Mahanta, Chitralekha
    CONTROL ENGINEERING PRACTICE, 2018, 81 : 183 - 198
  • [32] Neuro-sliding Mode Control for Robot Manipulators
    Jung, Seul
    2016 16TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS), 2016, : 907 - 911
  • [33] Shared Control of Robot Manipulators With Obstacle Avoidance: A Deep Reinforcement Learning Approach
    Rubagotti, Matteo
    Sangiovanni, Bianca
    Nurbayeva, Aigerim
    Incremona, Gian Paolo
    Ferrara, Antonella
    Shintemirov, Almas
    IEEE CONTROL SYSTEMS MAGAZINE, 2023, 43 (01): : 44 - 63
  • [34] Sliding mode based fault diagnosis with deep reinforcement learning add-ons for intrinsically redundant manipulators
    Sacchi, Nikolas
    Incremona, Gian Paolo
    Ferrara, Antonella
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2023, 33 (15) : 9109 - 9127
  • [35] Implementation of adaptive fault-tolerant tracking control for robot manipulators with integral sliding mode
    Liu, Linzhi
    Zhang, Liyin
    Hou, Yinlong
    Tang, Dafeng
    Liu, Hui
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2023, 33 (10) : 5337 - 5364
  • [36] Tensor Product Model Transformation based Integral Sliding Mode Control with Reinforcement Learning Strategy
    Zhao Guoliang
    Zhao Can
    Wang Degang
    2014 33RD CHINESE CONTROL CONFERENCE (CCC), 2014, : 77 - 82
  • [37] Finite time control scheme for robot manipulators using fast terminal sliding mode control and RBFNN
    Ruchika
    Kumar N.
    International Journal of Dynamics and Control, 2019, 7 (02) : 758 - 766
  • [38] Position synchronised control of multiple robotic manipulators based on integral sliding mode
    Zhao, Dongya
    Zhu, Quanmin
    INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2014, 45 (03) : 556 - 570
  • [39] Vision-based reinforcement learning control of soft robot manipulators
    Li, Jinzhou
    Ma, Jie
    Hu, Yujie
    Zhang, Li
    Liu, Zhijie
    Sun, Shiying
    ROBOTIC INTELLIGENCE AND AUTOMATION, 2024, : 783 - 790
  • [40] A fuzzy adaptive sliding mode control scheme for 2-DOF underactuated robot manipulators
    Lin, Zhuang
    Zhu, Qidan
    Xing, Zhuoyi
    DYNAMICS OF CONTINUOUS DISCRETE AND IMPULSIVE SYSTEMS-SERIES B-APPLICATIONS & ALGORITHMS, 2006, 13E : 489 - 493