A proximal policy optimization based deep reinforcement learning framework for tracking control of a flexible robotic manipulator

被引：0

作者：

Kumar, V. Joshi ^{[1
]}

Elumalai, Vinodh Kumar ^{[1
]}

机构：

[1] Vellore Inst Technol, Sch Elect Engn, Vellore 632014, Tamilnadu, India

来源：

RESULTS IN ENGINEERING | 2025年 / 25卷

关键词：

Deep reinforcement learning; Proximal policy gradient; Policy feedback; Flexible joint manipulator; Vibration suppression;

D O I：

10.1016/j.rineng.2025.104178

中图分类号：

T [工业技术];

学科分类号：

08 ;

摘要：

This paper puts forward a policy feedback based deep reinforcement learning (DRL) control scheme for a partially observable system by leveraging the potentials of proximal policy optimization (PPO) algorithm and convolutional neural network (CNN). Although several DRL algorithms have been investigated for a fully observable system, there has been limited studies on devising a DRL control for a partially observable system with uncertain dynamics. Moreover, the major limitation of the existing policy gradient based DRL techniques is that they are computationally expensive and suffer from scalability issues for complex higher order systems. Hence, in this study, we adopt the PPO technique which utilizes first-order optimization to minimize the computational complexity and devise a DRL scheme for a partially observable flexible link robot manipulator system. Specifically, to improve the stability and convergence in PPO algorithm, this study adopts a collaborative policy approach in the update of value function and presents a collaborative proximal policy optimization (CPPO) algorithm that can address the tracking control and vibration suppression problems in partially observable robotic manipulator system. Identifying the optimal hyper-parameters of DRL using the grid search method, we exploit the capability of CNN in actor-critic architecture to extract the spatial dependencies in the state sequences of the dynamical system and boost the DRL performance. To improve the convergence of the proposed DRL algorithm, this study adopts the Lyapunov based reward shaping technique. The experimental validation on robotic manipulator system through hardware in loop (HIL) testing substantiates that the proposed framework offers faster convergence and better vibration suppression feature compared to the state-of-the-art policy gradient technique and actor-critic technique.

引用

页数：15

共 50 条

[31] Reinforcement Learning Control for a Robotic Manipulator with Unknown Deadzone
Li, Yanan
Xiao, Shengtao
Ge, Shuzhi Sam
2014 11TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2014, : 593 - 598
[32] A Reinforcement Learning Neural Network for Robotic Manipulator Control
Hu, Yazhou
Si, Bailu
NEURAL COMPUTATION, 2018, 30 (07) : 1983 - 2004
[33] Deep Reinforcement Learning Attitude Control of Fixed-Wing UAVs Using Proximal Policy Optimization
Bohn, Eivind
Coates, Erlend M.
Moe, Signe
Johansen, Tor Arne
2019 INTERNATIONAL CONFERENCE ON UNMANNED AIRCRAFT SYSTEMS (ICUAS' 19), 2019, : 523 - 533
[34] Proximal policy optimization through a deep reinforcement learning framework for remedial action schemes of VSC-HVDC
Song, Sungyoon
Jung, Yungun
Jang, Gilsoo
Jung, Seungmin
INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2023, 150
[35] Robotic trajectory tracking control method based on reinforcement learning
Liu W.
Xing G.
Chen H.
Sun H.
Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2018, 24 (08): : 1996 - 2004
[36] Adaptive Fuzzy Backstepping Tracking Control for Flexible Robotic Manipulator
Chang, Wanmin
Li, Yongming
Tong, Shaocheng
IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2021, 8 (12) : 1923 - 1930
[37] Adaptive Fuzzy Backstepping Tracking Control for Flexible Robotic Manipulator
Wanmin Chang
Yongming Li
Shaocheng Tong
IEEE/CAA Journal of Automatica Sinica, 2021, 8 (12) : 1923 - 1930
[38] Control strategy of robotic manipulator based on multi-task reinforcement learning
Wang, Tao
Ruan, Ziming
Wang, Yuyan
Chen, Chong
COMPLEX & INTELLIGENT SYSTEMS, 2025, 11 (03)
[39] ROBOTIC ARM TRAJECTORY TRACKING METHOD BASED ON IMPROVED PROXIMAL POLICY OPTIMIZATION
Zheng, Qingchun
Peng, Zhi
Zhu, Peihao
Zhao, Yangyang
Ma, Wenpeng
PROCEEDINGS OF THE ROMANIAN ACADEMY SERIES A-MATHEMATICS PHYSICS TECHNICAL SCIENCES INFORMATION SCIENCE, 2023, 24 (03): : 235 - 244
[40] Trajectory-Tracking Control of Robotic Systems via Deep Reinforcement Learning
Zhang, Shansi
Sun, Chao
Feng, Zhi
Hu, Guoqiang
PROCEEDINGS OF THE IEEE 2019 9TH INTERNATIONAL CONFERENCE ON CYBERNETICS AND INTELLIGENT SYSTEMS (CIS) ROBOTICS, AUTOMATION AND MECHATRONICS (RAM) (CIS & RAM 2019), 2019, : 386 - 391

← 1 2 3 4 5 →