A deep reinforcement learning algorithm to control a two-wheeled scooter with a humanoid robot

被引：1

作者：

Baltes, Jacky ^{[1
]}

Christmann, Guilherme ^{[1
]}

Saeedvand, Saeed ^{[1
]}

机构：

[1] Natl Taiwan Normal Univ, Dept Elect Engn, Taipei, Taiwan

来源：

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE | 2023年 / 126卷

关键词：

Deep reinforcement learning; Proximal policy optimization (PPO); Two-wheeled vehicles; PID control; Humanoid robotics;

D O I：

10.1016/j.engappai.2023.106941

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Balancing a two-wheeled scooter is considered a challenging task for robots, as it is a non-linear control problem in a highly dynamic environment. The rapid pace of development of deep reinforcement learning has enabled robots to perform complex control tasks. In this paper, a deep reinforcement learning algorithm is proposed to learn the steering control of the scooter for balancing and patch tracking using an unmodified humanoid robot. Two control strategies are developed, analyzed, and compared: a classical Proportional- Integral-Derivative (PID) controller and a Deep Reinforcement Learning (DRL) controller based on Proximal Policy Optimization (PPO) algorithm. The ability of the robot to balance the scooter using both approaches is extensively evaluated. Challenging control scenarios are tested at low scooter speeds, including 2.5, 5, and 10 km/h. Steering velocities are also varied, including 10, 20, and 40 rad/s. The evaluations include upright balance without disturbances, upright balance under disturbances, tracking sinusoidal path, and path tracking. A 3D model of the humanoid robot and scooter system is developed, which is simulated in a state-of-the-art GPU-based simulation environment as a training and test bed (NVidia's Isaac Gym). Despite the fact that the PID controller successfully balances the robot, better final results are achieved with the proposed DRL. The results indicate a 52% improvement on average in different speeds with better performance in path tracking control. Controller command evaluation on the real robot and scooter indicates the robot's complete capability to realize steering control velocities.

引用

下载

页数：20

共 50 条

[31] Learning to approach a moving ball with a simulated two-wheeled robot
Flentge, Felix
ROBOCUP 2005: ROBOT SOCCER WORLD CUP IX, 2006, 4020 : 106 - 117
[32] Bionic autonomous learning control of a two-wheeled self-balancing flexible robot
Jianxian CAI 1
2.Institute of Disaster Prevention
Control Theory and Technology, 2011, 9 (04) : 521 - 528
[33] Bionic autonomous learning control of a two-wheeled self-balancing flexible robot
Cai J.
Ruan X.
Journal of Control Theory and Applications, 2011, 9 (4): : 521 - 528
[34] Humanoid robot control based on reinforcement learning
Iida, S
Kuwayama, K
Kanoh, M
Kato, S
Kunitachi, T
Itoh, H
PROCEEDINGS OF THE 2004 INTERNATIONAL SYMPOSIUM ON MICRO-NANOMECHATRONICS AND HUMAN SCIENCE, 2004, : 353 - 358
[35] Humanoid robot control based on reinforcement learning
Iida, S. (iida@ics.nitech.ac.jp), IEEE Robotics and Automation Society; Nagoya University, Japan; City of Nagoya, Japan; Nagoya City Science Museum; Chubu Science and Technology Center (Institute of Electrical and Electronics Engineers Inc.):
[36] Comparison of two efficient control strategies for two-wheeled balancing robot
Majczak, Michal
Wawrzynski, Pawel
2015 20TH INTERNATIONAL CONFERENCE ON METHODS AND MODELS IN AUTOMATION AND ROBOTICS (MMAR), 2015, : 744 - 749
[37] Just-In-Time Predictive Control for a Two-Wheeled Robot
Nakpong, Nuttapun
Yamamoto, Shigeru
2012 TENTH INTERNATIONAL CONFERENCE ON ICT AND KNOWLEDGE ENGINEERING, 2012, : 95 - 98
[38] Passivity-Based Control for Two-Wheeled Robot Stabilization
Uddin, Nur
Nugroho, Teguh Aryo
Pramudito, Wahyu Agung
INTERNATIONAL CONFERENCE ON MECHANICAL, ELECTRONICS, COMPUTER, AND INDUSTRIAL TECHNOLOGY, 2018, 1007
[39] Optimal Feedback Control for the Identification of Two-Wheeled Mobile Robot
Babar, Muhammad Zeeshan
Baglietto, Marco
2017 21ST INTERNATIONAL CONFERENCE ON SYSTEM THEORY, CONTROL AND COMPUTING (ICSTCC), 2017, : 112 - 119
[40] A Method to Coordinate Balance and Speed Control of a Two-wheeled Robot
Hong, Shaohui
Zhao, Guiyi
Wang, Xiang
Zhong, Xunyu
2017 29TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2017, : 5798 - 5803

← 1 2 3 4 5 →