A deep reinforcement learning algorithm to control a two-wheeled scooter with a humanoid robot

被引:1
|
作者
Baltes, Jacky [1 ]
Christmann, Guilherme [1 ]
Saeedvand, Saeed [1 ]
机构
[1] Natl Taiwan Normal Univ, Dept Elect Engn, Taipei, Taiwan
关键词
Deep reinforcement learning; Proximal policy optimization (PPO); Two-wheeled vehicles; PID control; Humanoid robotics;
D O I
10.1016/j.engappai.2023.106941
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Balancing a two-wheeled scooter is considered a challenging task for robots, as it is a non-linear control problem in a highly dynamic environment. The rapid pace of development of deep reinforcement learning has enabled robots to perform complex control tasks. In this paper, a deep reinforcement learning algorithm is proposed to learn the steering control of the scooter for balancing and patch tracking using an unmodified humanoid robot. Two control strategies are developed, analyzed, and compared: a classical Proportional- Integral-Derivative (PID) controller and a Deep Reinforcement Learning (DRL) controller based on Proximal Policy Optimization (PPO) algorithm. The ability of the robot to balance the scooter using both approaches is extensively evaluated. Challenging control scenarios are tested at low scooter speeds, including 2.5, 5, and 10 km/h. Steering velocities are also varied, including 10, 20, and 40 rad/s. The evaluations include upright balance without disturbances, upright balance under disturbances, tracking sinusoidal path, and path tracking. A 3D model of the humanoid robot and scooter system is developed, which is simulated in a state-of-the-art GPU-based simulation environment as a training and test bed (NVidia's Isaac Gym). Despite the fact that the PID controller successfully balances the robot, better final results are achieved with the proposed DRL. The results indicate a 52% improvement on average in different speeds with better performance in path tracking control. Controller command evaluation on the real robot and scooter indicates the robot's complete capability to realize steering control velocities.
引用
下载
收藏
页数:20
相关论文
共 50 条
  • [31] Learning to approach a moving ball with a simulated two-wheeled robot
    Flentge, Felix
    ROBOCUP 2005: ROBOT SOCCER WORLD CUP IX, 2006, 4020 : 106 - 117
  • [32] Bionic autonomous learning control of a two-wheeled self-balancing flexible robot
    Jianxian CAI 1
    2.Institute of Disaster Prevention
    Control Theory and Technology, 2011, 9 (04) : 521 - 528
  • [33] Bionic autonomous learning control of a two-wheeled self-balancing flexible robot
    Cai J.
    Ruan X.
    Journal of Control Theory and Applications, 2011, 9 (4): : 521 - 528
  • [34] Humanoid robot control based on reinforcement learning
    Iida, S
    Kuwayama, K
    Kanoh, M
    Kato, S
    Kunitachi, T
    Itoh, H
    PROCEEDINGS OF THE 2004 INTERNATIONAL SYMPOSIUM ON MICRO-NANOMECHATRONICS AND HUMAN SCIENCE, 2004, : 353 - 358
  • [35] Humanoid robot control based on reinforcement learning
    Iida, S. (iida@ics.nitech.ac.jp), IEEE Robotics and Automation Society; Nagoya University, Japan; City of Nagoya, Japan; Nagoya City Science Museum; Chubu Science and Technology Center (Institute of Electrical and Electronics Engineers Inc.):
  • [36] Comparison of two efficient control strategies for two-wheeled balancing robot
    Majczak, Michal
    Wawrzynski, Pawel
    2015 20TH INTERNATIONAL CONFERENCE ON METHODS AND MODELS IN AUTOMATION AND ROBOTICS (MMAR), 2015, : 744 - 749
  • [37] Just-In-Time Predictive Control for a Two-Wheeled Robot
    Nakpong, Nuttapun
    Yamamoto, Shigeru
    2012 TENTH INTERNATIONAL CONFERENCE ON ICT AND KNOWLEDGE ENGINEERING, 2012, : 95 - 98
  • [38] Passivity-Based Control for Two-Wheeled Robot Stabilization
    Uddin, Nur
    Nugroho, Teguh Aryo
    Pramudito, Wahyu Agung
    INTERNATIONAL CONFERENCE ON MECHANICAL, ELECTRONICS, COMPUTER, AND INDUSTRIAL TECHNOLOGY, 2018, 1007
  • [39] Optimal Feedback Control for the Identification of Two-Wheeled Mobile Robot
    Babar, Muhammad Zeeshan
    Baglietto, Marco
    2017 21ST INTERNATIONAL CONFERENCE ON SYSTEM THEORY, CONTROL AND COMPUTING (ICSTCC), 2017, : 112 - 119
  • [40] A Method to Coordinate Balance and Speed Control of a Two-wheeled Robot
    Hong, Shaohui
    Zhao, Guiyi
    Wang, Xiang
    Zhong, Xunyu
    2017 29TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2017, : 5798 - 5803