Optimizing Reinforcement Learning Control Model in Furuta Pendulum and Transferring it to Real-World

被引:1
|
作者
Hong, Myung Rae [1 ]
Kang, Sanghun [1 ]
Lee, Jingoo [2 ]
Seo, Sungchul [3 ]
Han, Seungyong [1 ]
Koh, Je-Sung [1 ]
Kang, Daeshik [1 ]
机构
[1] Ajou Univ, Dept Mech Engn, Multiscale Bioinspired Technol Lab, Suwon 16499, South Korea
[2] Korea Inst Machinery ad Mat, Dept Sustainable Environm Res, Multiscale Bioinspired Technol Lab, Daejeon 34103, South Korea
[3] Seokyeong Univ, Dept Nanochem Biol & Environm Engn, Seoul 02713, South Korea
基金
新加坡国家研究基金会;
关键词
Furuta pendulum; inverted pendulum problem; reward design; reinforcement learning; Sim2Real;
D O I
10.1109/ACCESS.2023.3310405
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Reinforcement learning does not require explicit robot modeling as it learns on its own based on data, but it has temporal and spatial constraints when transferred to real-world environments. In this research, we trained a balancing Furuta pendulum problem, which is difficult to model, in a virtual environment (Unity) and transferred it to the real world. The challenge of the balancing Furuta pendulum problem is to maintain the pendulum's end effector in a vertical position. We resolved the temporal and spatial constraints by performing reinforcement learning in a virtual environment. Furthermore, we designed a novel reward function that enabled faster and more stable problem-solving compared to the two existing reward functions. We validate each reward function by applying it to the soft actor-critic (SAC) and proximal policy optimization (PPO). The experimental result shows that cosine reward function is trained faster and more stable. Finally, SAC algorithm model using a cosine reward function in the virtual environment is an optimized controller. Additionally, we evaluated the robustness of this model by transferring it to the real environment.
引用
收藏
页码:95195 / 95200
页数:6
相关论文
共 50 条
  • [21] Real-World Human-Robot Collaborative Reinforcement Learning
    Shafti, Ali
    Tjomsland, Jonas
    Dudley, William
    Faisal, A. Aldo
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 11161 - 11166
  • [22] ContainerGym: A Real-World Reinforcement Learning Benchmark for Resource Allocation
    Pendyala, Abhijeet
    Dettmer, Justin
    Glasmachers, Tobias
    Atamna, Asma
    MACHINE LEARNING, OPTIMIZATION, AND DATA SCIENCE, LOD 2023, PT I, 2024, 14505 : 78 - 92
  • [23] A Real-World Quadrupedal Locomotion Benchmark for Offline Reinforcement Learning
    Zhang, Hongyin
    Yang, Shuyu
    Wang, Donglin
    2024 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN 2024, 2024,
  • [24] Assessment of Reward Functions for Reinforcement Learning Traffic Signal Control under Real-World Limitations
    Egea, Alvaro Cabrejas
    Howell, Shaun
    Knutins, Maksis
    Connaughton, Colm
    2020 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2020, : 965 - 972
  • [25] Design of a Decoupling Fuzzy Control Scheme for Omnidirectional Inverted Pendulum Real-World Control
    Chiu, Chih-Hui
    Hung, Yao-Ting
    Peng, Ya-Fu
    IEEE ACCESS, 2021, 9 : 26083 - 26092
  • [26] Guiding real-world reinforcement learning for in-contact manipulation tasks with Shared Control Templates
    Padalkar, Abhishek
    Quere, Gabriel
    Raffin, Antonin
    Silverio, Joao
    Stulp, Freek
    AUTONOMOUS ROBOTS, 2024, 48 (4-5)
  • [27] Enhancing Real-world Inverted Pendulum Stabilization: Addressing External Perturbations with Feedback and Model Predictive Control
    Nazare, Thalita
    Gadelha, Josefredo
    Nepomuceno, Erivelton
    2023 21ST IEEE INTERREGIONAL NEWCAS CONFERENCE, NEWCAS, 2023,
  • [28] Real-world Robot Reaching Skill Learning Based on Deep Reinforcement Learning
    Liu, Naijun
    Lu, Tao
    Cai, Yinghao
    Wang, Rui
    Wang, Shuo
    PROCEEDINGS OF THE 32ND 2020 CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2020), 2020, : 4780 - 4784
  • [29] Reinforcement learning to achieve real-time control of triple inverted pendulum
    Baek, Jongchan
    Lee, Changhyeon
    Lee, Young Sam
    Jeon, Soo
    Han, Soohee
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 128
  • [30] Nonlinear Convex Control of the Furuta Pendulum Based on its Descriptor Model
    Carlos Arceo, Juan
    Vazquez, David
    Estrada-Manzo, Victor
    Marquez, Raymundo
    Bernal, Miguel
    2016 13TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING, COMPUTING SCIENCE AND AUTOMATIC CONTROL (CCE), 2016,