Optimizing Reinforcement Learning Control Model in Furuta Pendulum and Transferring it to Real-World

被引:1
|
作者
Hong, Myung Rae [1 ]
Kang, Sanghun [1 ]
Lee, Jingoo [2 ]
Seo, Sungchul [3 ]
Han, Seungyong [1 ]
Koh, Je-Sung [1 ]
Kang, Daeshik [1 ]
机构
[1] Ajou Univ, Dept Mech Engn, Multiscale Bioinspired Technol Lab, Suwon 16499, South Korea
[2] Korea Inst Machinery ad Mat, Dept Sustainable Environm Res, Multiscale Bioinspired Technol Lab, Daejeon 34103, South Korea
[3] Seokyeong Univ, Dept Nanochem Biol & Environm Engn, Seoul 02713, South Korea
基金
新加坡国家研究基金会;
关键词
Furuta pendulum; inverted pendulum problem; reward design; reinforcement learning; Sim2Real;
D O I
10.1109/ACCESS.2023.3310405
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Reinforcement learning does not require explicit robot modeling as it learns on its own based on data, but it has temporal and spatial constraints when transferred to real-world environments. In this research, we trained a balancing Furuta pendulum problem, which is difficult to model, in a virtual environment (Unity) and transferred it to the real world. The challenge of the balancing Furuta pendulum problem is to maintain the pendulum's end effector in a vertical position. We resolved the temporal and spatial constraints by performing reinforcement learning in a virtual environment. Furthermore, we designed a novel reward function that enabled faster and more stable problem-solving compared to the two existing reward functions. We validate each reward function by applying it to the soft actor-critic (SAC) and proximal policy optimization (PPO). The experimental result shows that cosine reward function is trained faster and more stable. Finally, SAC algorithm model using a cosine reward function in the virtual environment is an optimized controller. Additionally, we evaluated the robustness of this model by transferring it to the real environment.
引用
收藏
页码:95195 / 95200
页数:6
相关论文
共 50 条
  • [41] Real-world validation of safe reinforcement learning, model predictive control and decision tree-based home energy management systems
    Ruddick, Julian
    Ceusters, Glenn
    Van Kriekinge, Gilles
    Genov, Evgenii
    De Cauwer, Cedric
    Coosemans, Thierry
    Messagie, Maarten
    ENERGY AND AI, 2024, 18
  • [42] SWINGING UP THE FURUTA PENDULUM AND ITS STABILIZATION VIA MODEL PREDICTIVE CONTROL
    Seman, Pavol
    Rohal'-Ilkiv, Boris
    Juhas, Martin
    Salaj, Michal
    JOURNAL OF ELECTRICAL ENGINEERING-ELEKTROTECHNICKY CASOPIS, 2013, 64 (03): : 152 - 158
  • [43] Comparison of Model-Based and Model-Free Reinforcement Learning for Real-World Dexterous Robotic Manipulation Tasks
    Valencia, David
    Jia, John
    Li, Raymond
    Hayashi, Alex
    Lecchi, Megan
    Terezakis, Reuel
    Gee, Trevor
    Liarokapis, Minas
    MacDonald, Bruce A.
    Williams, Henry
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 871 - 878
  • [44] Iterative Learning Control Strategy for a Furuta Pendulum System with Variable-Order Linearization
    Binz, Ricardo
    Aranovskiy, Stanislav
    IFAC PAPERSONLINE, 2021, 54 (20): : 14 - 19
  • [45] Intrinsically motivated reinforcement learning for human-robot interaction in the real-world
    Qureshi, Ahmed Hussain
    Nakamura, Yutaka
    Yoshikawa, Yuichiro
    Ishiguro, Hiroshi
    NEURAL NETWORKS, 2018, 107 : 23 - 33
  • [46] Non-blocking Asynchronous Training for Reinforcement Learning in Real-World Environments
    Bohm, Peter
    Pounds, Pauline
    Chapman, Archie C.
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 10927 - 10934
  • [47] Domain Adapting Deep Reinforcement Learning for Real-World Speech Emotion Recognition
    Rajapakshe, Thejan
    Rana, Rajib
    Khalifa, Sara
    Schuller, Bjoern W.
    IEEE ACCESS, 2024, 12 : 193101 - 193114
  • [48] STASIS: Reinforcement Learning Simulators for Human-Centric Real-World Environments
    Efstathiadis, Georgios
    Emedom-Nnamdi, Patrick
    Kolbeinsson, Arinbjorn
    Onnela, Jukka-Pekka
    Lu, Junwei
    TRUSTWORTHY MACHINE LEARNING FOR HEALTHCARE, TML4H 2023, 2023, 13932 : 85 - 92
  • [49] Real-World Implementation of Reinforcement Learning Based Energy Coordination for a Cluster of Households
    Gokhale, Gargya
    Tiben, Niels
    Verwee, Marie-Sophie
    Lahariya, Manu
    Claessens, Bert
    Develder, Chris
    PROCEEDINGS OF THE 10TH ACM INTERNATIONAL CONFERENCE ON SYSTEMS FOR ENERGY-EFFICIENT BUILDINGS, CITIES, AND TRANSPORTATION, BUILDSYS 2023, 2023, : 347 - 351
  • [50] Controlling Aluminum Strip Thickness by Clustered Reinforcement Learning With Real-World Dataset
    Xiao, Ziqi
    He, Zhili
    Liang, Huanghuang
    Hu, Chuang
    Cheng, Dazhao
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (08) : 9928 - 9938