Optimizing Reinforcement Learning Control Model in Furuta Pendulum and Transferring it to Real-World

被引:1
|
作者
Hong, Myung Rae [1 ]
Kang, Sanghun [1 ]
Lee, Jingoo [2 ]
Seo, Sungchul [3 ]
Han, Seungyong [1 ]
Koh, Je-Sung [1 ]
Kang, Daeshik [1 ]
机构
[1] Ajou Univ, Dept Mech Engn, Multiscale Bioinspired Technol Lab, Suwon 16499, South Korea
[2] Korea Inst Machinery ad Mat, Dept Sustainable Environm Res, Multiscale Bioinspired Technol Lab, Daejeon 34103, South Korea
[3] Seokyeong Univ, Dept Nanochem Biol & Environm Engn, Seoul 02713, South Korea
基金
新加坡国家研究基金会;
关键词
Furuta pendulum; inverted pendulum problem; reward design; reinforcement learning; Sim2Real;
D O I
10.1109/ACCESS.2023.3310405
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Reinforcement learning does not require explicit robot modeling as it learns on its own based on data, but it has temporal and spatial constraints when transferred to real-world environments. In this research, we trained a balancing Furuta pendulum problem, which is difficult to model, in a virtual environment (Unity) and transferred it to the real world. The challenge of the balancing Furuta pendulum problem is to maintain the pendulum's end effector in a vertical position. We resolved the temporal and spatial constraints by performing reinforcement learning in a virtual environment. Furthermore, we designed a novel reward function that enabled faster and more stable problem-solving compared to the two existing reward functions. We validate each reward function by applying it to the soft actor-critic (SAC) and proximal policy optimization (PPO). The experimental result shows that cosine reward function is trained faster and more stable. Finally, SAC algorithm model using a cosine reward function in the virtual environment is an optimized controller. Additionally, we evaluated the robustness of this model by transferring it to the real environment.
引用
收藏
页码:95195 / 95200
页数:6
相关论文
共 50 条
  • [31] Real-World Robot Control and Data Augmentation by World-Model Learning from Play
    Nomura, Yuta
    Murata, Shingo
    2023 IEEE INTERNATIONAL CONFERENCE ON DEVELOPMENT AND LEARNING, ICDL, 2023, : 133 - 138
  • [32] Optimizing Resource Allocation Policies in Real-World Business Processes Using Hybrid Process Simulation and Deep Reinforcement Learning
    Meneghello, Francesca
    Middelhuis, Jeroen
    Genga, Laura
    Bukhsh, Zaharah
    Ronzani, Massimiliano
    Di Francescomarino, Chiara
    Ghidini, Chiara
    Dijkman, Remco
    BUSINESS PROCESS MANAGEMENT, BPM 2024, 2024, 14940 : 167 - 184
  • [33] Deep Offline Reinforcement Learning for Real-world Treatment Optimization Applications
    Nambiar, Mila
    Ghosh, Supriyo
    Ong, Priscilla
    Chan, Yu En
    Bee, Yong Mong
    Krishnaswamy, Pavitra
    PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 4673 - 4684
  • [34] Deep reinforcement learning for real-world quadrupedal locomotion: a comprehensive review
    Zhang, Hongyin
    He, Li
    Wang, Donglin
    INTELLIGENCE & ROBOTICS, 2022, 2 (03):
  • [35] Tackling Real-World Autonomous Driving using Deep Reinforcement Learning
    Maramotti, Paolo
    Capasso, Alessandro Paolo
    Bacchiani, Giulio
    Broggi, Alberto
    2022 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2022, : 1274 - 1281
  • [36] Real-world dexterous object manipulation based deep reinforcement learning
    Yao, Qingfeng
    Wang, Jilong
    Yang, Shuyu
    arXiv, 2021,
  • [37] Simulation-Based Reinforcement Learning for Real-World Autonomous Driving
    Osinski, Blazej
    Jakubowski, Adam
    Ziecina, Pawel
    Milos, Piotr
    Galias, Christopher
    Homoceanu, Silviu
    Michalewski, Henryk
    2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 6411 - 6418
  • [38] Decentralized Circle Formation Control for Fish-like Robots in the Real-world via Reinforcement Learning
    Zhang, Tianhao
    Li, Yueheng
    Li, Shuai
    Ye, Qiwei
    Wang, Chen
    Xie, Guangming
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 8814 - 8820
  • [39] Demonstration of Intelligent HVAC Load Management With Deep Reinforcement Learning: Real-World Experience of Machine Learning in Demand Control
    Du, Yan
    Li, Fangxing
    Kurte, Kuldeep
    Munk, Jeffrey
    Zandi, Helia
    IEEE POWER & ENERGY MAGAZINE, 2022, 20 (03): : 42 - 53
  • [40] Intelligent Navigation of a Magnetic Microrobot with Model-Free Deep Reinforcement Learning in a Real-World Environment
    Salehi, Amar
    Hosseinpour, Soleiman
    Tabatabaei, Nasrollah
    Soltani Firouz, Mahmoud
    Yu, Tingting
    MICROMACHINES, 2024, 15 (01)