Adaptive Optimization of Hyper-Parameters for Robotic Manipulation through Evolutionary Reinforcement Learning

被引:0
|
作者
Onori, Giulio [1 ]
Shahid, Asad Ali [2 ]
Braghin, Francesco [1 ]
Roveda, Loris [2 ]
机构
[1] Politecn Milan, Dept Mech Engn, Milan, Italy
[2] Scuola univ, Ist Dalle Molle studi sullintelligenza artificiale, IDSIA USI SUPSI, CH-6962 Lugano, Switzerland
关键词
Evolutionary learning; Grasping; Reinforcement learning; ERL;
D O I
10.1007/s10846-024-02138-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep Reinforcement Learning applications are growing due to their capability of teaching the agent any task autonomously and generalizing the learning. However, this comes at the cost of a large number of samples and interactions with the environment. Moreover, the robustness of learned policies is usually achieved by a tedious tuning of hyper-parameters and reward functions. In order to address this issue, this paper proposes an evolutionary RL algorithm for the adaptive optimization of hyper-parameters. The policy is trained using an on-policy algorithm, Proximal Policy Optimization (PPO), coupled with an evolutionary algorithm. The achieved results demonstrate an improvement in the sample efficiency of the RL training on a robotic grasping task. In particular, the learning is improved with respect to the baseline case of a non-evolutionary agent. The evolutionary agent needs 60\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$60$$\end{document}% fewer samples to completely learn the grasping task, enabled by the adaptive transfer of knowledge between the agents through the evolutionary algorithm. The proposed approach also demonstrates the possibility of updating reward parameters during training, potentially providing a general approach to creating reward functions.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Effects of Hyper-Parameters for Deep Reinforcement Learning in Robotic Motion Mimicry: A Preliminary Study
    Kim, Taewoo
    Lee, Joo-Haeng
    [J]. 2019 16TH INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS (UR), 2019, : 228 - 235
  • [2] Towards Autonomous Reinforcement Learning: Automatic Setting of Hyper-parameters using Bayesian Optimization
    Cruz Barsce, Juan
    Palombarini, Jorge A.
    Martinez, Ernesto C.
    [J]. 2017 XLIII LATIN AMERICAN COMPUTER CONFERENCE (CLEI), 2017,
  • [3] Continuous optimization of hyper-parameters
    Bengio, Y
    [J]. IJCNN 2000: PROCEEDINGS OF THE IEEE-INNS-ENNS INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOL I, 2000, : 305 - 310
  • [4] Optimal Evolutionary Optimization Hyper-parameters to Mimic Human User Behavior
    Saha, Sneha
    Rios, Thiago
    Minku, Leandro L.
    Yao, Xin
    Xu, Zhao
    Sendhoff, Bernhard
    Menzel, Stefan
    [J]. 2019 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2019), 2019, : 858 - 866
  • [5] Analysis of Hyper-Parameters for AlphaZero-Like Deep Reinforcement Learning
    Wang, Hui
    Emmerich, Michael
    Preuss, Mike
    Plaat, Aske
    [J]. INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY & DECISION MAKING, 2023, 22 (02) : 829 - 853
  • [6] A Meta-Reinforcement Learning Approach to Optimize Parameters and Hyper-parameters Simultaneously
    Ali, Abbas Raza
    Budka, Marcin
    Gabrys, Bogdan
    [J]. PRICAI 2019: TRENDS IN ARTIFICIAL INTELLIGENCE, PT II, 2019, 11671 : 93 - 106
  • [7] Exploiting Parameters Learning for Hyper-parameters Optimization in Deep Neural Networks
    Fraccaroli, Michele
    Lamma, Evelina
    Riguzzi, Fabrizio
    [J]. ELECTRONIC PROCEEDINGS IN THEORETICAL COMPUTER SCIENCE, 2022, 364
  • [8] Evolutionary Optimisation of Kernel and Hyper-Parameters for SVM
    Diosan, Laura
    Rogozan, Alexandrina
    Pecuchet, Jean-Pierre
    [J]. MODELLING, COMPUTATION AND OPTIMIZATION IN INFORMATION SYSTEMS AND MANAGEMENT SCIENCES, PROCEEDINGS, 2008, 14 : 107 - 116
  • [9] Global optimization of hyper-parameters in reservoir computing
    Ren, Bin
    Ma, Huanfei
    [J]. ELECTRONIC RESEARCH ARCHIVE, 2022, 30 (07): : 2719 - 2729
  • [10] A Framework for Selecting Deep Learning Hyper-parameters
    Donoghue, Jim O'
    Roantree, Mark
    [J]. DATA SCIENCE, 2015, 9147 : 120 - 132