Adaptive Optimization of Hyper-Parameters for Robotic Manipulation through Evolutionary Reinforcement Learning

被引：0

作者：

Onori, Giulio ^{[1
]}

Shahid, Asad Ali ^{[2
]}

Braghin, Francesco ^{[1
]}

Roveda, Loris ^{[2
]}

机构：

[1] Politecn Milan, Dept Mech Engn, Milan, Italy

[2] Scuola univ, Ist Dalle Molle studi sullintelligenza artificiale, IDSIA USI SUPSI, CH-6962 Lugano, Switzerland

来源：

JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS | 2024年 / 110卷 / 03期

关键词：

Evolutionary learning; Grasping; Reinforcement learning; ERL;

D O I：

10.1007/s10846-024-02138-8

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep Reinforcement Learning applications are growing due to their capability of teaching the agent any task autonomously and generalizing the learning. However, this comes at the cost of a large number of samples and interactions with the environment. Moreover, the robustness of learned policies is usually achieved by a tedious tuning of hyper-parameters and reward functions. In order to address this issue, this paper proposes an evolutionary RL algorithm for the adaptive optimization of hyper-parameters. The policy is trained using an on-policy algorithm, Proximal Policy Optimization (PPO), coupled with an evolutionary algorithm. The achieved results demonstrate an improvement in the sample efficiency of the RL training on a robotic grasping task. In particular, the learning is improved with respect to the baseline case of a non-evolutionary agent. The evolutionary agent needs 60\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$60$$\end{document}% fewer samples to completely learn the grasping task, enabled by the adaptive transfer of knowledge between the agents through the evolutionary algorithm. The proposed approach also demonstrates the possibility of updating reward parameters during training, potentially providing a general approach to creating reward functions.

引用

页数：13

共 50 条

[1] Effects of Hyper-Parameters for Deep Reinforcement Learning in Robotic Motion Mimicry: A Preliminary Study
Kim, Taewoo
Lee, Joo-Haeng
[J]. 2019 16TH INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS (UR), 2019, : 228 - 235
[2] Towards Autonomous Reinforcement Learning: Automatic Setting of Hyper-parameters using Bayesian Optimization
Cruz Barsce, Juan
Palombarini, Jorge A.
Martinez, Ernesto C.
[J]. 2017 XLIII LATIN AMERICAN COMPUTER CONFERENCE (CLEI), 2017,
[3] Continuous optimization of hyper-parameters
Bengio, Y
[J]. IJCNN 2000: PROCEEDINGS OF THE IEEE-INNS-ENNS INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOL I, 2000, : 305 - 310
[4] Optimal Evolutionary Optimization Hyper-parameters to Mimic Human User Behavior
Saha, Sneha
Rios, Thiago
Minku, Leandro L.
Yao, Xin
Xu, Zhao
Sendhoff, Bernhard
Menzel, Stefan
[J]. 2019 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2019), 2019, : 858 - 866
[5] Analysis of Hyper-Parameters for AlphaZero-Like Deep Reinforcement Learning
Wang, Hui
Emmerich, Michael
Preuss, Mike
Plaat, Aske
[J]. INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY & DECISION MAKING, 2023, 22 (02) : 829 - 853
[6] A Meta-Reinforcement Learning Approach to Optimize Parameters and Hyper-parameters Simultaneously
Ali, Abbas Raza
Budka, Marcin
Gabrys, Bogdan
[J]. PRICAI 2019: TRENDS IN ARTIFICIAL INTELLIGENCE, PT II, 2019, 11671 : 93 - 106
[7] Exploiting Parameters Learning for Hyper-parameters Optimization in Deep Neural Networks
Fraccaroli, Michele
Lamma, Evelina
Riguzzi, Fabrizio
[J]. ELECTRONIC PROCEEDINGS IN THEORETICAL COMPUTER SCIENCE, 2022, 364
[8] Evolutionary Optimisation of Kernel and Hyper-Parameters for SVM
Diosan, Laura
Rogozan, Alexandrina
Pecuchet, Jean-Pierre
[J]. MODELLING, COMPUTATION AND OPTIMIZATION IN INFORMATION SYSTEMS AND MANAGEMENT SCIENCES, PROCEEDINGS, 2008, 14 : 107 - 116
[9] Global optimization of hyper-parameters in reservoir computing
Ren, Bin
Ma, Huanfei
[J]. ELECTRONIC RESEARCH ARCHIVE, 2022, 30 (07): : 2719 - 2729
[10] A Framework for Selecting Deep Learning Hyper-parameters
Donoghue, Jim O'
Roantree, Mark
[J]. DATA SCIENCE, 2015, 9147 : 120 - 132

← 1 2 3 4 5 →