Efficient hyperparameter optimization through model-based reinforcement learning

被引：44

作者：

Wu, Jia ^{[1
]}

Chen, SenPeng ^{[1
]}

Liu, XiYuan ^{[1
]}

机构：

[1] Univ Elect Sci & Technol China, Sch Informat & Software Engn, Chengdu, Peoples R China

来源：

NEUROCOMPUTING | 2020年 / 409卷

关键词：

Hyperparameter optimization; Machine learning; Reinforcement learning;

D O I：

10.1016/j.neucom.2020.06.064

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Hyperparameter tuning is critical for the performance of machine learning algorithms. However, a noticeable limitation is the high computational cost of algorithm evaluation for complex models or for large datasets, which makes the tuning process highly inefficient. In this paper, we propose a novel model-based method for efficient hyperparameter optimization. Firstly, we frame this optimization process as a reinforcement learning problem and then employ an agent to tune hyperparameters sequentially. In addition, a model that learns how to evaluate an algorithm is used to speed up the training. However, model inaccuracy is further exacerbated by long-term use, resulting in collapse performance. We propose a novel method for controlling the model use by measuring the impact of the model on the policy and limiting it to a proper range. Thus, the horizon of the model use can be dynamically adjusted. We apply the proposed method to tune the hyperparameters of the extreme gradient boosting and convolutional neural networks on 101 tasks. The experimental results verify that the proposed method achieves the highest accuracy on 86.1% of the tasks, compared with other state-of-the-art methods and the average ranking of runtime is significant lower than all methods by using the predictive model. (C) 2020 Elsevier B.V. All rights reserved.

引用

页码：381 / 393

页数：13

共 50 条

[21] Efficient Exploration in Continuous-time Model-based Reinforcement Learning
Treven, Lenart
Hubotter, Jonas
Sukhija, Bhavya
Dorfler, Florian
Krause, Andreas
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[22] Efficient Model-Based Deep Reinforcement Learning with Variational State Tabulation
Corneil, Dane
Gerstner, Wulfram
Brea, Johanni
[J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
[23] A Novel HashedNets Model Based on the Efficient Hyperparameter Optimization
Fang, Qin
Chen, Jianxia
Ma, Zhongbao
Li, Chao
Zhang, Jie
Chen, Yixin
Lv, Qiang
[J]. 2017 4TH INTERNATIONAL CONFERENCE ON SYSTEMS AND INFORMATICS (ICSAI), 2017, : 1146 - 1151
[24] Provably Efficient Causal Model-Based Reinforcement Learning for Systematic Generalization
Mutti, Mirco
De Santi, Riccardo
Rossi, Emanuele
Calderon, Juan Felipe
Bronstein, Michael
Restelli, Marcello
[J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 8, 2023, : 9251 - 9259
[25] Generalized Population-Based Training for Hyperparameter Optimization in Reinforcement Learning
Bai, Hui
Cheng, Ran
[J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024,
[26] Bayesian Optimistic Optimization: Optimistic Exploration for Model-based Reinforcement Learning
Wu, Chenyang
Li, Tianci
Zhang, Zongzhang
Yu, Yang
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[27] BiES: Adaptive Policy Optimization for Model-Based Offline Reinforcement Learning
Yang, Yijun
Jiang, Jing
Wang, Zhuowei
Duan, Qiqi
Shi, Yuhui
[J]. AI 2021: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, 13151 : 570 - 581
[28] A Deep Reinforcement Learning Model-Based Optimization Method for Graphic Design
Guo, Qi
Wang, Zhen
[J]. Informatica (Slovenia), 2024, 48 (05): : 121 - 134
[29] A survey on model-based reinforcement learning
Fan-Ming Luo
Tian Xu
Hang Lai
Xiong-Hui Chen
Weinan Zhang
Yang Yu
[J]. Science China Information Sciences, 2024, 67
[30] Nonparametric model-based reinforcement learning
Atkeson, CG
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 10, 1998, 10 : 1008 - 1014

← 1 2 3 4 5 →