Evolutionary Optimization of Hyperparameters in Deep Learning Models

Authors
Kim, Jin-Young [1]
Cho, Sung-Bae [1]
Affiliations
[1] Yonsei Univ, Dept Comp Sci, Seoul, South Korea
Keywords
Genetic programming; deep learning; neural networks; activation function; optimization technique
DOI
10.1109/cec.2019.8790354
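Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract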
Deep learning has recently become one of the most popular techniques in artificial intelligence. However, constructing a deep learning model requires setting various components, called hyperparameters, including activation functions, optimization methods, and the configuration of the model structure. Because these affect the performance of deep learning, researchers work hard to find optimal hyperparameters when solving problems with it. The activation function and the optimization technique play crucial roles in the forward and backward processes of model learning, yet they are usually chosen heuristically. Previous studies have optimized either the activation function or the optimization technique, but they neglected the relationship between the two and did not search for both at the same time. In this paper, we propose a novel method based on genetic programming that simultaneously finds the optimal activation function and optimization technique. Each individual is composed of two chromosomes, one for the activation function and the other for the optimization technique. To calculate the fitness of an individual, we construct a neural network with the activation function and optimization technique that the individual represents. The deep learning model found by our method achieves accuracies of 82.59% and 53.04% on the CIFAR-10 and CIFAR-100 datasets, respectively, outperforming conventional methods. Moreover, we analyze the activation function found and confirm the usefulness of the proposed method.
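The two-chromosome encoding described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the primitive sets (`UNARY`, `BINARY`), the optimizer candidates (`OPTIMIZERS`), the mutation and crossover operators, and every function name are assumptions made for illustration. In particular, `toy_fitness` is an arbitrary placeholder; in the paper, fitness comes from constructing a neural network with the individual's activation function and optimization technique.

```python
import math
import random
from dataclasses import dataclass

# Hypothetical primitive sets; the abstract does not list the paper's primitives.
UNARY = {
    "identity": lambda x: x,
    "tanh": math.tanh,
    "relu": lambda x: max(0.0, x),
    "neg": lambda x: -x,
}
BINARY = {
    "add": lambda a, b: a + b,
    "mul": lambda a, b: a * b,
    "max": max,
}
OPTIMIZERS = ["sgd", "momentum", "rmsprop", "adam"]  # assumed candidate names


def random_tree(rng, depth=2):
    """Build a random expression tree over the scalar input x."""
    if depth == 0 or rng.random() < 0.3:
        return ("x",)
    if rng.random() < 0.5:
        return (rng.choice(sorted(UNARY)), random_tree(rng, depth - 1))
    return (rng.choice(sorted(BINARY)),
            random_tree(rng, depth - 1), random_tree(rng, depth - 1))


def evaluate(tree, x):
    """Evaluate an expression tree at a scalar input x."""
    if tree[0] == "x":
        return x
    if len(tree) == 2:
        return UNARY[tree[0]](evaluate(tree[1], x))
    return BINARY[tree[0]](evaluate(tree[1], x), evaluate(tree[2], x))


@dataclass(frozen=True)
class Individual:
    activation: tuple  # chromosome 1: activation function as an expression tree
    optimizer: str     # chromosome 2: optimization technique (here just a name)


def random_individual(rng):
    return Individual(random_tree(rng), rng.choice(OPTIMIZERS))


def mutate(ind, rng):
    """Resample exactly one of the two chromosomes."""
    if rng.random() < 0.5:
        return Individual(random_tree(rng), ind.optimizer)
    return Individual(ind.activation, rng.choice(OPTIMIZERS))


def crossover(a, b):
    """Child takes the activation from parent a and the optimizer from parent b."""
    return Individual(a.activation, b.optimizer)


def toy_fitness(ind):
    """Arbitrary placeholder score, NOT the accuracy of a trained network."""
    xs = [i / 2 for i in range(-4, 5)]
    err = sum((evaluate(ind.activation, x) - max(0.0, x)) ** 2 for x in xs)
    bonus = 0.1 if ind.optimizer == "adam" else 0.0  # arbitrary, for illustration only
    return -err + bonus


def evolve(fitness, pop_size=8, generations=5, seed=0):
    """Truncation selection that keeps the top half of the population unchanged."""
    rng = random.Random(seed)
    population = [random_individual(rng) for _ in range(pop_size)]
    for _ in range(generations):
        ranked = sorted(population, key=fitness, reverse=True)
        parents = ranked[: pop_size // 2]
        children = [
            mutate(crossover(rng.choice(parents), rng.choice(parents)), rng)
            for _ in range(pop_size - len(parents))
        ]
        population = parents + children
    return max(population, key=fitness)


best = evolve(toy_fitness)
print(best.activation, best.optimizer)
```

Because both chromosomes live in one individual, selection acts on the pair rather than on each component separately, which is how a joint search can account for the interaction between activation function and optimizer. To approximate the paper's setting, `toy_fitness` would be replaced by a function that trains a network and returns its validation accuracy.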
Pages: 831-837
Page count: 7