Adaptive Dimensional Gaussian Mutation of PSO-Optimized Convolutional Neural Network Hyperparameters

Cited by: 2
Authors
Wang, Chaoxue [1 ]
Shi, Tengteng [1 ]
Han, Danni [1 ]
Affiliation
[1] Xi'an University of Architecture and Technology, School of Information and Control Engineering, Xi'an 710055, People's Republic of China
Source
APPLIED SCIENCES-BASEL | 2023, Vol. 13, Issue 07
Funding
National Natural Science Foundation of China;
Keywords
adaptive; convolutional neural networks; Gaussian mutation; hyperparameter optimization; particle swarm optimization algorithm; ALGORITHM; SEARCH;
DOI
10.3390/app13074254
Chinese Library Classification number
O6 [Chemistry];
Discipline classification code
0703;
Abstract
The configuration of the hyperparameters in convolutional neural networks (CNNs) is crucial for determining their performance. However, traditional methods for hyperparameter configuration, such as grid search and random search, are time-consuming and labor-intensive. The optimization of CNN hyperparameters is a complex problem involving multiple local optima, which poses a challenge for traditional particle swarm optimization (PSO) algorithms: they are prone to getting stuck in local optima and achieving suboptimal results. To address these issues, we proposed an adaptive dimensional Gaussian mutation PSO (ADGMPSO) to efficiently select the optimal hyperparameter configurations. The ADGMPSO algorithm utilized a cat chaos initialization strategy to generate an initial population with a more uniform distribution. It combined sine-based inertia weights and an asynchronous change learning factor strategy to balance the global exploration and local exploitation capabilities. Finally, an elite particle adaptive dimensional Gaussian mutation strategy was proposed to improve the population diversity and convergence accuracy at the different stages of evolution. The performance of the proposed algorithm was compared to that of five other evolutionary algorithms (PSO, BOA, WOA, SSA, and GWO) on ten benchmark test functions, and the results demonstrated the superiority of the proposed algorithm in terms of the optimal value, mean value, and standard deviation. The ADGMPSO algorithm was then applied to hyperparameter optimization for the LeNet-5 and ResNet-18 network models. The results on the MNIST and CIFAR10 datasets showed that the proposed algorithm achieved higher accuracy and generalization ability than other optimization algorithms, such as PSO-CNN, LDWPSO-CNN, and GA-CNN.
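The abstract names four ingredients (cat chaos initialization, sine-based inertia weights, asynchronously changing learning factors, and Gaussian mutation of one dimension of the elite particle) but gives no formulas. The following is a minimal Python sketch of how such a PSO variant might be assembled. Every formula and constant in it is an assumption chosen for illustration and is not taken from the paper: the Arnold cat map as the chaos source, the schedules for `w`, `c1`, and `c2`, the mutation scale `sigma`, and the velocity clamp. The function names `cat_map_init` and `adgmpso_sketch` are hypothetical.

```python
import numpy as np

def cat_map_init(n_particles, dim, lower, upper, rng):
    """Fill the swarm from an Arnold cat map sequence (assumed chaos source)."""
    x, y = rng.random(), rng.random()
    pts = np.empty((n_particles, dim))
    for i in range(n_particles):
        for d in range(dim):
            # Cat map: (x, y) -> (x + y, x + 2y) mod 1; both use the old x, y.
            x, y = (x + y) % 1.0, (x + 2.0 * y) % 1.0
            pts[i, d] = x
    return lower + pts * (upper - lower)

def adgmpso_sketch(f, dim, lower, upper, n_particles=30, iters=300, seed=0):
    """Minimize f over [lower, upper]^dim; returns (best_position, best_value)."""
    rng = np.random.default_rng(seed)
    pos = cat_map_init(n_particles, dim, lower, upper, rng)
    vel = np.zeros_like(pos)
    v_max = 0.2 * (upper - lower)           # assumed velocity clamp
    pbest = pos.copy()
    pbest_val = np.array([f(p) for p in pos])
    g = int(np.argmin(pbest_val))
    gbest, gbest_val = pbest[g].copy(), float(pbest_val[g])

    for t in range(iters):
        frac = t / iters
        # Assumed sine-based inertia weight, decreasing from 0.9 to about 0.4.
        w = 0.4 + 0.5 * np.sin(0.5 * np.pi * (1.0 - frac))
        # Assumed asynchronous learning factors: c1 shrinks while c2 grows.
        c1 = 2.5 - 2.0 * frac
        c2 = 0.5 + 2.0 * frac

        r1 = rng.random(pos.shape)
        r2 = rng.random(pos.shape)
        vel = w * vel + c1 * r1 * (pbest - pos) + c2 * r2 * (gbest - pos)
        vel = np.clip(vel, -v_max, v_max)
        pos = np.clip(pos + vel, lower, upper)

        vals = np.array([f(p) for p in pos])
        improved = vals < pbest_val
        pbest[improved] = pos[improved]
        pbest_val[improved] = vals[improved]
        g = int(np.argmin(pbest_val))
        if pbest_val[g] < gbest_val:
            gbest, gbest_val = pbest[g].copy(), float(pbest_val[g])

        # Elite mutation: perturb one random dimension of the global best with
        # Gaussian noise whose scale shrinks over time; keep it only if better.
        d = int(rng.integers(dim))
        sigma = 0.1 * (upper - lower) * (1.0 - frac)
        candidate = gbest.copy()
        candidate[d] = np.clip(candidate[d] + rng.normal(0.0, sigma), lower, upper)
        cand_val = float(f(candidate))
        if cand_val < gbest_val:
            gbest, gbest_val = candidate, cand_val

    return gbest, gbest_val

# Usage on a toy objective (sphere function), not a CNN.
best_x, best_val = adgmpso_sketch(lambda z: float(np.sum(z ** 2)),
                                  dim=5, lower=-5.0, upper=5.0)
print(best_x, best_val)
```

For hyperparameter search, `f` would instead decode a particle position into a configuration (for example learning rate or kernel count), train the network, and return a validation loss. That decoding step is not described in the abstract and is left out here.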
Pages: 21