Adaptive Dimensional Gaussian Mutation of PSO-Optimized Convolutional Neural Network Hyperparameters

Cited by: 2
Authors
Wang, Chaoxue [1 ]
Shi, Tengteng [1 ]
Han, Danni [1 ]
Affiliations
[1] Xian Univ Architecture & Technol, Sch Informat & Control Engn, Xian 710055, Peoples R China
Source
APPLIED SCIENCES-BASEL | 2023 / Volume 13 / Issue 07
Funding
National Natural Science Foundation of China;
Keywords
adaptive; convolutional neural networks; Gaussian mutation; hyperparameter optimization; particle swarm optimization algorithm; ALGORITHM; SEARCH;
DOI
10.3390/app13074254
Chinese Library Classification
O6 [Chemistry];
Discipline Classification Code
0703;
Abstract
The configuration of the hyperparameters in convolutional neural networks (CNN) is crucial for determining their performance. However, traditional methods for hyperparameter configuration, such as grid searches and random searches, are time consuming and labor intensive. The optimization of CNN hyperparameters is a complex problem involving multiple local optima that poses a challenge for traditional particle swarm optimization (PSO) algorithms, which are prone to getting stuck in the local optima and achieving suboptimal results. To address the above issues, we proposed an adaptive dimensional Gaussian mutation PSO (ADGMPSO) to efficiently select the optimal hyperparameter configurations. The ADGMPSO algorithm utilized a cat chaos initialization strategy to generate an initial population with a more uniform distribution. It combined the sine-based inertia weights and an asynchronous change learning factor strategy to balance the global exploration and local exploitation capabilities. Finally, an elite particle adaptive dimensional Gaussian mutation strategy was proposed to improve the population diversity and convergence accuracy at the different stages of evolution. The performance of the proposed algorithm was compared to five other evolutionary algorithms, including PSO, BOA, WOA, SSA, and GWO, on ten benchmark test functions, and the results demonstrated the superiority of the proposed algorithm in terms of the optimal value, mean value, and standard deviation. The ADGMPSO algorithm was then applied to the hyperparameter optimization for the LeNet-5 and ResNet-18 network models. The results on the MNIST and CIFAR10 datasets showed that the proposed algorithm achieved a higher accuracy and generalization ability than the other optimization algorithms, such as PSO-CNN, LDWPSO-CNN, and GA-CNN.
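The abstract names the ingredients of ADGMPSO but gives no formulas, so the following is only a minimal sketch of how such a loop could be assembled, not the authors' implementation. Every concrete choice is an assumption made for illustration: the Arnold cat map as the chaotic initializer, the quarter-sine inertia schedule from 0.9 down to 0.4, the linear learning-factor ranges (c1 from 2.5 to 0.5, c2 from 0.5 to 2.5), the mutation width, and the rule for how many dimensions of the elite particle are mutated. The function names `cat_map_init`, `adgmpso_sketch`, and `sphere` are hypothetical.

```python
import math
import random


def cat_map_init(n_particles, dim, lower, upper, seed=0):
    """Fill a swarm from an Arnold cat map sequence (assumed form of the
    chaotic initialization), scaled to [lower, upper]."""
    rng = random.Random(seed)
    x, y = rng.random(), rng.random()
    swarm = []
    for _ in range(n_particles):
        position = []
        for _ in range(dim):
            # Cat map: x' = (x + y) mod 1, y' = (x + 2y) mod 1
            x, y = (x + y) % 1.0, (x + 2.0 * y) % 1.0
            position.append(lower + x * (upper - lower))
        swarm.append(position)
    return swarm


def adgmpso_sketch(f, dim, lower, upper, n_particles=20, iters=100, seed=0):
    """Minimize f over a box. All schedules below are illustrative assumptions."""
    rng = random.Random(seed)
    pos = cat_map_init(n_particles, dim, lower, upper, seed)
    vel = [[0.0] * dim for _ in range(n_particles)]
    pbest = [p[:] for p in pos]
    pbest_val = [f(p) for p in pos]
    g = min(range(n_particles), key=lambda i: pbest_val[i])
    gbest, gbest_val = pbest[g][:], pbest_val[g]

    for t in range(iters):
        frac = t / iters
        # Assumed sine-based inertia weight, decaying from 0.9 to about 0.4.
        w = 0.4 + 0.5 * math.sin(0.5 * math.pi * (1.0 - frac))
        # Assumed asynchronous learning factors: cognitive term shrinks,
        # social term grows.
        c1 = 2.5 - 2.0 * frac
        c2 = 0.5 + 2.0 * frac

        for i in range(n_particles):
            for d in range(dim):
                r1, r2 = rng.random(), rng.random()
                vel[i][d] = (w * vel[i][d]
                             + c1 * r1 * (pbest[i][d] - pos[i][d])
                             + c2 * r2 * (gbest[d] - pos[i][d]))
                pos[i][d] = min(upper, max(lower, pos[i][d] + vel[i][d]))
            val = f(pos[i])
            if val < pbest_val[i]:
                pbest[i], pbest_val[i] = pos[i][:], val
                if val < gbest_val:
                    gbest, gbest_val = pos[i][:], val

        # Assumed elite mutation: perturb k randomly chosen dimensions of the
        # global best with Gaussian noise; k and sigma both shrink over time.
        k = max(1, round(dim * (1.0 - frac)))
        sigma = 0.1 * (upper - lower) * (1.0 - frac)
        trial = gbest[:]
        for d in rng.sample(range(dim), k):
            trial[d] = min(upper, max(lower, trial[d] + rng.gauss(0.0, sigma)))
        trial_val = f(trial)
        if trial_val < gbest_val:  # greedy acceptance keeps the elite monotone
            gbest, gbest_val = trial, trial_val

    return gbest, gbest_val


def sphere(x):
    """Simple benchmark function with its minimum of 0 at the origin."""
    return sum(v * v for v in x)


best, best_val = adgmpso_sketch(sphere, dim=5, lower=-5.0, upper=5.0)
print(best_val)
```

For hyperparameter search, `f` would be replaced by a function that trains a network with the decoded hyperparameters and returns a validation loss; that decoding step is not described in the abstract and is left out here.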
Pages: 21