Convergence Analysis of PSO for Hyper-Parameter Selection in Deep Neural Networks

被引:5
|
作者
Nalepa, Jakub [1 ,2 ]
Lorenzo, Pablo Ribalta [1 ]
机构
[1] Future Proc, Gliwice, Poland
[2] Silesian Tech Univ, Gliwice, Poland
关键词
Convergence analysis; PSO; Hyper; parameter selection; DNNs;
D O I
10.1007/978-3-319-69835-9_27
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep Neural Networks (DNNs) have gained enormous research attention since they consistently outperform other state-of-the-art methods in a plethora of machine learning tasks. However, their performance strongly depends on the DNN hyper-parameters which are commonly tuned by experienced practitioners. Recently, we introduced Particle Swarm Optimization (PSO) and parallel PSO techniques to automate this process. In this work, we theoretically and experimentally investigate the convergence capabilities of these algorithms. The experiments were performed for several DNN architectures (both gradually augmented and hand-crafted by a human) using two challenging multi-class benchmark datasets-MNIST and CIFAR-10.
引用
收藏
页码:284 / 295
页数:12
相关论文
共 50 条
  • [21] A New Hyper-Parameter Optimization Method for Power Load Forecast Based on Recurrent Neural Networks
    Li, Yaru
    Zhang, Yulai
    Cai, Yongping
    ALGORITHMS, 2021, 14 (06)
  • [22] A Novel Method Based on Line-Segment Visualizations for Hyper-Parameter Optimization in Deep Networks
    Tang, Xue-song
    Ding, Yongsheng
    Hao, Kuangrong
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2018, 32 (03)
  • [23] Physics-informed recurrent neural networks and hyper-parameter optimization for dynamic process systems
    Asrav, Tuse
    Aydin, Erdal
    COMPUTERS & CHEMICAL ENGINEERING, 2023, 173
  • [24] ParDen: Surrogate Assisted Hyper-Parameter Optimisation for Portfolio Selection
    van Zyl, T. L.
    Woolway, M.
    Paskaramoorthy, A.
    2021 8TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING & MACHINE INTELLIGENCE (ISCMI 2021), 2021, : 101 - 107
  • [25] Deep Learning Hyper-Parameter Optimization for Video Analytics in Clouds
    Yaseen, Muhammad Usman
    Anjum, Ashiq
    Rana, Omer
    Antonopoulos, Nikolaos
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2019, 49 (01): : 253 - 264
  • [26] Deep Learning Hyper-parameter Tuning for Sentiment Analysis in Twitter based on Evolutionary Algorithms
    Rodriguez-Barroso, Nuria
    Moya, Antonio R.
    Fernandez, Jose A.
    Romero, Elena
    Martinez-Camara, Eugenio
    Herrera, Francisco
    PROCEEDINGS OF THE 2019 FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (FEDCSIS), 2019, : 255 - 264
  • [27] Hyper-parameter Selection in Advanced Synthetic Aperture Radar Imaging Algorithms
    Batu, Oezge
    Cetin, Muejdat
    2008 IEEE 16TH SIGNAL PROCESSING, COMMUNICATION AND APPLICATIONS CONFERENCE, VOLS 1 AND 2, 2008, : 493 - 496
  • [28] Hyper-parameter Search in Support Vector Machines using PSO with Cellular Fitness Approximation
    Yamada, Shinichi
    Neshatian, Kourosh
    2017 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2017, : 748 - 755
  • [29] Incremental Trainable Parameter Selection in Deep Neural Networks
    Thakur, Anshul
    Abrol, Vinayak
    Sharma, Pulkit
    Zhu, Tingting
    Clifton, David A.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (05) : 6478 - 6491
  • [30] Hyper-Parameter Optimization by Using the Genetic Algorithm for Upper Limb Activities Recognition Based on Neural Networks
    Zhang, Junjie
    Sun, Guangmin
    Sun, Yuge
    Dou, Huijing
    Bilal, Anas
    IEEE SENSORS JOURNAL, 2021, 21 (02) : 1877 - 1884