Hyper-Parameter Selection in Convolutional Neural Networks Using Microcanonical Optimization Algorithm

Cited by: 27
Authors
Gulcu, Ayla [1]
Kus, Zeki [1]
Affiliations
[1] Fatih Sultan Mehmet Univ, Dept Comp Sci, TR-34445 Istanbul, Turkey
Keywords
Convolutional neural networks; hyper-parameter optimization; microcanonical optimization; tree Parzen estimator; DROPOUT; SEARCH;
DOI
10.1109/ACCESS.2020.2981141
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
The success of Convolutional Neural Networks is highly dependent on the selected architecture and hyper-parameters. Automatic network design is especially important for complex architectures, where the parameter space is so large that trying all possible combinations is computationally infeasible. In this study, the Microcanonical Optimization algorithm, a variant of the Simulated Annealing method, is used for hyper-parameter optimization and architecture selection for Convolutional Neural Networks. To the best of our knowledge, our study provides a first attempt at applying Microcanonical Optimization to this task. The networks generated by the proposed method are compared to the networks generated by the Simulated Annealing method in terms of both accuracy and size using six widely used image recognition datasets. Moreover, a performance comparison with the Tree Parzen Estimator, a Bayesian optimization-based approach, is also presented. It is shown that the proposed method is able to achieve classification results competitive with the state-of-the-art architectures. When the size of the networks is also taken into account, the networks generated by the Microcanonical Optimization method contain far fewer parameters than the state-of-the-art architectures. Therefore, the proposed method can be preferred for automatically tuning networks, especially in situations where fast training is as important as accuracy.
Pages: 52528-52540
Page count: 13
Related papers
50 in total
  • [41] Cascaded Algorithm-Selection and Hyper-Parameter Optimization with Extreme-Region Upper Confidence Bound Bandit
    Hu, Yi-Qi
    Yu, Yang
    Liao, Jun-Da
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 2528 - 2534
  • [42] Modified Grid Searches for Hyper-Parameter Optimization
    Lopez, David
    Alaiz, Carlos M.
    Dorronsoro, Jose R.
    HYBRID ARTIFICIAL INTELLIGENT SYSTEMS, HAIS 2020, 2020, 12344 : 221 - 232
  • [43] Hybrid Hyper-parameter Optimization for Collaborative Filtering
    Szabo, Peter
    Genge, Bela
    2020 22ND INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING (SYNASC 2020), 2020, : 210 - 217
  • [44] Classification complexity assessment for hyper-parameter optimization
    Cai, Ziyun
    Long, Yang
    Shao, Ling
    PATTERN RECOGNITION LETTERS, 2019, 125 : 396 - 403
  • [45] Hippo: Sharing Computations in Hyper-Parameter Optimization
    Shin, Ahnjae
    Jeong, Joo Seong
    Kim, Do Yoon
    Jung, Soyoung
    Chun, Byung-Gon
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2022, 15 (05): : 1038 - 1052
  • [46] A New Baseline for Automated Hyper-Parameter Optimization
    Geitle, Marius
    Olsson, Roland
    MACHINE LEARNING, OPTIMIZATION, AND DATA SCIENCE, 2019, 11943 : 521 - 530
  • [47] HYPER-PARAMETER SELECTION ON CONVOLUTIONAL DICTIONARY LEARNING THROUGH LOCAL l0,∞ NORM
    Silva, Gustavo
    Quesada, Jorge
    Rodriguez, Paul
    2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,
  • [48] The Impact of Hyper-Parameter Tuning for Landscape-Aware Performance Regression and Algorithm Selection
    Jankovic, Anja
    Popovski, Gorjan
    Eftimov, Tome
    Doerr, Carola
    PROCEEDINGS OF THE 2021 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'21), 2021, : 687 - 696
  • [49] APPLICATION OF A HYPER-PARAMETER OPTIMIZATION ALGORITHM USING MARS SURROGATE FOR DEEP POLSAR IMAGE CLASSIFICATION MODELS
    Liu, Guangyuan
    Li, Yangyang
    Jiao, Licheng
    IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 2591 - 2594
  • [50] Techniques for regularization parameter and hyper-parameter selection in PET and SPECT imaging
    Bardsley, Johnathan M.
    Goldes, John
    INVERSE PROBLEMS IN SCIENCE AND ENGINEERING, 2011, 19 (02) : 267 - 280