Automatic Configuration of Deep Neural Networks with Parallel Efficient Global Optimization

被引:13
|
作者
van Stein, Bas [1 ]
Wang, Hao [1 ]
Back, Thomas [1 ]
机构
[1] Leiden Univ, LIACS, Leiden, Netherlands
关键词
Deep Learning; Network Architectures; Bayesian Optimization; Optimization;
D O I
10.1109/ijcnn.2019.8851720
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Designing the architecture for an artificial neural network is a cumbersome task because of the numerous parameters to configure, including activation functions, layer types, and hyper-parameters. With the large number of parameters for most networks nowadays, it is intractable to find a good configuration for a given task by hand. In this paper the Mixed Integer Parallel Efficient Global Optimization (MIP-EGO) algorithm is proposed to automatically configure convolutional neural network architectures. It is shown that on several image classification tasks this approach is able to find competitive network architectures in terms of prediction accuracy, compared to the best hand-crafted ones in literature, when using only a fraction of the number of training epochs. Moreover, instead of the standard sequential evaluation in EGO, several candidate architectures are proposed and evaluated in parallel, which reduces the execution overhead significantly and leads to an efficient automation for deep neural network design.
引用
收藏
页数:7
相关论文
共 50 条
  • [31] Efficient Joint Optimization of Layer-Adaptive Weight Pruning in Deep Neural Networks
    Xu, Kaixin
    Wang, Zhe
    Geng, Xue
    Wu, Min
    Li, Xiaoli
    Lin, Weisi
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 17401 - 17411
  • [32] Hyper-Parameter Selection in Deep Neural Networks Using Parallel Particle Swarm Optimization
    Lorenzo, Pablo Ribalta
    Nalepa, Jakub
    Sanchez Ramos, Luciano
    Ranilla Pastor, Jose
    [J]. PROCEEDINGS OF THE 2017 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION (GECCO'17 COMPANION), 2017, : 1864 - 1871
  • [33] Global forensic geolocation with deep neural networks
    Grantham, Neal S.
    Reich, Brian J.
    Laber, Eric B.
    Pacifici, Krishna
    Dunn, Robert R.
    Fierer, Noah
    Gebert, Matthew
    Allwood, Julia S.
    Faith, Seth A.
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2020, 69 (04) : 909 - 929
  • [34] Efficient optimization of deep neural quantum states
    Chen, Ao
    Heyl, Markus
    [J]. NATURE PHYSICS, 2024,
  • [35] A Portable, Automatic Data Quantizer for Deep Neural Networks
    Oh, Young H.
    Quan, Quan
    Kim, Daeyeon
    Kim, Seonghak
    Heo, Jun
    Jung, Sungjun
    Jang, Jaeyoung
    Lee, Jae W.
    [J]. 27TH INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES (PACT 2018), 2018,
  • [36] Automatic Photo Adjustment Using Deep Neural Networks
    Yan, Zhicheng
    Zhang, Hao
    Wang, Baoyuan
    Paris, Sylvain
    Yu, Yizhou
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2016, 35 (02):
  • [37] Automatic Document Summarization via Deep Neural Networks
    Yao, Chengwei
    Shen, Jianfen
    Chen, Gencai
    [J]. 2015 8TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 1, 2015, : 291 - 296
  • [38] AUTOMATIC LANGUAGE IDENTIFICATION USING DEEP NEURAL NETWORKS
    Lopez-Moreno, Ignacio
    Gonzalez-Dominguez, Javier
    Plchot, Oldrich
    Martinez, David
    Gonzalez-Rodriguez, Joaquin
    Moreno, Pedro
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [39] An Efficient Accelerator for Deep Convolutional Neural Networks
    Kuo, Yi-Xian
    Lai, Yeong-Kang
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN (ICCE-TAIWAN), 2020,
  • [40] On Runge-Kutta Neural Networks: Training in Series-Parallel and Parallel Configuration
    Deflorian, Michael
    [J]. 49TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2010, : 4480 - 4485