A genetic mixed-integer optimization of neural network hyper-parameters

Cited by: 2
Authors
Spurlock, Kyle [1]
Elgazzar, Heba [1]
Affiliations
[1] Morehead State Univ, Sch Engn & Comp Sci, 150 Univ Blvd, Morehead, KY 40351 USA
Source
JOURNAL OF SUPERCOMPUTING | 2022 / Vol. 78 / Issue 12
Keywords
Genetic algorithm; Deep learning; Mixed-integer optimization; Neural architecture search; ALGORITHM;
DOI
10.1007/s11227-022-04475-7
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Neural networks (NN) have become immensely popular for their effectiveness and flexibility in learning complicated patterns. Despite this success, they are often considered difficult to design because of the wide variety of parameters they require. Determining the optimal selection of parameters has become tedious and costly, and neural architecture search (NAS) methods have been employed to try to take the guesswork out of the process. A common NAS approach is the genetic algorithm (GA); however, its usage is often tied exclusively to either the learnable parameters or the meta-parameters that augment the learning. This work proposes an experimental approach for optimizing both real-valued weights and discrete meta-parameters simultaneously. Experimental results show that the current approach evolves both parameter sets effectively for simple problems like Iris, but still struggles to find an optimal model for more rigorous problems.
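As a rough illustration of what evolving real-valued weights and a discrete meta-parameter in one chromosome can look like, the following is a minimal sketch. It is not the authors' implementation: the XOR toy data (a stand-in for a dataset such as Iris), the single integer gene for the hidden-layer width, the tournament selection, uniform crossover, Gaussian mutation, elitism, and every name and constant in the code are assumptions chosen for brevity.

```python
import math
import random

# Toy XOR data; an assumed stand-in for a real dataset such as Iris.
DATA = [((0.0, 0.0), 0.0), ((0.0, 1.0), 1.0), ((1.0, 0.0), 1.0), ((1.0, 1.0), 0.0)]
MAX_HIDDEN = 4  # assumed upper bound for the integer gene (hidden units)
# Per hidden unit: 2 input weights + 1 bias; output layer: MAX_HIDDEN weights + 1 bias.
N_WEIGHTS = 3 * MAX_HIDDEN + MAX_HIDDEN + 1


def predict(hidden, weights, x):
    """Forward pass of a 2-input network that uses only the first `hidden` units."""
    total = weights[-1]  # output bias
    for h in range(hidden):
        w1, w2, b = weights[3 * h: 3 * h + 3]
        total += weights[3 * MAX_HIDDEN + h] * math.tanh(w1 * x[0] + w2 * x[1] + b)
    total = max(-60.0, min(60.0, total))  # clamp to keep exp() well-behaved
    return 1.0 / (1.0 + math.exp(-total))


def fitness(ind):
    """Negative mean squared error, so larger is better."""
    hidden, weights = ind
    return -sum((predict(hidden, weights, x) - y) ** 2 for x, y in DATA) / len(DATA)


def random_individual(rng):
    """Chromosome = (integer gene, list of real-valued genes)."""
    return (rng.randint(1, MAX_HIDDEN), [rng.uniform(-1.0, 1.0) for _ in range(N_WEIGHTS)])


def tournament(pop, rng, k=3):
    """Pick the fittest of k randomly sampled individuals."""
    return max(rng.sample(pop, k), key=fitness)


def crossover(a, b, rng):
    """Uniform crossover applied gene-by-gene to both gene types."""
    return (rng.choice((a[0], b[0])), [rng.choice(pair) for pair in zip(a[1], b[1])])


def mutate(ind, rng, rate=0.1, sigma=0.3):
    """Integer gene takes a bounded +/-1 step; real genes get Gaussian noise."""
    hidden, weights = ind
    if rng.random() < rate:
        hidden = min(MAX_HIDDEN, max(1, hidden + rng.choice((-1, 1))))
    weights = [w + rng.gauss(0.0, sigma) if rng.random() < rate else w for w in weights]
    return (hidden, weights)


def evolve(pop_size=30, generations=40, seed=0):
    """Run the GA with one-individual elitism; return the best individual and fitness history."""
    rng = random.Random(seed)
    pop = [random_individual(rng) for _ in range(pop_size)]
    history = []
    for _ in range(generations):
        elite = max(pop, key=fitness)
        history.append(fitness(elite))
        children = [
            mutate(crossover(tournament(pop, rng), tournament(pop, rng), rng), rng)
            for _ in range(pop_size - 1)
        ]
        pop = [elite] + children  # the elite survives unchanged
    best = max(pop, key=fitness)
    history.append(fitness(best))
    return best, history


if __name__ == "__main__":
    best, history = evolve()
    print("hidden units:", best[0], "final MSE:", -history[-1])
```

Because the elite is copied unchanged into each new generation, the best fitness in this sketch can never decrease. Keeping the weight vector at a fixed maximum length and masking unused units is one simple way to let the integer gene change the effective architecture without resizing the chromosome; other encodings are equally plausible.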
Pages: 14680 - 14702
Page count: 23
Related papers
50 in total
  • [21] Global mixed-integer dynamic optimization
    Chachuat, B
    Singer, AB
    Barton, PI
    AICHE JOURNAL, 2005, 51 (08) : 2235 - 2253
  • [22] Mixed-Integer Optimization with Constraint Learning
    Maragno, Donato
    Wiberg, Holly
    Bertsimas, Dimitris
    Birbil, S. Ilker
    den Hertog, Dick
    Fajemisin, Adejuyigbe O.
    OPERATIONS RESEARCH, 2023,
  • [23] Network Formulations of Mixed-Integer Programs
    Conforti, Michele
    Di Summa, Marco
    Eisenbrand, Friedrich
    Wolsey, Laurence A.
    MATHEMATICS OF OPERATIONS RESEARCH, 2009, 34 (01) : 194 - 209
  • [24] Stability Verification of Neural Network Controllers Using Mixed-Integer Programming
    Schwan, Roland
    Jones, Colin N.
    Kuhn, Daniel
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (12) : 7514 - 7529
  • [25] Online Mixed-Integer Optimization in Milliseconds
    Bertsimas, Dimitris
    Stellato, Bartolomeo
    INFORMS JOURNAL ON COMPUTING, 2022, 34 (04) : 2229 - 2248
  • [26] A coupled gradient network approach for static and temporal mixed-integer optimization
    Watta, PB
    Hassoun, MH
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 1996, 7 (03) : 578 - 593
  • [27] Decoupling Network Optimization in High Speed Systems by Mixed-Integer Programming
    Tripathi, Jai Narayan
    Mahajan, Ashutosh
    Mukherjee, Jayanta
    Nagpal, Raj Kumar
    Malik, Rakesh
    Gupta, Nitin
    2014 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2014, : 1010 - 1013
  • [28] A Study of Features and Deep Neural Network Architectures and Hyper-Parameters for Domestic Audio Classification
    Copiaco, Abigail
    Ritz, Christian
    Abdulaziz, Nidhal
    Fasciani, Stefano
    APPLIED SCIENCES-BASEL, 2021, 11 (11):
  • [29] Automatically Avoiding Overfitting in Deep Neural Networks by Using Hyper-Parameters Optimization Methods
    Kadhim, Zahraa Saddi
    Abdullah, Hasanen S.
    Ghathwan, Khalil I.
    INTERNATIONAL JOURNAL OF ONLINE AND BIOMEDICAL ENGINEERING, 2023, 19 (05) : 146 - 162
  • [30] AUTOMATED OPTIMIZATION OF DECODER HYPER-PARAMETERS FOR ONLINE LVCSR
    Chandrashekaran, Akshay
    Lane, Ian
    2016 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2016), 2016, : 454 - 460