A General-Purpose Neural Architecture Search Algorithm for Building Deep Neural Networks

被引:0
|
作者
Zito, Francesco [1 ]
Cutello, Vincenzo [1 ]
Pavone, Mario [1 ]
机构
[1] Univ Catania, Dept Math & Comp Sci, Vle Andrea Doria 6, I-95125 Catania, Italy
来源
关键词
Automated Machine Learning; Neural Architecture Search; Hyperparameter Optimization; Deep Neural Network; Metaheuristic;
D O I
10.1007/978-3-031-62922-8_9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the increasing availability of data and the development of powerful algorithms, deep neural networks have become an essential tool for all sectors. However, it can be challenging to automate the process of building and tuning them, due to the rapid growth of data and their complexity. The demand for handling large amounts of data has led to an increasing number of hidden layers and hyperparameters. A framework or methodology to design the architecture of deep neural networks will be crucial in the future, as it could significantly speed up the process of using deep learning models. We present here a first attempt to create an algorithm that combines aspects of Neural Architecture Search and Hyperparameter Optimization to build and optimize a neural network architecture. The particularity of our algorithm is that it is able to learn how to link neural layers of different types to create increasingly performant neural network architectures. We conducted experiments on four different tasks, including regression, binary and multi-classification, and forecasting, to compare our algorithm with common machine learning models.
引用
收藏
页码:126 / 141
页数:16
相关论文
共 50 条
  • [1] A General-Purpose Transferable Predictor for Neural Architecture Search
    Han, Fred X.
    Mills, Keith G.
    Chudak, Fabian
    Riahi, Parsa
    Salameh, Mohammad
    Zhang, Jialin
    Lul, Wei
    Jui, Shangling
    Niu, Di
    PROCEEDINGS OF THE 2023 SIAM INTERNATIONAL CONFERENCE ON DATA MINING, SDM, 2023, : 721 - 729
  • [2] A GENERAL-PURPOSE DIGITAL ARCHITECTURE FOR NEURAL NETWORK SIMULATIONS
    DURANTON, M
    MAUDUIT, N
    FIRST IEE INTERNATIONAL CONFERENCE ON ARTIFICIAL NEURAL NETWORKS, 1989, : 62 - 66
  • [3] A Unified FPGA Virtualization Framework for General-Purpose Deep Neural Networks in the Cloud
    Zeng, Shulin
    Dai, Guohao
    Sun, Hanbo
    Liu, Jun
    Li, Shiyao
    Ge, Guangjun
    Zhong, Kai
    Guo, Kaiyuan
    Wang, Yu
    Yang, Huazhong
    ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2022, 15 (03)
  • [4] DESIGN OF A GENERAL-PURPOSE MIMO PREDICTOR WITH NEURAL NETWORKS
    CUI, XZ
    SHIN, KG
    JOURNAL OF INTELLIGENT MATERIAL SYSTEMS AND STRUCTURES, 1994, 5 (02) : 198 - 210
  • [5] SPARCE: Sparsity Aware General-Purpose Core Extensions to Accelerate Deep Neural Networks
    Sen, Sanchari
    Jain, Shubham
    Venkataramani, Swagath
    Raghunathan, Anand
    IEEE TRANSACTIONS ON COMPUTERS, 2019, 68 (06) : 912 - 925
  • [6] General-purpose neural network mapping scheduling genetic algorithm
    Jisuanji Yanjiu yu Fazhan, 11 (872-876):
  • [7] Tanji: a General-purpose Neural Network Accelerator with Unified Crossbar Architecture
    Zhu, Haozhe
    Wang, Yu
    Shi, C. -J. Richard
    IEEE DESIGN & TEST, 2020, 37 (01) : 56 - 63
  • [8] Data-driven simulation for general-purpose multibody dynamics using Deep Neural Networks
    Hee-Sun Choi
    Junmo An
    Seongji Han
    Jin-Gyun Kim
    Jae-Yoon Jung
    Juhwan Choi
    Grzegorz Orzechowski
    Aki Mikkola
    Jin Hwan Choi
    Multibody System Dynamics, 2021, 51 : 419 - 454
  • [9] Data-driven simulation for general-purpose multibody dynamics using Deep Neural Networks
    Choi, Hee-Sun
    An, Junmo
    Han, Seongji
    Kim, Jin-Gyun
    Jung, Jae-Yoon
    Choi, Juhwan
    Orzechowski, Grzegorz
    Mikkola, Aki
    Choi, Jin Hwan
    MULTIBODY SYSTEM DYNAMICS, 2021, 51 (04) : 419 - 454
  • [10] Efficient Architecture Search for Deep Neural Networks
    Gottapu, Ram Deepak
    Dagli, Cihan H.
    COMPLEX ADAPTIVE SYSTEMS, 2020, 168 : 19 - 25