Automated Search for Configurations of Convolutional Neural Network Architectures

被引:5
|
作者
Ghamizi, Salah [1 ]
Cordy, Maxime [1 ]
Papadakis, Mike [1 ]
Le Traon, Yves [1 ]
机构
[1] Univ Luxembourg, SnT, Luxembourg, Luxembourg
关键词
Feature model; configuration search; NAS; Neural Architecture Search; AutoML;
D O I
10.1145/3336294.3336306
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Convolutional Neural Networks (CNNs) are intensively used to solve a wide variety of complex problems. Although powerful, such systems require manual configuration and tuning. To this end, we view CNNs as configurable systems and propose an end-to-end framework that allows the configuration, evaluation and automated search for CNN architectures. Therefore, our contribution is three-fold. First, we model the variability of CNN architectures with a Feature Model (FM) that generalizes over existing architectures. Each valid configuration of the FM corresponds to a valid CNN model that can be built and trained. Second, we implement, on top of Tensorflow, an automated procedure to deploy, train and evaluate the performance of a configured model. Third, we propose a method to search for configurations and demonstrate that it leads to good CNN models. We evaluate our method by applying it on image classification tasks (MNIST, CIFAR-10) and show that, with limited amount of computation and training, our method can identify high-performing architectures (with high accuracy). We also demonstrate that we outperform existing state-of-the-art architectures handcrafted by ML researchers. Our FM and framework have been released to support replication and future research.
引用
收藏
页码:119 / 130
页数:12
相关论文
共 50 条
  • [1] Convolutional Neural Network Architectures for the Automated Diagnosis of Celiac Disease
    Wimmer, G.
    Hegenbart, S.
    Vecsei, A.
    Uhl, A.
    [J]. COMPUTER-ASSISTED AND ROBOTIC ENDOSCOPY, 2017, 10170 : 104 - 113
  • [2] Optimizing Convolutional Neural Network Architectures
    Balderas, Luis
    Lastra, Miguel
    Benitez, Jose M.
    [J]. MATHEMATICS, 2024, 12 (19)
  • [3] Design of deep convolutional neural network architectures for automated feature extraction in industrial inspection
    Weimer, Daniel
    Scholz-Reiter, Bernd
    Shpitalni, Moshe
    [J]. CIRP ANNALS-MANUFACTURING TECHNOLOGY, 2016, 65 (01) : 417 - 420
  • [4] Modified symbiotic organisms search optimization for automatic construction of convolutional neural network architectures
    Jauro, Fatsuma
    Ya'u Gital, Abdulsalam
    Abdullahi, Usman Ali
    Abdulsalami, Aminu Onimisi
    Abdullahi, Mohammed
    Ibrahim, Adamu Abubakar
    Chiroma, Haruna
    [J]. Intelligent Systems with Applications, 2024, 22
  • [5] A Systematic Search over Deep Convolutional Neural Network Architectures for Screening Chest Radiographs
    Mitra, Arka
    Chakravarty, Arunava
    Ghosh, Nirmalya
    Sarkar, Tandra
    Sethuraman, Ramanathan
    Sheet, Debdoot
    [J]. 42ND ANNUAL INTERNATIONAL CONFERENCES OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY: ENABLING INNOVATIVE TECHNOLOGIES FOR GLOBAL HEALTHCARE EMBC'20, 2020, : 1225 - 1228
  • [6] Bandwidth Efficient Architectures for Convolutional Neural Network
    Wang, Jichen
    Lin, Jun
    Wang, Zhongfeng
    [J]. PROCEEDINGS OF THE 2018 IEEE INTERNATIONAL WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS), 2018, : 94 - 99
  • [7] A review of convolutional neural network architectures and their optimizations
    Cong, Shuang
    Zhou, Yang
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (03) : 1905 - 1969
  • [8] Efficient Convolution Architectures for Convolutional Neural Network
    Wang, Jichen
    Lin, Jun
    Wang, Zhongfeng
    [J]. 2016 8TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS & SIGNAL PROCESSING (WCSP), 2016,
  • [9] A review of convolutional neural network architectures and their optimizations
    Shuang Cong
    Yang Zhou
    [J]. Artificial Intelligence Review, 2023, 56 : 1905 - 1969
  • [10] AUTOMATED HARDENING OF DEEP NEURAL NETWORK ARCHITECTURES
    Beyer, Michael
    Schorn, Christoph
    Fabarisov, Tagir
    Morozov, Andrey
    Janschek, Klaus
    [J]. PROCEEDINGS OF ASME 2021 INTERNATIONAL MECHANICAL ENGINEERING CONGRESS AND EXPOSITION (IMECE2021), VOL 13, 2021,