A data-driven approach to neural architecture search initialization

被引:0
|
作者
Traore, Kalifou Rene [1 ,2 ]
Camero, Andres [2 ,3 ]
Zhu, Xiao Xiang [1 ,2 ]
机构
[1] Tech Univ Munich, Data Sci Earth Observat, Arcisstr 21, D-80333 Munich, Bavaria, Germany
[2] German Aerosp Ctr DLR, Remote Sensing Inst, Munchener Str 20, D-82234 Wessling, Bavaria, Germany
[3] Helmholtz AI, Munich, Germany
基金
欧洲研究理事会;
关键词
AutoML; Neural architecture search; Evolutionary computation; Search; Initialization; 68Txx; NETWORKS; POPULATION;
D O I
10.1007/s10472-022-09823-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Algorithmic design in neural architecture search (NAS) has received a lot of attention, aiming to improve performance and reduce computational cost. Despite the great advances made, few authors have proposed to tailor initialization techniques for NAS. However, the literature shows that a good initial set of solutions facilitates finding the optima. Therefore, in this study, we propose a data-driven technique to initialize a population-based NAS algorithm. First, we perform a calibrated clustering analysis of the search space, and second, we extract the centroids and use them to initialize a NAS algorithm. We benchmark our proposed approach against random and Latin hypercube sampling initialization using three population-based algorithms, namely a genetic algorithm, an evolutionary algorithm, and aging evolution, on CIFAR-10. More specifically, we use NAS-Bench-101 to leverage the availability of NAS benchmarks. The results show that compared to random and Latin hypercube sampling, the proposed initialization technique enables achieving significant long-term improvements for two of the search baselines, and sometimes in various search scenarios (various training budget). Besides, we also investigate how an initial population gathered on the tabular benchmark can be used for improving search on another dataset, the So2Sat LCZ-42. Our results show similar improvements on the target dataset, despite a limited training budget. Moreover, we analyse the distributions of solutions obtained and find that that the population provided by the data-driven initialization technique enables retrieving local optima (maxima) of high fitness and similar configurations.
引用
收藏
页数:28
相关论文
共 50 条
  • [1] Data-Driven Initialization of SParSE
    Roh, Min K.
    Proctor, Joshua L.
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON NUMERICAL ANALYSIS AND APPLIED MATHEMATICS 2016 (ICNAAM-2016), 2017, 1863
  • [2] Data-driven initialization and structure learning in fuzzy neural networks
    Setnes, M
    Koene, A
    Babuska, R
    Bruijn, PM
    [J]. 1998 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AT THE IEEE WORLD CONGRESS ON COMPUTATIONAL INTELLIGENCE - PROCEEDINGS, VOL 1-2, 1998, : 1147 - 1152
  • [3] A data-driven neural network architecture for sentiment analysis
    Cano, Erion
    Morisio, Maurizio
    [J]. DATA TECHNOLOGIES AND APPLICATIONS, 2019, 53 (01) : 2 - 19
  • [4] A Data-Driven Architecture for Sensor Validation Based on Neural Networks
    Darvishi, Hossein
    Ciuonzo, Domenico
    Eide, Eivind Roson
    Rossi, Pierluigi Salvo
    [J]. 2020 IEEE SENSORS, 2020,
  • [5] Nucleus Neural Network: A Data-driven Self-organized Architecture
    Liu, Jia
    He, Haibo
    Gong, Maoguo
    Zhang, Wenhua
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [6] DeepSafe: A Data-Driven Approach for Assessing Robustness of Neural Networks
    Gopinath, Divya
    Katz, Guy
    Pasareanu, Corina S.
    Barrett, Clark
    [J]. AUTOMATED TECHNOLOGY FOR VERIFICATION AND ANALYSIS (ATVA 2018), 2018, 11138 : 3 - 19
  • [7] A data-driven neural network approach to simulate pedestrian movement
    Song, Xiao
    Han, Daolin
    Sun, Jinghan
    Zhang, Zenghui
    [J]. PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2018, 509 : 827 - 844
  • [8] An Identification Approach for the Data-Driven SIR in the PnP Monitoring and Control Architecture
    Luo, Hao
    Liu, Tianyu
    Yin, Shen
    Kaynak, Okyay
    [J]. IECON 2018 - 44TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, 2018, : 5359 - 5364
  • [9] Creating a data-driven tool architecture
    Bourget, Larry
    Faulkner, David
    [J]. SOLID STATE TECHNOLOGY, 2009, 52 (06) : 32 - 32
  • [10] Architecture of the multichannel data-driven ASIC
    Normanov, D. D.
    Atkin, E. V.
    [J]. INTERNATIONAL CONFERENCE ON PARTICLE PHYSICS AND ASTROPHYSICS (ICPPA-2015), PTS 1-4, 2016, 675