A Gradient-Guided Evolutionary Neural Architecture Search

Cited by: 5
Authors
Xue, Yu [1]
Han, Xiaolong [1]
Neri, Ferrante [2]
Qin, Jiafeng [1]
Pelusi, Danilo [3]
Affiliations
[1] Nanjing Univ Informat Sci & Technol, Sch Software, Nanjing 210044, Peoples R China
[2] Univ Surrey, Dept Comp Sci, Nat Inspired Comp & Engn Res Grp, Guildford GU2 7XH, England
[3] Univ Teramo, Fac Commun Sci, I-64100 Teramo, Italy
Funding
National Natural Science Foundation of China;
Keywords
Computer architecture; Microprocessors; Search problems; Couplings; Evolutionary computation; Encoding; Statistics; gradient optimization; image classification; neural architecture search (NAS);
DOI
10.1109/TNNLS.2024.3371432
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Neural architecture search (NAS) is a popular method for automatically designing deep neural network structures. However, designing a neural network using NAS is computationally expensive. This article proposes a gradient-guided evolutionary NAS (GENAS) to design convolutional neural networks (CNNs) for image classification. GENAS is a hybrid algorithm that combines evolutionary global and local search operators to evolve a population of subnets sampled from a supernet. Each candidate architecture is encoded as a table describing which operations are associated with the edges between nodes, where the nodes signify feature maps. In addition, the evolutionary optimization uses novel crossover and mutation operators to manipulate the subnets through the proposed tabular encoding. Every n generations, the candidate architectures undergo a local search inspired by differentiable NAS. GENAS is designed to overcome the limitations of both evolutionary and gradient descent NAS. This algorithmic structure enables the performance assessment of candidate architectures without retraining, thus limiting the NAS calculation time. Furthermore, subnet individuals are decoupled during evaluation to prevent strong coupling of operations in the supernet. The experimental results indicate that the searched structures achieve test errors of 2.45%, 16.86%, and 23.9% on the CIFAR-10, CIFAR-100, and ImageNet datasets, respectively, and that the search costs only 0.26 GPU days on a graphics card. GENAS can effectively expedite the training and evaluation processes and obtain high-performance network structures.
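The loop described in the abstract (a tabular edge-to-operation encoding, crossover and mutation on that table, and a local search every n generations) might be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation. The operation names in `OPS`, the node count, the uniform crossover, the resampling mutation, the truncation selection, and the `local_search` hook are all hypothetical stand-ins. The abstract does not specify the paper's novel operators or its gradient-based step, and a real fitness function would evaluate a subnet with weights inherited from the supernet rather than the toy score used here.

```python
import random

# Hypothetical operation set; the abstract does not list the candidate operations.
OPS = ["skip", "conv3x3", "conv5x5", "maxpool3x3"]
NUM_NODES = 4  # assumed number of feature-map nodes in a cell


def edges(num_nodes=NUM_NODES):
    """All (source, target) node pairs with source < target."""
    return [(i, j) for j in range(1, num_nodes) for i in range(j)]


def random_table(rng):
    """Encode an architecture as a table mapping each edge to an operation."""
    return {e: rng.choice(OPS) for e in edges()}


def crossover(a, b, rng):
    """Generic uniform crossover (an assumption): each edge copies one parent."""
    return {e: (a[e] if rng.random() < 0.5 else b[e]) for e in a}


def mutate(table, rng, rate=0.2):
    """Generic mutation (an assumption): resample an edge's op with prob. `rate`."""
    return {e: (rng.choice(OPS) if rng.random() < rate else op)
            for e, op in table.items()}


def evolve(fitness, generations=10, pop_size=8, n=3, local_search=None, seed=0):
    """Evolve tables; every `n` generations apply the optional local search."""
    rng = random.Random(seed)
    pop = [random_table(rng) for _ in range(pop_size)]
    for g in range(1, generations + 1):
        # Truncation selection: keep the better half as parents.
        parents = sorted(pop, key=fitness, reverse=True)[: pop_size // 2]
        children = []
        while len(parents) + len(children) < pop_size:
            p1, p2 = rng.sample(parents, 2)
            children.append(mutate(crossover(p1, p2, rng), rng))
        pop = parents + children
        if local_search is not None and g % n == 0:
            # Placeholder for the gradient-guided refinement step.
            pop = [local_search(t) for t in pop]
    return max(pop, key=fitness)


def toy_fitness(table):
    """Toy stand-in score: number of edges assigned 'conv3x3'."""
    return sum(op == "conv3x3" for op in table.values())


best = evolve(toy_fitness, local_search=lambda t: t)
```

Here `best` is a table such as `{(0, 1): "conv3x3", ...}` with one entry per edge. The identity `local_search` only marks where a differentiable-NAS-style refinement would plug in.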
Pages: 1-13
Number of pages: 13