A Gradient-Guided Evolutionary Neural Architecture Search

Cited by: 1
Authors
Xue, Yu [1]
Han, Xiaolong [1]
Neri, Ferrante [2]
Qin, Jiafeng [1]
Pelusi, Danilo [3]
Affiliations
[1] Nanjing Univ Informat Sci & Technol, Sch Software, Nanjing 210044, Peoples R China
[2] Univ Surrey, Dept Comp Sci, Nat Inspired Comp & Engn Res Grp, Guildford GU2 7XH, England
[3] Univ Teramo, Fac Commun Sci, I-64100 Teramo, Italy
Funding
National Natural Science Foundation of China
Keywords
Computer architecture; Microprocessors; Search problems; Couplings; Evolutionary computation; Encoding; Statistics; gradient optimization; image classification; neural architecture search (NAS)
DOI
10.1109/TNNLS.2024.3371432
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Neural architecture search (NAS) is a popular method for automatically designing deep neural network structures. However, designing a neural network with NAS is computationally expensive. This article proposes gradient-guided evolutionary NAS (GENAS) to design convolutional neural networks (CNNs) for image classification. GENAS is a hybrid algorithm that combines evolutionary global and local search operators to evolve a population of subnets sampled from a supernet. Each candidate architecture is encoded as a table describing which operations are associated with the edges between nodes, where the nodes signify feature maps. The evolutionary optimization uses novel crossover and mutation operators to manipulate the subnets through the proposed tabular encoding. Every n generations, the candidate architectures undergo a local search inspired by differentiable NAS. GENAS is designed to overcome the limitations of both evolutionary and gradient-descent NAS. This algorithmic structure enables the performance of a candidate architecture to be assessed without retraining, thus limiting the NAS computation time. Furthermore, subnet individuals are decoupled during evaluation to prevent strong coupling of operations in the supernet. The experimental results indicate that the searched structures achieve test errors of 2.45%, 16.86%, and 23.9% on the CIFAR-10, CIFAR-100, and ImageNet datasets, respectively, and that the search costs only 0.26 GPU days on a graphics card. GENAS can effectively expedite the training and evaluation processes and obtain high-performance network structures.
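The tabular encoding and the evolutionary operators described in the abstract can be illustrated with a minimal sketch. This is not the GENAS implementation: the operation names, the node count, the uniform crossover, the per-row mutation, the truncation selection, and the `fitness` and `local_search` hooks are all assumptions made for illustration, since the abstract does not specify them. In particular, `local_search` is only a placeholder for the gradient-guided step applied every n generations.

```python
import random

# Hypothetical operation set and cell size; the abstract does not list them.
OPS = ["skip", "conv3x3", "conv5x5", "maxpool", "avgpool"]
NUM_NODES = 4  # assumed number of feature-map nodes in one cell


def edges(num_nodes=NUM_NODES):
    """All (src, dst) node pairs with src < dst, forming a DAG."""
    return [(i, j) for j in range(1, num_nodes) for i in range(j)]


def random_table(rng):
    """Tabular encoding: one row per edge, mapping the edge to an operation."""
    return {e: rng.choice(OPS) for e in edges()}


def crossover(a, b, rng):
    """Assumed uniform crossover: each row comes from one of the two parents."""
    return {e: (a[e] if rng.random() < 0.5 else b[e]) for e in a}


def mutate(table, rng, rate=0.2):
    """Assumed mutation: with probability `rate`, replace a row's operation."""
    child = dict(table)
    for e in child:
        if rng.random() < rate:
            child[e] = rng.choice([op for op in OPS if op != child[e]])
    return child


def evolve(fitness, generations=10, pop_size=8, local_every=5,
           local_search=None, seed=0):
    """Evolve tables; every `local_every` generations apply `local_search`."""
    rng = random.Random(seed)
    pop = [random_table(rng) for _ in range(pop_size)]
    for gen in range(1, generations + 1):
        # Truncation selection (an assumption): keep the better half.
        parents = sorted(pop, key=fitness, reverse=True)[: pop_size // 2]
        children = []
        while len(parents) + len(children) < pop_size:
            p1, p2 = rng.sample(parents, 2)
            children.append(mutate(crossover(p1, p2, rng), rng))
        pop = parents + children
        # Placeholder hook for the periodic local search.
        if local_search is not None and gen % local_every == 0:
            pop = [local_search(t) for t in pop]
    return max(pop, key=fitness)


if __name__ == "__main__":
    # Toy fitness (hypothetical): count convolution rows in the table.
    toy_fitness = lambda t: sum(op.startswith("conv") for op in t.values())
    print(evolve(toy_fitness))
```

In a real NAS setting, `fitness` would score a subnet using weights inherited from the supernet rather than a toy count, which is what allows candidates to be compared without retraining.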
Pages: 1-13
Number of pages: 13