A Gradient-Guided Evolutionary Neural Architecture Search

被引:5
|
作者
Xue, Yu [1 ]
Han, Xiaolong [1 ]
Neri, Ferrante [2 ]
Qin, Jiafeng [1 ]
Pelusi, Danilo [3 ]
机构
[1] Nanjing Univ Informat Sci & Technol, Sch Software, Nanjing 210044, Peoples R China
[2] Univ Surrey, Dept Comp Sci, Nat Inspired Comp & Engn Res Grp, Guildford GU2 7XH, England
[3] Univ Teramo, Fac Commun Sci, I-64100 Teramo, Italy
基金
中国国家自然科学基金;
关键词
Computer architecture; Microprocessors; Search problems; Couplings; Evolutionary computation; Encoding; Statistics; gradient optimization; image classification; neural architecture search (NAS);
D O I
10.1109/TNNLS.2024.3371432
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neural architecture search (NAS) is a popular method that can automatically design deep neural network structures. However, designing a neural network using NAS is computationally expensive. This article proposes a gradient-guided evolutionary NAS (GENAS) to design convolutional neural networks (CNNs) for image classification. GENAS is a hybrid algorithm that combines evolutionary global and local search operators to evolve a population of subnets sampled from a supernet. Each candidate architecture is encoded as a table describing which operations are associated with the edges between nodes signifying feature maps. Besides, evolutionary optimization uses novel crossover and mutation operators to manipulate the subnets using the proposed tabular encoding. Every n generations, the candidate architectures undergo a local search inspired by differentiable NAS. GENAS is designed to overcome the limitations of both evolutionary and gradient descent NAS. This algorithmic structure enables the performance assessment of the candidate architecture without retraining, thus limiting the NAS calculation time. Furthermore, subnet individuals are decoupled during evaluation to prevent strong coupling of operations in the supernet. The experimental results indicate that the searched structures achieve test errors of 2.45%, 16.86%, and 23.9% on CIFAR-10/100/ImageNet datasets and it costs only 0.26 GPU days on a graphic card. GENAS can effectively expedite the training and evaluation processes and obtain high-performance network structures.
引用
收藏
页码:1 / 13
页数:13
相关论文
共 50 条
  • [41] Toward Evolutionary Multitask Convolutional Neural Architecture Search
    Zhou, Xun
    Wang, Zhenkun
    Feng, Liang
    Liu, Songbai
    Wong, Ka-Chun
    Tan, Kay Chen
    IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2024, 28 (03) : 682 - 695
  • [42] GradDT: Gradient-Guided Despeckling Transformer for Industrial Imaging Sensors
    Lu, Yuxu
    Guo, Yu
    Liu, Ryan Wen
    Chui, Kwok Tai
    Gupta, Brij B.
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (02) : 2238 - 2248
  • [43] Evolutionary Neural Architecture Search and Its Applications in Healthcare
    Liu, Xin
    Li, Jie
    Zhao, Jianwei
    Cao, Bin
    Yan, Rongge
    Lyu, Zhihan
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2024, 139 (01): : 143 - 185
  • [44] Evolutionary Neural Architecture Search for Facial Expression Recognition
    Deng, Shuchao
    Lv, Zeqiong
    Galvan, Edgar
    Sun, Yanan
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2023, 7 (05): : 1405 - 1419
  • [45] Evolutionary Neural Architecture Search for Transformer in Knowledge Tracing
    Yang, Shangshang
    Yu, Xiaoshan
    Tian, Ye
    Yan, Xueming
    Ma, Haiping
    Zhang, Xingyi
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [46] BenchENAS: A Benchmarking Platform for Evolutionary Neural Architecture Search
    Xie, Xiangning
    Liu, Yuqiao
    Sun, Yanan
    Yen, Gary G.
    Xue, Bing
    Zhang, Mengjie
    IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2022, 26 (06) : 1473 - 1485
  • [47] Efficient evolutionary neural architecture search based on hybrid search space
    Gong, Tao
    Ma, Yongjie
    Xu, Yang
    Song, Changwei
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (08) : 3313 - 3326
  • [48] Hybrid Architecture-Based Evolutionary Robust Neural Architecture Search
    Yang, Shangshang
    Sun, Xiangkun
    Xu, Ke
    Liu, Yuanchao
    Tian, Ye
    Zhang, Xingyi
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (04): : 2919 - 2934
  • [49] Traditional and Accelerated Gradient Descent for Neural Architecture Search
    Trillos, Nicolas Garcia
    Morales, Felix
    Morales, Javier
    GEOMETRIC SCIENCE OF INFORMATION (GSI 2021), 2021, 12829 : 507 - 514
  • [50] Gradient-Guided Residual Learning for Inverse Halftoning and Image Expanding
    Yuan, Jin
    Pan, Chao
    Zheng, Yan
    Zhu, Xianyi
    Qin, Zheng
    Xiao, Yi
    IEEE ACCESS, 2020, 8 : 50995 - 51007