Evolutionary Recurrent Neural Architecture Search

Cited by: 2
Authors
Tian, Shuo [1]
Hu, Kai [1]
Guo, Shasha [1]
Li, Shiming [1]
Wang, Lei [1]
Xu, Weixia [1]
Affiliations
[1] Natl Univ Def Technol, Coll Comp Sci & Technol, Changsha 410000, Peoples R China
Keywords
Computer architecture; Sociology; Statistics; Microprocessors; Manuals; Training; Computational modeling; Deep learning; evolution algorithm; neural architecture search (NAS); parameter sharing;
DOI
10.1109/LES.2020.3005753
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Deep learning has driven remarkable progress on a wide range of tasks, but the effort required to hand-craft neural networks has motivated neural architecture search (NAS), which discovers architectures automatically. The recent aging evolution (AE) search algorithm discards the oldest model in the population and finds image classifiers that surpass manual designs. However, it converges slowly. A nonaging evolution (NAE) algorithm instead discards the worst architecture in the population to accelerate the search, but it achieves lower performance than AE. To address this issue, in this letter we propose an optimized evolution algorithm for recurrent NAS (EvoRNAS) that uses a probability epsilon to choose between removing the worst and removing the oldest model in the population, which balances performance against search time. In addition, because evaluating candidate models is costly in both AE and NAE, our approach introduces a parameter sharing mechanism. Furthermore, we train the shared parameters only once rather than for many epochs as in ENAS, which makes the evaluation of candidate models faster. On Penn Treebank, we first explore different values of epsilon in EvoRNAS and find the value best suited to the learning task, which also outperforms AE and NAE. Second, the optimal cell found by EvoRNAS achieves state-of-the-art performance within only 0.6 GPU hours, which is 20× and 40× faster than ENAS and DARTS, respectively. Moreover, the learned architecture transfers to WikiText-2, where it also shows strong performance compared with ENAS and DARTS.
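The epsilon-based removal rule described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say which branch epsilon selects, so the code assumes that with probability epsilon the worst model is removed (the NAE step) and otherwise the oldest (the AE step). The function `evolve`, the tournament-style parent selection, and the toy helpers `random_bits`, `flip_one_bit`, and `count_ones` are hypothetical names introduced for illustration, and parameter sharing is not modeled; `fitness` stands in for whatever score the shared weights would provide.

```python
import random
from collections import deque


def evolve(random_arch, mutate, fitness, epsilon,
           population_size=20, sample_size=5, cycles=200, seed=0):
    """Evolution loop mixing AE-style and NAE-style removal.

    ASSUMPTION: with probability `epsilon` the worst individual is removed
    (NAE step); otherwise the oldest is removed (AE step). So epsilon=0
    behaves like pure AE and epsilon=1 like pure NAE in this sketch.
    """
    if not 0.0 <= epsilon <= 1.0:
        raise ValueError("epsilon must lie in [0, 1]")
    rng = random.Random(seed)

    # Oldest individual sits at the left end of the deque.
    population = deque()
    for _ in range(population_size):
        arch = random_arch(rng)
        population.append((arch, fitness(arch)))
    best = max(population, key=lambda ind: ind[1])

    for _ in range(cycles):
        # Sample a few candidates and mutate the fittest of them.
        candidates = rng.sample(list(population), sample_size)
        parent = max(candidates, key=lambda ind: ind[1])
        child_arch = mutate(parent[0], rng)
        child = (child_arch, fitness(child_arch))
        population.append(child)
        if child[1] > best[1]:
            best = child

        # Keep the population size fixed by removing one individual.
        if rng.random() < epsilon:
            population.remove(min(population, key=lambda ind: ind[1]))  # worst
        else:
            population.popleft()  # oldest

    return best, list(population)


# Toy stand-ins (hypothetical): an "architecture" is a tuple of 8 bits
# and its fitness is the number of ones.
def random_bits(rng, n=8):
    return tuple(rng.randint(0, 1) for _ in range(n))


def flip_one_bit(arch, rng):
    i = rng.randrange(len(arch))
    return arch[:i] + (1 - arch[i],) + arch[i + 1:]


def count_ones(arch):
    return sum(arch)


best, population = evolve(random_bits, flip_one_bit, count_ones, epsilon=0.5)
print(best)
```

Sweeping `epsilon` between 0 and 1 in such a loop mirrors the exploration of different epsilon values that the abstract reports, though the toy fitness here says nothing about the results on Penn Treebank.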
Pages: 110-113
Number of pages: 4