Automatic Design of Convolutional Neural Network Architectures Under Resource Constraints

被引:12
|
作者
Li, Siyi [1 ]
Sun, Yanan [1 ]
Yen, Gary G. [2 ]
Zhang, Mengjie [3 ]
机构
[1] Sichuan Univ, Coll Comp Sci, Chengdu 610065, Peoples R China
[2] Oklahoma State Univ, Sch Elect & Comp Engn, Stillwater, OK 74078 USA
[3] Victoria Univ Wellington, Sch Engn & Comp Sci, Wellington 6140, New Zealand
基金
中国国家自然科学基金;
关键词
Constraint handling; convolutional neural networks (CNNs); evolutionary deep learning; genetic algorithms (GAs); neural architecture search (NAS); GENETIC ALGORITHM;
D O I
10.1109/TNNLS.2021.3123105
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the rise of various smart electronics and mobile/edge devices, many existing high-accuracy convolutional neural network (CNN) models are difficult to be applied in practice due to the limited resources, such as memory capacity, power consumption, and spectral efficiency. In order to meet these constraints, researchers have carefully designed some lightweight networks. Meanwhile, to reduce the reliance on manual design on expert experience, some researchers also work to improve neural architecture search (NAS) algorithms to automatically design small networks, exploiting the multiobjective approaches that consider both accuracy and other important goals during optimization. However, simply searching for smaller network models is not consistent with the current research belief of ``the deeper the better'' and may affect the effectiveness of the model and thus waste the limited resources available. In this article, we propose an automatic method for designing CNNs architectures under constraint handling, which can search for optimal network models meeting the preset constraint. Specifically, an adaptive penalty algorithm is used for fitness evaluation, and a selective repair operation is developed for infeasible individuals to search for feasible CNN architectures. As a case study, we set the complexity (the number of parameters) as a resource constraint and perform multiple experiments on CIFAR-10 and CIFAR-100, to demonstrate the effectiveness of the proposed method. In addition, the proposed algorithm is compared with a state-of-the-art algorithm, NSGA-Net, and several manual-designed models. The experimental results show that the proposed algorithm can successfully solve the problem of the uncertain size of the optimal CNN model under the random search strategy, and the automatically designed CNN model can satisfy the predefined resource constraint while achieving better accuracy.
引用
收藏
页码:3832 / 3846
页数:15
相关论文
共 50 条
  • [1] Learning Network Architectures of Deep CNNs under Resource Constraints
    Chan, Michael
    Scarafoni, Daniel
    Duarte, Ronald
    Thornton, Jason
    Skelly, Luke
    [J]. PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, : 1784 - 1791
  • [2] Optimizing Convolutional Neural Network Architectures
    Balderas, Luis
    Lastra, Miguel
    Benitez, Jose M.
    [J]. MATHEMATICS, 2024, 12 (19)
  • [3] Application of an Innovative Convolutional/LSTM Neural Network for Computing Resource Allocation in NFV Network Architectures
    Eramo, Vincenzo
    Catena, Tiziana
    [J]. IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2022, 19 (03): : 2929 - 2943
  • [4] Design of Deep Convolutional Neural Network Architectures for Denoising Electrocardiographic Signals
    Arsene, Corneliu
    [J]. 2020 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY (CIBCB), 2020, : 10 - 17
  • [5] Automatic Design of Convolutional Neural Network for Hyperspectral Image Classification
    Chen, Yushi
    Zhu, Kaiqiang
    Zhu, Lin
    He, Xin
    Ghamisi, Pedram
    Benediktsson, Jon Atli
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2019, 57 (09): : 7048 - 7066
  • [6] Modified symbiotic organisms search optimization for automatic construction of convolutional neural network architectures
    Jauro F.
    Ya'u Gital A.
    Abdullahi U.A.
    Abdulsalami A.O.
    Abdullahi M.
    Ibrahim A.A.
    Chiroma H.
    [J]. Intelligent Systems with Applications, 2024, 22
  • [7] Bandwidth Efficient Architectures for Convolutional Neural Network
    Wang, Jichen
    Lin, Jun
    Wang, Zhongfeng
    [J]. PROCEEDINGS OF THE 2018 IEEE INTERNATIONAL WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS), 2018, : 94 - 99
  • [8] A review of convolutional neural network architectures and their optimizations
    Cong, Shuang
    Zhou, Yang
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (03) : 1905 - 1969
  • [9] Efficient Convolution Architectures for Convolutional Neural Network
    Wang, Jichen
    Lin, Jun
    Wang, Zhongfeng
    [J]. 2016 8TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS & SIGNAL PROCESSING (WCSP), 2016,
  • [10] A review of convolutional neural network architectures and their optimizations
    Shuang Cong
    Yang Zhou
    [J]. Artificial Intelligence Review, 2023, 56 : 1905 - 1969