Regularized Differentiable Architecture Search

Authors
Wang, Lanfei [1, 2]
Xie, Lingxi [3]
Zhao, Kaili [2]
Guo, Jun [2]
Tian, Qi [3]
Affiliations
[1] Huawei Technol Co Ltd, Huawei Cloud & AI BG, Beijing 100085, Peoples R China
[2] Beijing Univ Posts & Telecommun, Pattern Recognit & Intelligent Syst Lab, Beijing 100876, Peoples R China
[3] Huawei Technol Co Ltd, Beijing 100085, Peoples R China
Keywords
Computer architecture; Resource management; Training; Optimization; Market research; Topology; Stacking; Embedded systems; Deep learning; Computer vision; Machine learning; Neural networks; Automated machine learning; Neural architecture search (NAS)
DOI
10.1109/LES.2022.3204856
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Differentiable architecture search (DARTS) transforms architecture optimization into the optimization of a super-network by stacking two cells (2 c.). However, repeatedly stacking the same two cells is suboptimal, since cells at different depths should differ. In addition, we find that increasing the number of searched cells (e.g., from 2 c. to 5 c.) slightly improves performance but leads to uneven resource allocation. This letter proposes a regularized DARTS (RDARTS) to adjust the architectural differences and to balance degrees of freedom against resource allocation. Specifically, we use separate architectural parameters for two reduction cells and three normal cells, and propose a Reg distance to measure the difference between cells. We design a new validation loss that is a weighted combination of the cross-entropy loss and the Reg loss, and introduce an adaptive adjustment method. Results show that RDARTS achieves top-1 accuracies of 97.64% and 75.8% on CIFAR and ImageNet, respectively.
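The abstract does not define the Reg distance or the weighting, so the following is only a minimal sketch of how a validation loss combining cross-entropy with a cell-difference term could be assembled. Everything in it is an assumption made for illustration: the function names, the choice of mean squared difference between softmaxed operation weights as the distance, the choice to penalize (rather than reward) differences, and the fixed weight `lam` standing in for the adaptive adjustment method, which is not modeled.

```python
import math


def softmax(row):
    """Numerically stable softmax over one edge's operation logits."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def reg_distance(alpha_a, alpha_b):
    """Hypothetical distance between two cells (assumption, not the paper's
    definition): mean squared difference of softmaxed operation weights.
    Each alpha is a list of edges; each edge is a list of operation logits."""
    total, count = 0.0, 0
    for row_a, row_b in zip(alpha_a, alpha_b):
        for pa, pb in zip(softmax(row_a), softmax(row_b)):
            total += (pa - pb) ** 2
            count += 1
    return total / count


def reg_loss(cells):
    """Average pairwise distance among cells of the same type."""
    pairs = [(i, j) for i in range(len(cells)) for j in range(i + 1, len(cells))]
    if not pairs:
        return 0.0
    return sum(reg_distance(cells[i], cells[j]) for i, j in pairs) / len(pairs)


def validation_loss(cross_entropy, normal_cells, reduction_cells, lam):
    """Weighted sum of cross-entropy and the assumed Reg term.
    `lam` is a fixed illustrative weight, not the adaptive schedule."""
    return cross_entropy + lam * (reg_loss(normal_cells) + reg_loss(reduction_cells))


# Toy setup: three normal cells and two reduction cells, one edge with two ops each.
uniform = [[0.0, 0.0]]
skewed = [[math.log(3.0), 0.0]]
loss_same = validation_loss(1.5, [uniform] * 3, [uniform] * 2, lam=0.5)
loss_diff = validation_loss(1.5, [uniform] * 3, [uniform, skewed], lam=0.5)
```

In this toy setup the Reg term vanishes when all cells of a type share the same parameters, and it grows as the separately parameterized cells diverge, which is one plausible reading of "balancing degrees of freedom" in the abstract.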
Pages: 129-132
Number of pages: 4