Cyclic Differentiable Architecture Search

被引:20
|
作者
Yu, Hongyuan [1 ]
Peng, Houwen [2 ]
Huang, Yan [1 ]
Fu, Jianlong [2 ]
Du, Hao [3 ]
Wang, Liang [1 ]
Ling, Haibin [4 ]
机构
[1] Univ Chinese Acad Sci UCAS, Ctr Res Intelligent Percept & Comp CRIPAC, Natl Lab Pattern Recognit NLPR, Ctr Excellence Brain Sci & Intelligence Technol CE, Beijing 101408, Peoples R China
[2] Microsoft Res, Beijing 100080, Peoples R China
[3] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[4] SUNY Stony Brook, Dept Comp Sci, Stony Brook, NY 11794 USA
基金
中国国家自然科学基金;
关键词
Cyclic; introspective distillation; differentiable architecture search; unified framework; NETWORK;
D O I
10.1109/TPAMI.2022.3153065
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Differentiable ARchiTecture Search, i.e., DARTS, has drawn great attention in neural architecture search. It tries to find the optimal architecture in a shallow search network and then measures its performance in a deep evaluation network. The independent optimization of the search and evaluation networks, however, leaves a room for potential improvement by allowing interaction between the two networks. To address the problematic optimization issue, we propose new joint optimization objectives and a novel Cyclic Differentiable ARchiTecture Search framework, dubbed CDARTS. Considering the structure difference, CDARTS builds a cyclic feedback mechanism between the search and evaluation networks with introspective distillation. First, the search network generates an initial architecture for evaluation, and the weights of the evaluation network are optimized. Second, the architecture weights in the search network are further optimized by the label supervision in classification, as well as the regularization from the evaluation network through feature distillation. Repeating the above cycle results in a joint optimization of the search and evaluation networks and thus enables the evolution of the architecture to fit the final evaluation network. The experiments and analysis on CIFAR, ImageNet and NATS-Bench [95] demonstrate the effectiveness of the proposed approach over the state-of-the-art ones. Specifically, in the DARTS search space, we achieve 97.52% top-1 accuracy on CIFAR10 and 76.3% top-1 accuracy on ImageNet. In the chain-structured search space, we achieve 78.2% top-1 accuracy on ImageNet, which is 1.1% higher than EfficientNet-B0. Our code and models are publicly available at https://github.com/microsoft/Cream.
引用
收藏
页码:211 / 228
页数:18
相关论文
共 50 条
  • [1] Differentiable quantum architecture search
    Zhang, Shi-Xin
    Hsieh, Chang-Yu
    Zhang, Shengyu
    Yao, Hong
    [J]. QUANTUM SCIENCE AND TECHNOLOGY, 2022, 7 (04)
  • [2] Regularized Differentiable Architecture Search
    Wang, Lanfei
    Xie, Lingxi
    Zhao, Kaili
    Guo, Jun
    Tian, Qi
    [J]. IEEE EMBEDDED SYSTEMS LETTERS, 2023, 15 (03) : 129 - 132
  • [3] Group Differentiable Architecture Search
    Shen, Chaoyuan
    Xu, Jinhua
    [J]. IEEE ACCESS, 2021, 9 : 76585 - 76591
  • [4] The limitations of differentiable architecture search
    Guillaume, Lacharme
    Hubert, Cardot
    Christophe, Lente
    Nicolas, Monmarche
    [J]. PATTERN ANALYSIS AND APPLICATIONS, 2024, 27 (02)
  • [5] Differentiable Architecture Search with Random Features
    Zhang, Xuanyang
    Li, Yonggang
    Zhang, Xiangyu
    Wang, Yongtao
    Sun, Jian
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 16060 - 16069
  • [6] Enhanced Gradient for Differentiable Architecture Search
    Zhang, Haichao
    Hao, Kuangrong
    Gao, Lei
    Tang, Xuesong
    Wei, Bing
    Wei, Bing
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (07) : 9606 - 9620
  • [7] Sparse Gate for Differentiable Architecture Search
    Fan, Liang
    Wang, Handing
    [J]. 2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [8] Differentiable Architecture Search for Reinforcement Learning
    Miao, Yingjie
    Song, Xingyou
    Co-Reyes, John D.
    Peng, Daiyi
    Yue, Summer
    Brevdo, Eugene
    Faust, Aleksandra
    [J]. INTERNATIONAL CONFERENCE ON AUTOMATED MACHINE LEARNING, VOL 188, 2022, 188
  • [9] IDARTS: Interactive Differentiable Architecture Search
    Xue, Song
    Wang, Runqi
    Zhang, Baochang
    Wang, Tian
    Guo, Guodong
    Doermann, David
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 1143 - 1152
  • [10] An architecture entropy regularizer for differentiable neural architecture search
    Jing, Kun
    Chen, Luoyu
    Xu, Jungang
    [J]. NEURAL NETWORKS, 2023, 158 : 111 - 120