Cyclic Differentiable Architecture Search

Cited by: 20
Authors
Yu, Hongyuan [1]
Peng, Houwen [2]
Huang, Yan [1]
Fu, Jianlong [2]
Du, Hao [3]
Wang, Liang [1]
Ling, Haibin [4]
Affiliations
[1] Univ Chinese Acad Sci UCAS, Ctr Res Intelligent Percept & Comp CRIPAC, Natl Lab Pattern Recognit NLPR, Ctr Excellence Brain Sci & Intelligence Technol CE, Beijing 101408, Peoples R China
[2] Microsoft Res, Beijing 100080, Peoples R China
[3] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[4] SUNY Stony Brook, Dept Comp Sci, Stony Brook, NY 11794 USA
Funding
National Natural Science Foundation of China;
Keywords
Cyclic; introspective distillation; differentiable architecture search; unified framework; NETWORK;
DOI
10.1109/TPAMI.2022.3153065
Chinese Library Classification
TP18 [Artificial Intelligence Theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Differentiable ARchiTecture Search, i.e., DARTS, has drawn great attention in neural architecture search. It tries to find the optimal architecture in a shallow search network and then measures its performance in a deep evaluation network. The search and evaluation networks are optimized independently, however, which leaves room for improvement by allowing the two networks to interact. To address this optimization issue, we propose new joint optimization objectives and a novel Cyclic Differentiable ARchiTecture Search framework, dubbed CDARTS. Taking the structural difference between the two networks into account, CDARTS builds a cyclic feedback mechanism between the search and evaluation networks with introspective distillation. First, the search network generates an initial architecture for evaluation, and the weights of the evaluation network are optimized. Second, the architecture weights in the search network are further optimized by the label supervision in classification, as well as by the regularization from the evaluation network through feature distillation. Repeating this cycle results in a joint optimization of the search and evaluation networks and thus enables the architecture to evolve to fit the final evaluation network. Experiments and analysis on CIFAR, ImageNet and NATS-Bench [95] demonstrate the effectiveness of the proposed approach over state-of-the-art methods. Specifically, in the DARTS search space, we achieve 97.52% top-1 accuracy on CIFAR10 and 76.3% top-1 accuracy on ImageNet. In the chain-structured search space, we achieve 78.2% top-1 accuracy on ImageNet, which is 1.1% higher than EfficientNet-B0. Our code and models are publicly available at https://github.com/microsoft/Cream.
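The two-step cycle described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: it assumes a single edge with three hypothetical candidate operations, a regression task instead of classification, mean-squared error for both the label loss and the feature distillation term, and finite-difference gradients for the architecture weights. All names (`OPS`, `run_cycles`, `lam`) are made up for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(64, 4))                 # toy inputs
y = X @ np.array([1.0, -2.0, 0.5, 0.0])      # toy regression targets

# Hypothetical candidate operations on one edge: identity, zero, negate.
OPS = [lambda h: h, lambda h: np.zeros_like(h), lambda h: -h]

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

def search_feat(alpha):
    # Search network: softmax-weighted mixture of all candidate ops.
    w = softmax(alpha)
    return sum(wi * op(X) for wi, op in zip(w, OPS))

def search_loss(alpha, w_s, feat_e, lam):
    feat_s = search_feat(alpha)
    label = np.mean((feat_s @ w_s - y) ** 2)     # label supervision
    distill = np.mean((feat_s - feat_e) ** 2)    # feature distillation
    return label + lam * distill

def num_grad(f, v, eps=1e-5):
    # Central finite differences; adequate for a three-parameter toy.
    g = np.zeros_like(v)
    for i in range(v.size):
        d = np.zeros_like(v)
        d[i] = eps
        g[i] = (f(v + d) - f(v - d)) / (2 * eps)
    return g

def run_cycles(n_cycles=20, lam=0.5, lr=0.1):
    alpha = np.zeros(len(OPS))   # architecture weights (search network)
    w_s = np.zeros(4)            # head weights of the search network
    w_e = np.zeros(4)            # weights of the evaluation network
    history = []
    for _ in range(n_cycles):
        # Step 1: discretize the architecture and train the evaluation network.
        arch = int(np.argmax(alpha))
        feat_e = OPS[arch](X)
        for _ in range(10):
            w_e -= lr * 2 * feat_e.T @ (feat_e @ w_e - y) / len(y)
        # Step 2: update the search network with labels plus distillation
        # from the evaluation network's features.
        feat_s = search_feat(alpha)
        w_s -= lr * 2 * feat_s.T @ (feat_s @ w_s - y) / len(y)
        alpha -= lr * num_grad(lambda a: search_loss(a, w_s, feat_e, lam), alpha)
        history.append(float(np.mean((feat_e @ w_e - y) ** 2)))
    return alpha, arch, history
```

Calling `run_cycles()` returns the final architecture weights, the index of the last discretized operation, and the evaluation-network loss recorded after each cycle. In this toy setting the distillation term pulls the search network's mixed features toward those of the currently selected operation, which is only a rough analogue of the feedback mechanism the abstract describes.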
Pages: 211-228
Number of pages: 18
Related papers
50 in total
  • [21] Search Space Adaptation for Differentiable Neural Architecture Search in Image Classification
    Kim, Youngkee
    Jung, Soyi
    Choi, Minseok
    Kim, Joongheon
    [J]. 2022 THIRTEENTH INTERNATIONAL CONFERENCE ON UBIQUITOUS AND FUTURE NETWORKS (ICUFN), 2022, : 363 - 365
  • [22] Enhanced Differentiable Architecture Search Based on Asymptotic Regularization
    Jin, Cong
    Huang, Jinjie
    Chen, Yuanjian
    Gong, Yuqing
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 78 (02): : 1547 - 1568
  • [23] MergeNAS: Merge Operations into One for Differentiable Architecture Search
    Wang, Xiaoxing
    Xue, Chao
    Yan, Junchi
    Yang, Xiaokang
    Hu, Yonggang
    Sun, Kewei
    [J]. PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 3065 - 3072
  • [24] Mean-Shift Based Differentiable Architecture Search
    Hsieh, J.-W.
    Chou, C.-H.
    Chang, M.-C.
    Chen, P.-Y.
    Santra, S.
    Huang, C.-S.
    [J]. IEEE Transactions on Artificial Intelligence, 2024, 5 (03): : 1235 - 1246
  • [25] NDARTS: A Differentiable Architecture Search Based on the Neumann Series
    Han, Xiaoyu
    Li, Chenyu
    Wang, Zifan
    Liu, Guohua
    [J]. ALGORITHMS, 2023, 16 (12)
  • [26] Differentiable neural architecture search with channel performance measurement
    Pan, Jie
    Zheng, Xue-Chi
    Zou, Xiao-Yu
    [J]. Kongzhi yu Juece/Control and Decision, 2024, 39 (07): : 2151 - 2160
  • [27] DOTS: Decoupling Operation and Topology in Differentiable Architecture Search
    Gu, Yu-Chao
    Wang, Li-Juan
    Liu, Yun
    Yang, Yi
    Wu, Yu-Huan
    Lu, Shao-Ping
    Cheng, Ming-Ming
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 12306 - 12315
  • [28] DASS: Differentiable Architecture Search for Sparse Neural Networks
    Mousavi, Hamid
    Loni, Mohammad
    Alibeigi, Mina
    Daneshtalab, Masoud
    [J]. ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2023, 22 (05)
  • [29] Operation and Topology Aware Fast Differentiable Architecture Search
    Siddiqui, Shahid
    Kyrkou, Christos
    Theocharides, Theocharis
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 9666 - 9673
  • [30] D-DARTS: Distributed Differentiable Architecture Search
    Heuillet, Alexandre
    Tabia, Hedi
    Arioui, Hichem
    Youcef-Toumi, Kamal
    [J]. PATTERN RECOGNITION LETTERS, 2023, 176 : 42 - 48