Efficient Architecture Search for Diverse Tasks

Authors
Shen, Junhong [1]
Khodak, Mikhail [1]
Talwalkar, Ameet [1]
Affiliations
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
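Funding
National Science Foundation (USA);
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes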
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
While neural architecture search (NAS) has enabled automated machine learning (AutoML) for well-researched areas, its application to tasks beyond computer vision is still under-explored. As less-studied domains are precisely those where we expect AutoML to have the greatest impact, in this work we study NAS for efficiently solving diverse problems. Seeking an approach that is fast, simple, and broadly applicable, we fix a standard convolutional network (CNN) topology and propose to search for the right kernel sizes and dilations its operations should take on. This dramatically expands the model's capacity to extract features at multiple resolutions for different types of data while only requiring search over the operation space. To overcome the efficiency challenges of naive weight-sharing in this search space, we introduce DASH, a differentiable NAS algorithm that computes the mixture-of-operations using the Fourier diagonalization of convolution, achieving both a better asymptotic complexity and an up-to-10x search time speedup in practice. We evaluate DASH on ten tasks spanning a variety of application domains such as PDE solving, protein folding, and heart disease detection. DASH outperforms state-of-the-art AutoML methods in aggregate, attaining the best-known automated performance on seven tasks. Meanwhile, on six of the ten tasks, the combined search and retraining time is less than 2x slower than simply training a CNN backbone that is far less accurate.
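The Fourier-domain idea mentioned in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation; it only demonstrates the underlying fact that convolution is linear in the kernel, so a weighted mixture of convolutions with different kernel sizes and dilations equals one convolution with a single aggregated kernel, which can be evaluated with one FFT-based product. The function names (`dilate`, `mixture_naive`, `mixture_fft`), the 1D causal setting, and the example kernels and weights are all assumptions made for illustration.

```python
import numpy as np

def dilate(kernel, dilation):
    """Insert (dilation - 1) zeros between consecutive kernel taps."""
    out = np.zeros((len(kernel) - 1) * dilation + 1)
    out[::dilation] = kernel
    return out

def max_support(kernels, dilations):
    """Length of the longest dilated kernel in the candidate set."""
    return max((len(k) - 1) * d + 1 for k, d in zip(kernels, dilations))

def mixture_naive(x, kernels, dilations, alphas):
    """Run one convolution per candidate operation, then sum the outputs."""
    out = np.zeros(len(x) + max_support(kernels, dilations) - 1)
    for k, d, a in zip(kernels, dilations, alphas):
        y = np.convolve(x, dilate(k, d))  # full linear convolution
        out[:len(y)] += a * y
    return out

def mixture_fft(x, kernels, dilations, alphas):
    """Aggregate all candidate kernels first, then convolve once via FFT."""
    size = max_support(kernels, dilations)
    agg = np.zeros(size)
    for k, d, a in zip(kernels, dilations, alphas):
        dk = dilate(k, d)
        agg[:len(dk)] += a * dk  # weighted sum of zero-padded kernels
    n = len(x) + size - 1  # pad so circular convolution equals linear convolution
    return np.fft.irfft(np.fft.rfft(x, n) * np.fft.rfft(agg, n), n)

# Hypothetical example: three candidate operations with mixture weights.
rng = np.random.default_rng(0)
x = rng.standard_normal(64)
kernels = [rng.standard_normal(3), rng.standard_normal(5), rng.standard_normal(3)]
dilations = [1, 1, 4]
alphas = np.array([0.5, 0.3, 0.2])

naive = mixture_naive(x, kernels, dilations, alphas)
fast = mixture_fft(x, kernels, dilations, alphas)
```

In this sketch the naive path costs one convolution per candidate operation, while the FFT path costs one forward transform of the input, one of the aggregated kernel, and one inverse transform, regardless of how many candidates are mixed. Real CNN layers use multi-channel cross-correlation with centered padding, which this one-dimensional example deliberately omits.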
Pages: 14
Related papers
50 in total
  • [1] Efficient Non-Parametric Optimizer Search for Diverse Tasks
    Wang, Ruochen
    Xiong, Yuanhao
    Cheng, Minhao
    Hsieh, Cho-Jui
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [2] D2NAS: Efficient Neural Architecture Search With Performance Improvement and Model Size Reduction for Diverse Tasks
    Lee, Jungeun
    Han, Seungyub
    Lee, Jungwoo
    IEEE ACCESS, 2024, 12 : 127074 - 127085
  • [3] NAS-Bench-360: Benchmarking Neural Architecture Search on Diverse Tasks
    Tu, Renbo
    Roberts, Nicholas
    Khodak, Mikhail
    Shen, Junhong
    Sala, Frederic
    Talwalkar, Ameet
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [4] Efficient Search for Efficient Architecture
    Liao, Liewen
    Wang, Yaoming
    Li, Hao
    Dai, Wenrui
    Li, Chenglin
    Zou, Junni
    Xiong, Hongkai
    2022 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS 22), 2022, : 3140 - 3144
  • [5] NVP: A Flexible and Efficient Processor Architecture for Accelerating Diverse Computer Vision Tasks including DNN
    Liu, Ye
    Wu, Fei
    Zhao, Neng
    Zhang, Qirong
    Wang, Wenqiang
    Yang, Yutong
    Li, Xiangting
    Li, Sixu
    Huang, Zili
    Hao, Shuang
    Ou, Guangbin
    Zhou, Liang
    Chang, Liang
    Lin, Shuisheng
    Xu, Ningyi
    Zhou, Jun
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2023, 70 (01) : 271 - 275
  • [6] Generating Neural Networks for Diverse Networking Classification Tasks via Hardware-Aware Neural Architecture Search
    Xie, Guorui
    Li, Qing
    Shi, Zhenning
    Fang, Hanbin
    Ji, Shengpeng
    Jiang, Yong
    Yuan, Zhenhui
    Ma, Lianbo
    Xu, Mingwei
    IEEE TRANSACTIONS ON COMPUTERS, 2024, 73 (02) : 481 - 494
  • [7] Efficient Search for Diverse Coherent Explanations
    Russell, Chris
    FAT*'19: PROCEEDINGS OF THE 2019 CONFERENCE ON FAIRNESS, ACCOUNTABILITY, AND TRANSPARENCY, 2019, : 20 - 28
  • [8] Efficient Forward Architecture Search
    Hu, Hanzhang
    Langford, John
    Caruana, Rich
    Mukherjee, Saurajit
    Horvitz, Eric
    Dey, Debadeepta
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [9] Effective, Efficient and Robust Neural Architecture Search
    Yue, Zhixiong
    Lin, Baijiong
    Zhang, Yu
    Liang, Christy
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [10] INFORMATION ARCHITECTURE - IN SEARCH OF EFFICIENT FLEXIBILITY
    ALLEN, BR
    BOYNTON, AC
    MIS QUARTERLY, 1991, 15 (04) : 435 - 445