Efficient Architecture Search for Diverse Tasks

被引:0
|
作者
Shen, Junhong [1 ]
Khodak, Mikhail [1 ]
Talwalkar, Ameet [1 ]
机构
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
While neural architecture search (NAS) has enabled automated machine learning (AutoML) for well-researched areas, its application to tasks beyond computer vision is still under-explored. As less-studied domains are precisely those where we expect AutoML to have the greatest impact, in this work we study NAS for efficiently solving diverse problems. Seeking an approach that is fast, simple, and broadly applicable, we fix a standard convolutional network (CNN) topology and propose to search for the right kernel sizes and dilations its operations should take on. This dramatically expands the model's capacity to extract features at multiple resolutions for different types of data while only requiring search over the operation space. To overcome the efficiency challenges of naive weight-sharing in this search space, we introduce DASH, a differentiable NAS algorithm that computes the mixture-of-operations using the Fourier diagonalization of convolution, achieving both a better asymptotic complexity and an up-to-10x search time speedup in practice. We evaluate DASH on ten tasks spanning a variety of application domains such as PDE solving, protein folding, and heart disease detection. DASH outperforms state-of-the-art AutoML methods in aggregate, attaining the best-known automated performance on seven tasks. Meanwhile, on six of the ten tasks, the combined search and retraining time is less than 2x slower than simply training a CNN backbone that is far less accurate.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Carbon-Efficient Neural Architecture Search
    Zhao, Yiyang
    Guo, Tian
    PROCEEDINGS OF THE 2ND ACM WORKSHOP ON SUSTAINABLE COMPUTER SYSTEMS, HOTCARBON 2023, 2023,
  • [22] A neural architecture generator for efficient search space
    Jing, Kun
    Xu, Jungang
    Zhang, Zhen
    NEUROCOMPUTING, 2022, 486 : 189 - 199
  • [23] Efficient neural architecture search for emotion recognition
    Verma, Monu
    Mandal, Murari
    Reddy, Satish Kumar
    Meedimale, Yashwanth Reddy
    Vipparthi, Santosh Kumar
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 224
  • [24] Extensible and Efficient Proxy for Neural Architecture Search
    Li, Yuhong
    Li, Jiajie
    Hao, Cong
    Li, Pan
    Xiong, Jinjun
    Chen, Deming
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 6176 - 6187
  • [25] EfficientTDNN: Efficient Architecture Search for Speaker Recognition
    Wang, Rui
    Wei, Zhihua
    Duan, Haoran
    Ji, Shouling
    Long, Yang
    Hong, Zhen
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 2267 - 2279
  • [26] Efficient Architecture Search for Deep Neural Networks
    Gottapu, Ram Deepak
    Dagli, Cihan H.
    COMPLEX ADAPTIVE SYSTEMS, 2020, 168 : 19 - 25
  • [27] EAutoDet: Efficient Architecture Search for Object Detection
    Wang, Xiaoxing
    Lin, Jiale
    Zhao, Juanping
    Yang, Xiaokang
    Yan, Junchi
    COMPUTER VISION, ECCV 2022, PT XX, 2022, 13680 : 668 - 684
  • [28] DONNAv2-Lightweight Neural Architecture Search for Vision tasks
    Priyadarshi, Sweta
    Jiang, Tianyu
    Cheng, Hsin-Pai
    Krishna, Sendil
    Ganapathy, Viswanath
    Patel, Chirag
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 1376 - 1384
  • [29] Efficient evolutionary neural architecture search based on hybrid search space
    Gong, Tao
    Ma, Yongjie
    Xu, Yang
    Song, Changwei
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (08) : 3313 - 3326
  • [30] BNAS: Efficient Neural Architecture Search Using Broad Scalable Architecture
    Ding, Zixiang
    Chen, Yaran
    Li, Nannan
    Zhao, Dongbin
    Sun, Zhiquan
    Chen, C. L. Philip
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (09) : 5004 - 5018