Efficient Architecture Search for Diverse Tasks

被引:0
|
作者
Shen, Junhong [1 ]
Khodak, Mikhail [1 ]
Talwalkar, Ameet [1 ]
机构
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
While neural architecture search (NAS) has enabled automated machine learning (AutoML) for well-researched areas, its application to tasks beyond computer vision is still under-explored. As less-studied domains are precisely those where we expect AutoML to have the greatest impact, in this work we study NAS for efficiently solving diverse problems. Seeking an approach that is fast, simple, and broadly applicable, we fix a standard convolutional network (CNN) topology and propose to search for the right kernel sizes and dilations its operations should take on. This dramatically expands the model's capacity to extract features at multiple resolutions for different types of data while only requiring search over the operation space. To overcome the efficiency challenges of naive weight-sharing in this search space, we introduce DASH, a differentiable NAS algorithm that computes the mixture-of-operations using the Fourier diagonalization of convolution, achieving both a better asymptotic complexity and an up-to-10x search time speedup in practice. We evaluate DASH on ten tasks spanning a variety of application domains such as PDE solving, protein folding, and heart disease detection. DASH outperforms state-of-the-art AutoML methods in aggregate, attaining the best-known automated performance on seven tasks. Meanwhile, on six of the ten tasks, the combined search and retraining time is less than 2x slower than simply training a CNN backbone that is far less accurate.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] ECCNAS: Efficient Crowd Counting Neural Architecture Search
    Wang, Yabin
    Ma, Zhiheng
    Wei, Xing
    Zheng, Shuai
    Wang, Yaowei
    Hong, Xiaopeng
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2022, 18 (01)
  • [42] AdvantageNAS: Efficient Neural Architecture Search with Credit Assignment
    Sato, Rei
    Sakuma, Jun
    Akimoto, Youhei
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 9489 - 9496
  • [43] Binarized Neural Architecture Search for Efficient Object Recognition
    Chen, Hanlin
    Zhuo, Li'an
    Zhang, Baochang
    Zheng, Xiawu
    Liu, Jianzhuang
    Ji, Rongrong
    Doermann, David
    Guo, Guodong
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 129 (02) : 501 - 516
  • [44] RankNAS: Efficient Neural Architecture Search by Pairwise Ranking
    Hu, Chi
    Wang, Chenglong
    Ma, Xiangnan
    Meng, Xia
    Li, Yinqiao
    Xiao, Tong
    Zhu, Jingbo
    Li, Changliang
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 2469 - 2480
  • [45] Efficient Evaluation Methods for Neural Architecture Search: A Survey
    Song, Xiaotian
    Xie, Xiangning
    Lv, Zeqiong
    Yen, Gary G.
    Ding, Weiping
    Lv, Jiancheng
    Sun, Yanan
    IEEE Transactions on Artificial Intelligence, 2024, 5 (12): : 5990 - 6011
  • [46] Efficient Network Architecture Search Using Hybrid Optimizer
    Wang, Ting-Ting
    Chu, Shu-Chuan
    Hu, Chia-Cheng
    Jia, Han-Dong
    Pan, Jeng-Shyang
    ENTROPY, 2022, 24 (05)
  • [47] CURIOUS: Efficient Neural Architecture Search Based on a Performance Predictor and Evolutionary Search
    Hassantabar, Shayan
    Dai, Xiaoliang
    Jha, Niraj K.
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2022, 41 (11) : 4975 - 4990
  • [48] Implementing Cognitive Architecture Plan-Image to Search Tasks in Agent Models
    Mashkova, Alrksandra
    Savina, Olga
    Mashkov, Eugene
    2018 IEEE 12TH INTERNATIONAL CONFERENCE ON APPLICATION OF INFORMATION AND COMMUNICATION TECHNOLOGIES (AICT), 2018, : 290 - 294
  • [49] A New Workflow Model for Energy Efficient Cloud Tasks Scheduling Architecture
    Saxena, Sandeep
    Sanyal, Goutam
    Sharma, Suraj
    Yadav, Suneel Kr
    2015 SECOND INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING AND COMMUNICATION ENGINEERING ICACCE 2015, 2015, : 21 - 27
  • [50] An Efficient Tasks Offloading Procedure for an Integrated Edge-Computing Architecture
    Picano, Benedetta
    Fantacci, Romano
    JOURNAL OF COMMUNICATIONS AND NETWORKS, 2024, 26 (02) : 215 - 224