Profiling Neural Blocks and Design Spaces for Mobile Neural Architecture Search

Cited by: 8
Authors
Mills, Keith G. [1]
Han, Fred X. [2]
Zhang, Jialin [3]
Rezaei, Seyed Saeed Changiz [2]
Chudak, Fabian [2]
Lu, Wei [2]
Lian, Shuo [3]
Jui, Shangling [3]
Niu, Di [1]
Affiliations
[1] Univ Alberta, Edmonton, AB, Canada
[2] Huawei Technol, Edmonton, AB, Canada
[3] Huawei Kirin Solut, Shanghai, Peoples R China
Keywords
Neural Architecture Search; Design Space; Latency Measurement
DOI
10.1145/3459637.3481944
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Neural architecture search automates neural network design and has achieved state-of-the-art results in many deep learning applications. While recent literature has focused on designing networks to maximize accuracy, little work has been conducted to understand the compatibility of architecture design spaces with varying hardware. In this paper, we analyze the neural blocks used to build the Once-for-All (MobileNetV3), ProxylessNAS and ResNet families to understand their predictive power and inference latency on various devices, including Huawei Kirin 9000 NPU, RTX 2080 Ti, AMD Threadripper 2990WX, and Samsung Note10. We introduce a methodology to quantify the friendliness of neural blocks to hardware and the impact of their placement in a macro network on overall network performance, using only end-to-end measurements. Based on extensive profiling results, we derive design insights and apply them to hardware-specific search space reduction. We show that searching in the reduced search space generates better accuracy-latency Pareto frontiers than searching in the original search spaces, customizing architecture search according to the hardware. Moreover, insights derived from measurements lead to notably higher ImageNet top-1 scores on all search spaces investigated.
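The abstract compares search spaces by their accuracy-latency Pareto frontiers. As a rough illustration of that concept only (not the authors' code or search procedure), the sketch below extracts such a frontier from a set of candidate networks. The function name `pareto_frontier` and all numbers in `candidates` are hypothetical and made up for the example.

```python
from typing import List, Tuple

# Each candidate is (latency_ms, top1_accuracy).
# Lower latency is better; higher accuracy is better.
Point = Tuple[float, float]


def pareto_frontier(points: List[Point]) -> List[Point]:
    """Return the candidates that no other candidate dominates.

    A candidate is kept only if its accuracy is strictly higher than
    that of every candidate with lower or equal latency.
    """
    # Sort by latency ascending; break ties by accuracy descending so the
    # best candidate at a given latency is seen first.
    ordered = sorted(points, key=lambda p: (p[0], -p[1]))
    frontier: List[Point] = []
    best_acc = float("-inf")
    for latency, acc in ordered:
        if acc > best_acc:
            frontier.append((latency, acc))
            best_acc = acc
    return frontier


# Hypothetical measurements, invented purely for illustration.
candidates = [
    (10.0, 70.0),
    (12.0, 69.0),  # dominated: slower and less accurate than (10.0, 70.0)
    (15.0, 75.0),
    (15.0, 74.0),  # dominated: same latency, lower accuracy
    (20.0, 75.0),  # dominated: slower with no accuracy gain
    (25.0, 78.0),
]
frontier = pareto_frontier(candidates)
print(frontier)
```

Under these assumptions, one frontier is "better" than another when, at a given latency budget, it offers higher accuracy, which is the sense in which the abstract compares reduced and original search spaces.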
Pages: 4026 - 4035
Number of pages: 10