Profiling Neural Blocks and Design Spaces for Mobile Neural Architecture Search

Cited by: 8
Authors
Mills, Keith G. [1]
Han, Fred X. [2]
Zhang, Jialin [3]
Rezaei, Seyed Saeed Changiz [2]
Chudak, Fabian [2]
Lu, Wei [2]
Lian, Shuo [3]
Jui, Shangling [3]
Niu, Di [1]
Affiliations
[1] University of Alberta, Edmonton, AB, Canada
[2] Huawei Technologies, Edmonton, AB, Canada
[3] Huawei Kirin Solution, Shanghai, People's Republic of China
Keywords
Neural Architecture Search; Design Space; Latency Measurement
DOI
10.1145/3459637.3481944
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Neural architecture search automates neural network design and has achieved state-of-the-art results in many deep learning applications. While recent literature has focused on designing networks to maximize accuracy, little work has been conducted to understand the compatibility of architecture design spaces to varying hardware. In this paper, we analyze the neural blocks used to build Once-for-All (MobileNetV3), ProxylessNAS and ResNet families, in order to understand their predictive power and inference latency on various devices, including Huawei Kirin 9000 NPU, RTX 2080 Ti, AMD Threadripper 2990WX, and Samsung Note10. We introduce a methodology to quantify the friendliness of neural blocks to hardware and the impact of their placement in a macro network on overall network performance via only end-to-end measurements. Based on extensive profiling results, we derive design insights and apply them to hardware-specific search space reduction. We show that searching in the reduced search space generates better accuracy-latency Pareto frontiers than searching in the original search spaces, customizing architecture search according to the hardware. Moreover, insights derived from measurements lead to notably higher ImageNet top-1 scores on all search spaces investigated.
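The accuracy-latency Pareto frontier mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' code: the function name `pareto_frontier` and the sample numbers are hypothetical, and the sketch only assumes that each candidate architecture has already been assigned a measured latency and a top-1 accuracy.

```python
from typing import List, Tuple

# Each candidate is (latency_ms, top1_accuracy); lower latency and
# higher accuracy are both better.
Point = Tuple[float, float]


def pareto_frontier(points: List[Point]) -> List[Point]:
    """Return the points that no other point dominates.

    A point is dominated if another point is at least as fast and at
    least as accurate, and strictly better in one of the two.
    """
    # Sort by latency ascending; on equal latency, higher accuracy first.
    ordered = sorted(points, key=lambda p: (p[0], -p[1]))
    frontier: List[Point] = []
    best_acc = float("-inf")
    for latency, acc in ordered:
        # Keep a point only if it beats every faster point on accuracy.
        if acc > best_acc:
            frontier.append((latency, acc))
            best_acc = acc
    return frontier


# Hypothetical measurements, made up purely for illustration.
candidates = [
    (10.0, 70.0),
    (12.0, 69.0),
    (15.0, 74.0),
    (15.0, 73.0),
    (20.0, 74.0),
    (25.0, 76.5),
]
frontier = pareto_frontier(candidates)
```

Under this definition, one frontier is better than another when, at a comparable latency, it reaches higher accuracy; that is the sense in which the abstract compares the reduced and original search spaces.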
Pages: 4026-4035
Number of pages: 10