Profiling Neural Blocks and Design Spaces for Mobile Neural Architecture Search

被引:8
|
作者
Mills, Keith G. [1 ]
Han, Fred X. [2 ]
Zhang, Jialin [3 ]
Rezaei, Seyed Saeed Changiz [2 ]
Chudak, Fabian [2 ]
Lu, Wei [2 ]
Lian, Shuo [3 ]
Jui, Shangling [3 ]
Niu, Di [1 ]
机构
[1] Univ Alberta, Edmonton, AB, Canada
[2] Huawei Technol, Edmonton, AB, Canada
[3] Huawei Kirin Solut, Shanghai, Peoples R China
关键词
Neural Architecture Search; Design Space; Latency Measurement;
D O I
10.1145/3459637.3481944
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Neural architecture search automates neural network design and has achieved state-of-the-art results in many deep learning applications. While recent literature has focused on designing networks to maximize accuracy, little work has been conducted to understand the compatibility of architecture design spaces to varying hardware. In this paper, we analyze the neural blocks used to build Once-for-All (MobileNetV3), ProxylessNAS and ResNet families, in order to understand their predictive power and inference latency on various devices, including Huawei Kirin 9000 NPU, RTX 2080 Ti, AMD Threadripper 2990WX, and Samsung Note10. We introduce a methodology to quantify the friendliness of neural blocks to hardware and the impact of their placement in a macro network on overall network performance via only end-to-end measurements. Based on extensive profiling results, we derive design insights and apply them to hardware-specific search space reduction. We show that searching in the reduced search space generates better accuracy-latency Pareto frontiers than searching in the original search spaces, customizing architecture search according to the hardware. Moreover, insights derived from measurements lead to notably higher ImageNet top-1 scores on all search spaces investigated.
引用
收藏
页码:4026 / 4035
页数:10
相关论文
共 50 条
  • [31] A review of neural architecture search
    Baymurzina, Dilyara
    Golikov, Eugene
    Burtsev, Mikhail
    NEUROCOMPUTING, 2022, 474 : 82 - 93
  • [32] Disentangled Neural Architecture Search
    Zheng, Xinyue
    Wang, Peng
    Wang, Qigang
    Shi, Zhongchao
    Fan, Jianping
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [33] Neural Architecture Search for Convolutional Neural Networks with Attention
    Nakai, Kohei
    Matsubara, Takashi
    Uehara, Kuniaki
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2021, E104D (02) : 312 - 321
  • [34] Effective, Efficient and Robust Neural Architecture Search Effective, Efficient and Robust Neural Architecture Search
    Yue, Zhixiong
    Lin, Baijiong
    Zhang, Yu
    Liang, Christy
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [35] Neural AutoForensics: Comparing Neural Sample Search and Neural Architecture Search for malware detection and forensics
    Sewak, Mohit
    Sahay, Sanjay K.
    Rathore, Hemant
    FORENSIC SCIENCE INTERNATIONAL-DIGITAL INVESTIGATION, 2022, 43
  • [36] Evolving Search Space for Neural Architecture Search
    Ci, Yuanzheng
    Lin, Chen
    Sun, Ming
    Chen, Boyu
    Zhang, Hongwen
    Ouyang, Wanli
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 6639 - 6649
  • [37] Random Search and Reproducibility for Neural Architecture Search
    Li, Liam
    Talwalkar, Ameet
    35TH UNCERTAINTY IN ARTIFICIAL INTELLIGENCE CONFERENCE (UAI 2019), 2020, 115 : 367 - 377
  • [38] DGL: Device Generic Latency Model for Neural Architecture Search on Mobile Devices
    Wang, Qinsi
    Zhang, Sihai
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (02) : 1954 - 1967
  • [39] NPENAS: Neural Predictor Guided Evolution for Neural Architecture Search
    Wei, Chen
    Niu, Chuang
    Tang, Yiping
    Wang, Yue
    Hu, Haihong
    Liang, Jimin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (11) : 8441 - 8455
  • [40] Neural Architecture Search for Low-Precision Neural Networks
    Wu, Binyi
    Waschneck, Bernd
    Mayr, Christian
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT IV, 2022, 13532 : 743 - 755