Implication of Optimizing NPU Dataflows on Neural Architecture Search for Mobile Devices

被引:0
|
作者
Lee, Jooyeon [1 ]
Park, Junsang [1 ]
Lee, Seunghyun [1 ]
Kung, Jaeha [1 ]
机构
[1] Daegu Gyeongbuk Inst Sci & Technol DGIST, Daegu 42988, South Korea
关键词
Dataflow optimization; neural networks; neural architecture search; neural processing unit;
D O I
10.1145/3513085
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Recent advances in deep learning have made it possible to implement artificial intelligence in mobile devices. Many studies have put a lot of effort into developing lightweight deep learning models optimized for mobile devices. To overcome the performance limitations of manually designed deep learning models, an automated search algorithm, called neural architecture search (NAS), has been proposed. However, studies on the effect of hardware architecture of the mobile device on the performance of NAS have been less explored. In this article, we show the importance of optimizing a hardware architecture, namely, NPU dataflow, when searching for a more accurate yet fast deep learning model. To do so, we first implement an optimization framework, named FlowOptimizer, for generating a best possible NPU dataflow for a given deep learning operator. Then, we utilize this framework during the latency-aware NAS to find the model with the highest accuracy satisfying the latency constraint. As a result, we show that the searched model with FlowOptimizer outperforms the performance by 87.1% and 92.3% on average compared to the searched model with NVDLA and Eyeriss, respectively, with better accuracy on a proxy dataset. We also show that the searched model can be transferred to a larger model to classify a more complex image dataset, i.e., ImageNet, achieving 0.2%/5.4% higher Top-1/Top-5 accuracy compared to MobileNetV2-1.0 with 3.6x lower latency.
引用
收藏
页数:24
相关论文
共 50 条
  • [1] Efficient Execution of Deep Neural Networks on Mobile Devices with NPU
    Tan, Tianxiang
    Cao, Guohong
    IPSN'21: PROCEEDINGS OF THE 20TH ACM/IEEE CONFERENCE ON INFORMATION PROCESSING IN SENSOR NETWORKS, 2021, : 283 - 298
  • [2] Optimizing the A* search algorithm for mobile robotic devices
    Maneev, V. V.
    Syryamkin, M. V.
    III INTERNATIONAL CONFERENCE COGNITIVE ROBOTICS, 2019, 516
  • [3] DGL: Device Generic Latency Model for Neural Architecture Search on Mobile Devices
    Wang, Qinsi
    Zhang, Sihai
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (02) : 1954 - 1967
  • [4] Neural Architecture Search for Computation Offloading of DNNs from Mobile Devices to the Edge Server
    Lee, KyungChae
    Le Vu Linh
    Kim, Heejae
    Youn, Chan-Hyun
    12TH INTERNATIONAL CONFERENCE ON ICT CONVERGENCE (ICTC 2021): BEYOND THE PANDEMIC ERA WITH ICT CONVERGENCE INNOVATION, 2021, : 134 - 139
  • [5] Microarchitecture Aware Neural Architecture Search for TinyML Devices
    Guan, Juntao
    Liu, Gufeng
    Zeng, Fanhong
    Lai, Rui
    Ding, Ruixue
    Zhu, Zhangming
    2024 IEEE 6TH INTERNATIONAL CONFERENCE ON AI CIRCUITS AND SYSTEMS, AICAS 2024, 2024, : 522 - 526
  • [6] NASS: Optimizing Secure Inference via Neural Architecture Search
    Bian, Song
    Jiang, Weiwen
    Lu, Qing
    Shi, Yiyu
    Sato, Takashi
    ECAI 2020: 24TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, 325 : 1746 - 1753
  • [7] Efficient Hardware-Aware Neural Architecture Search for Image Super-Resolution on Mobile Devices
    Zhang, Xindong
    Zeng, Hui
    Zhang, Lei
    COMPUTER VISION - ACCV 2022, PT III, 2023, 13843 : 409 - 426
  • [8] Profiling Neural Blocks and Design Spaces for Mobile Neural Architecture Search
    Mills, Keith G.
    Han, Fred X.
    Zhang, Jialin
    Rezaei, Seyed Saeed Changiz
    Chudak, Fabian
    Lu, Wei
    Lian, Shuo
    Jui, Shangling
    Niu, Di
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 4026 - 4035
  • [9] Neural architecture search for resource constrained hardware devices: A survey
    Yang, Yongjia
    Zhan, Jinyu
    Jiang, Wei
    Jiang, Yucheng
    Yu, Antai
    IET CYBER-PHYSICAL SYSTEMS: THEORY & APPLICATIONS, 2023, 8 (03) : 149 - 159
  • [10] Efficient Human Activity Recognition Using Lookup Table-Based Neural Architecture Search for Mobile Devices
    Lim, Won-Seon
    Seo, Wangduk
    Kim, Dae-Won
    Lee, Jaesung
    IEEE ACCESS, 2023, 11 : 71727 - 71738