Implication of Optimizing NPU Dataflows on Neural Architecture Search for Mobile Devices

被引:0
|
作者
Lee, Jooyeon [1 ]
Park, Junsang [1 ]
Lee, Seunghyun [1 ]
Kung, Jaeha [1 ]
机构
[1] Daegu Gyeongbuk Inst Sci & Technol DGIST, Daegu 42988, South Korea
关键词
Dataflow optimization; neural networks; neural architecture search; neural processing unit;
D O I
10.1145/3513085
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Recent advances in deep learning have made it possible to implement artificial intelligence in mobile devices. Many studies have put a lot of effort into developing lightweight deep learning models optimized for mobile devices. To overcome the performance limitations of manually designed deep learning models, an automated search algorithm, called neural architecture search (NAS), has been proposed. However, studies on the effect of hardware architecture of the mobile device on the performance of NAS have been less explored. In this article, we show the importance of optimizing a hardware architecture, namely, NPU dataflow, when searching for a more accurate yet fast deep learning model. To do so, we first implement an optimization framework, named FlowOptimizer, for generating a best possible NPU dataflow for a given deep learning operator. Then, we utilize this framework during the latency-aware NAS to find the model with the highest accuracy satisfying the latency constraint. As a result, we show that the searched model with FlowOptimizer outperforms the performance by 87.1% and 92.3% on average compared to the searched model with NVDLA and Eyeriss, respectively, with better accuracy on a proxy dataset. We also show that the searched model can be transferred to a larger model to classify a more complex image dataset, i.e., ImageNet, achieving 0.2%/5.4% higher Top-1/Top-5 accuracy compared to MobileNetV2-1.0 with 3.6x lower latency.
引用
收藏
页数:24
相关论文
共 50 条
  • [31] Privacy-Preserving Neural Architecture Search Across Federated IoT Devices
    Zhang, Chunhui
    Yuan, Xiaoming
    Zhang, Qianyun
    Zhu, Guangxu
    Cheng, Lei
    Zhang, Ning
    2021 IEEE 20TH INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2021), 2021, : 1434 - 1438
  • [32] Optimizing the downlink for mobile wireless devices
    Methfessel, M
    Frankenfeldt, H
    Dombrowski, KF
    Kraemer, R
    PROCEEDINGS OF THE IASTED INTERNATIONAL CONFERENCE ON WIRELESS AND OPTICAL COMMUNICATIONS, 2002, : 245 - 249
  • [33] Optimizing FPGA-Based CNN Accelerator Using Differentiable Neural Architecture Search
    Fan, Hongxiang
    Ferianc, Martin
    Liu, Shuanglong
    Que, Zhiqiang
    Niu, Xinyu
    Luk, Wayne
    2020 IEEE 38TH INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD 2020), 2020, : 465 - 468
  • [34] Optimizing the Run Time in Mobile Devices
    Mani, K.
    Mullai, A.
    2017 2ND WORLD CONGRESS ON COMPUTING AND COMMUNICATION TECHNOLOGIES (WCCCT), 2017, : 55 - 60
  • [35] FastDeepIoT: Towards Understanding and Optimizing Neural Network Execution Time on Mobile and Embedded Devices
    Yao, Shuochao
    Zhao, Yiran
    Shao, Huajie
    Liu, ShengZhong
    Liu, Dongxin
    Su, Lu
    Abdelzaher, Tarek
    SENSYS'18: PROCEEDINGS OF THE 16TH CONFERENCE ON EMBEDDED NETWORKED SENSOR SYSTEMS, 2018, : 278 - 291
  • [36] Mobile search - Social network search using mobile devices demonstration
    Tiago, Pedro
    Kotilainen, Niko
    Vapa, Mikko
    2008 5TH IEEE CONSUMER COMMUNICATIONS AND NETWORKING CONFERENCE, VOLS 1-3, 2008, : 1245 - 1245
  • [37] Designing resource-constrained neural networks using neural architecture search targeting embedded devices
    Cassimon, Amber
    Vanneste, Simon
    Bosmans, Stig
    Mercelis, Siegfried
    Hellinckx, Peter
    INTERNET OF THINGS, 2020, 12
  • [38] Graph Neural Architecture Search
    Gao, Yang
    Yang, Hong
    Zhang, Peng
    Zhou, Chuan
    Hu, Yue
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 1403 - 1409
  • [39] Neural architecture search: A survey
    Elsken, Thomas
    Metzen, Jan Hendrik
    Hutter, Frank
    Journal of Machine Learning Research, 2019, 20
  • [40] Advances in neural architecture search
    Xin Wang
    Wenwu Zhu
    National Science Review, 2024, 11 (08) : 24 - 38