DGL: Device Generic Latency Model for Neural Architecture Search on Mobile Devices

Times cited: 0
Authors
Wang, Qinsi [1]
Zhang, Sihai [2, 3]
Affiliations
[1] Univ Sci & Technol China, Dept Elect Sci & Technol, Hefei 230026, Anhui, Peoples R China
[2] Chinese Acad Sci, Key Lab Wireless Opt Commun, Beijing 100045, Peoples R China
[3] Univ Sci & Technol China, Sch Microelect, Hefei 230026, Anhui, Peoples R China
Keywords
Predictive models; Training; Mobile handsets; Hardware; Costs; Computer architecture; Analytical models; Neural architecture search (NAS); processor interval analysis; latency prediction; mobile devices
DOI
10.1109/TMC.2023.3244170
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Low-cost Neural Architecture Search (NAS) for lightweight networks running on massive numbers of mobile devices is essential for fast-developing ICT. Current NAS methods cannot search for unseen devices without latency sampling, which is a major obstacle to deploying NAS on mobile devices. In this paper, we overcome this challenge by proposing the Device Generic Latency (DGL) model. Drawing on processor modeling techniques, the proposed DGL formula maps the parameters of interval theory to seven static configuration parameters of the device. To make the formula more practical, we refine it into a low-cost form by reducing the number of configuration parameters to four. Based on this formula, we propose the DGL model, which introduces a network parameters predictor and an accuracy predictor that work with the DGL formula to predict network latency. We further propose a DGL-based NAS framework that enables fast searches without latency sampling. Extensive experimental results validate that the DGL model achieves more accurate latency predictions than existing NAS latency predictors on unseen mobile devices. When configured with current state-of-the-art predictors, DGL-based NAS finds architectures that meet the latency limit with higher accuracy than other NAS implementations, while using less training time and prediction time. Our work sheds light on how to incorporate domain knowledge into NAS and plays an important role in low-cost NAS on mobile devices.
Pages: 1954-1967
Number of pages: 14