DGL: Device Generic Latency Model for Neural Architecture Search on Mobile Devices

被引：0

作者：

Wang, Qinsi ^{[1
]}

Zhang, Sihai ^{[2
,3
]}

机构：

[1] Univ Sci & Technol China, Dept Elect Sci & Technol, Hefei 230026, Anhui, Peoples R China

[2] Chinese Acad Sci, Key Lab Wireless Opt Commun, Beijing 100045, Peoples R China

[3] Univ Sci & Technol China, Sch Microelect, Hefei 230026, Anhui, Peoples R China

来源：

IEEE TRANSACTIONS ON MOBILE COMPUTING | 2024年 / 23卷 / 02期

关键词：

Predictive models; Training; Mobile handsets; Hardware; Costs; Computer architecture; Analytical models; Neural architecture search (NAS); processor interval analysis; latency prediction; mobile devices;

D O I：

10.1109/TMC.2023.3244170

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The low-cost Neural Architecture Search (NAS) for lightweight networks working on massive mobile devices is essential for fast-developing ICT technology. Current NAS work can not search on unseen devices without latency sampling, which is a big obstacle to the implementation of NAS on mobile devices. In this paper, we overcome this challenge by proposing the Device Generic Latency (DGL) model. By absorbing processor modeling technology, the proposed DGL formula maps the parameters in the interval theory to the seven static configuration parameters of the device. And to make the formula more practical, we refine it to low-cost form by decreasing the number of configuration parameters to four. Then based on this formula, the DGL model is proposed which introduces the network parameters predictor and accuracy predictor to work with the DGL formula to predict the network latency. We propose the DGL-based NAS framework to enable fast searches without latency sampling. Extensive experiments results validate that the DGL model can achieve more accurate latency predictions than existing NAS latency predictors on unseen mobile devices. When configured with current state-of-the-art predictors, DGL-based NAS can search for architectures with higher accuracy that meet the latency limit than other NAS implementations, while using less training time and prediction time. Our work shed light on how to adopt domain knowledge into NAS topic and play important role in low-cost NAS on mobile devices.

引用

页码：1954 / 1967

页数：14

共 50 条

[1] Fast Search of Face Recognition Model for a Mobile Device Based on Neural Architecture Comparator
Savchenko, Andrey. V. V.
Savchenko, Lyudmila. V. V.
Makarov, Ilya
IEEE ACCESS, 2023, 11 : 65977 - 65990
[2] Implication of Optimizing NPU Dataflows on Neural Architecture Search for Mobile Devices
Lee, Jooyeon
Park, Junsang
Lee, Seunghyun
Kung, Jaeha
ACM TRANSACTIONS ON DESIGN AUTOMATION OF ELECTRONIC SYSTEMS, 2022, 27 (05)
[3] Latency-Constrained Neural Architecture Search Method for Efficient Model Deployment on RISC-V Devices
Xiang, Mingxi
Ding, Rui
Liu, Haijun
Zhou, Xichuan
ELECTRONICS, 2024, 13 (04)
[4] Generic Neural Architecture Search via Regression
Li, Yuhong
Hao, Cong
Li, Pan
Xiong, Jinjun
Chen, Deming
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[5] Multihardware Adaptive Latency Prediction for Neural Architecture Search
Lin, Chengmin
Yang, Pengfei
Wang, Quan
Guo, Yitong
Wang, Zhenyi
IEEE INTERNET OF THINGS JOURNAL, 2025, 12 (03): : 3385 - 3398
[6] Neural Architecture Search for Computation Offloading of DNNs from Mobile Devices to the Edge Server
Lee, KyungChae
Le Vu Linh
Kim, Heejae
Youn, Chan-Hyun
12TH INTERNATIONAL CONFERENCE ON ICT CONVERGENCE (ICTC 2021): BEYOND THE PANDEMIC ERA WITH ICT CONVERGENCE INNOVATION, 2021, : 134 - 139
[7] Microarchitecture Aware Neural Architecture Search for TinyML Devices
Guan, Juntao
Liu, Gufeng
Zeng, Fanhong
Lai, Rui
Ding, Ruixue
Zhu, Zhangming
2024 IEEE 6TH INTERNATIONAL CONFERENCE ON AI CIRCUITS AND SYSTEMS, AICAS 2024, 2024, : 522 - 526
[8] LATENCY-CONTROLLED NEURAL ARCHITECTURE SEARCH FOR STREAMING SPEECH RECOGNITION
He, Liqiang
Feng, Shulin
Su, Dan
Yu, Dong
2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2021, : 62 - 67
[9] Efficient Hardware-Aware Neural Architecture Search for Image Super-Resolution on Mobile Devices
Zhang, Xindong
Zeng, Hui
Zhang, Lei
COMPUTER VISION - ACCV 2022, PT III, 2023, 13843 : 409 - 426
[10] Profiling Neural Blocks and Design Spaces for Mobile Neural Architecture Search
Mills, Keith G.
Han, Fred X.
Zhang, Jialin
Rezaei, Seyed Saeed Changiz
Chudak, Fabian
Lu, Wei
Lian, Shuo
Jui, Shangling
Niu, Di
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 4026 - 4035

← 1 2 3 4 5 →