DGL: Device Generic Latency Model for Neural Architecture Search on Mobile Devices

被引:0
|
作者
Wang, Qinsi [1 ]
Zhang, Sihai [2 ,3 ]
机构
[1] Univ Sci & Technol China, Dept Elect Sci & Technol, Hefei 230026, Anhui, Peoples R China
[2] Chinese Acad Sci, Key Lab Wireless Opt Commun, Beijing 100045, Peoples R China
[3] Univ Sci & Technol China, Sch Microelect, Hefei 230026, Anhui, Peoples R China
关键词
Predictive models; Training; Mobile handsets; Hardware; Costs; Computer architecture; Analytical models; Neural architecture search (NAS); processor interval analysis; latency prediction; mobile devices;
D O I
10.1109/TMC.2023.3244170
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The low-cost Neural Architecture Search (NAS) for lightweight networks working on massive mobile devices is essential for fast-developing ICT technology. Current NAS work can not search on unseen devices without latency sampling, which is a big obstacle to the implementation of NAS on mobile devices. In this paper, we overcome this challenge by proposing the Device Generic Latency (DGL) model. By absorbing processor modeling technology, the proposed DGL formula maps the parameters in the interval theory to the seven static configuration parameters of the device. And to make the formula more practical, we refine it to low-cost form by decreasing the number of configuration parameters to four. Then based on this formula, the DGL model is proposed which introduces the network parameters predictor and accuracy predictor to work with the DGL formula to predict the network latency. We propose the DGL-based NAS framework to enable fast searches without latency sampling. Extensive experiments results validate that the DGL model can achieve more accurate latency predictions than existing NAS latency predictors on unseen mobile devices. When configured with current state-of-the-art predictors, DGL-based NAS can search for architectures with higher accuracy that meet the latency limit than other NAS implementations, while using less training time and prediction time. Our work shed light on how to adopt domain knowledge into NAS topic and play important role in low-cost NAS on mobile devices.
引用
收藏
页码:1954 / 1967
页数:14
相关论文
共 50 条
  • [41] FOX-NAS: Fast, On-device and Explainable Neural Architecture Search
    Liu, Chia-Hsiang
    Han, Yu-Shin
    Sung, Yuan-Yao
    Lee, Yi
    Chiang, Hung-Yueh
    Wu, Kai-Chiang
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 789 - 797
  • [42] IoT Device Lifecycle - a Generic Model and a use case for Cellular Mobile Networks
    Soos, Gabor
    Kozma, Daniel
    Janky, Ferenc Nandor
    Varga, Pal
    2018 IEEE 6TH INTERNATIONAL CONFERENCE ON FUTURE INTERNET OF THINGS AND CLOUD (FICLOUD 2018), 2018, : 176 - 183
  • [43] Enhancing Model Parallelism in Neural Architecture Search for Multidevice System
    Fu, Cheng
    Chen, Huili
    Yang, Zhenheng
    Koushanfar, Farinaz
    Tian, Yuandong
    Zhao, Jishen
    IEEE MICRO, 2020, 40 (05) : 46 - 54
  • [44] Designing resource-constrained neural networks using neural architecture search targeting embedded devices
    Cassimon, Amber
    Vanneste, Simon
    Bosmans, Stig
    Mercelis, Siegfried
    Hellinckx, Peter
    INTERNET OF THINGS, 2020, 12
  • [45] Search for predictive generic model of aqueous solubility using Bayesian neural nets
    Bruneau, P
    JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2001, 41 (06): : 1605 - 1616
  • [46] MicroNAS for memory and latency constrained hardware aware neural architecture search in time series classification on microcontrollers
    King, Tobias
    Zhou, Yexu
    Roeddiger, Tobias
    Beigl, Michael
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [47] One-Shot Sparse Neural Architecture Search for Resource-Constrained Devices
    Song, Shenghui
    Zaech, Jan-Nico
    Heo, Seonyeong
    2024 IEEE 30TH INTERNATIONAL CONFERENCE ON EMBEDDED AND REAL-TIME COMPUTING SYSTEMS AND APPLICATIONS, RTCSA 2024, 2024, : 132 - 133
  • [48] A novel method based on neural architecture search for Diptera insect classification on embedded devices
    Chen, Yantong
    Bian, Shichang
    Liu, Yang
    Zhang, Zhongling
    ECOLOGICAL INFORMATICS, 2022, 71
  • [49] Constrained deep neural network architecture search for IoT devices accounting for hardware calibration
    Scheidegger, Florian
    Benini, Luca
    Bekas, Costas
    Malossi, Cristiano
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [50] Toward Tailored Models on Private AIoT Devices: Federated Direct Neural Architecture Search
    Zhang, Chunhui
    Yuan, Xiaoming
    Zhang, Qianyun
    Zhu, Guangxu
    Cheng, Lei
    Zhang, Ning
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (18): : 17309 - 17322