Multihardware Adaptive Latency Prediction for Neural Architecture Search

被引：0

作者：

Lin, Chengmin ^{[1
]}

Yang, Pengfei ^{[1
]}

Wang, Quan ^{[1
]}

Guo, Yitong ^{[1
]}

Wang, Zhenyi ^{[1
]}

机构：

[1] Xidian Univ, Sch Comp Sci & Technol, Xian 710126, Peoples R China

来源：

IEEE INTERNET OF THINGS JOURNAL | 2025年 / 12卷 / 03期

关键词：

Hardware; Predictive models; Adaptation models; Training; Accuracy; Network architecture; Computer architecture; Optimization; Performance evaluation; Data models; Dynamic sample allocation; few-shot learning; hardware-aware; latency predictor; neural architecture search (NAS); representative sample sampling; NETWORKS;

D O I：

10.1109/JIOT.2024.3480990

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In hardware-aware neural architecture search (NAS), accurately assessing a model's inference efficiency is crucial for search optimization. Traditional approaches, which measure numerous samples to train proxy models, are impractical across varied platforms due to the extensive resources needed to remeasure and rebuild models for each platform. To address this challenge, we propose a multihardware-aware NAS method that enhances the generalizability of proxy models across different platforms while reducing the required sample size. Our method introduces a multihardware adaptive latency prediction (MHLP) model that leverages one-hot encoding for hardware parameters and multihead attention mechanisms to effectively capture the intricate interplay between hardware attributes and network architecture features. Additionally, we implement a two-stage sampling mechanism based on probability density weighting to ensure the representativeness and diversity of the sample set. By adopting a dynamic sample allocation mechanism, our method can adjust the adaptive sample size according to the initial model state, providing stronger data support for devices with significant deviations. Evaluations on NAS benchmarks demonstrate the MHLP predictor's excellent generalization accuracy using only 10 samples, guiding the NAS search process to identify optimal network architectures.

引用

页码：3385 / 3398

页数：14

共 50 条

[21] Traffic Spatial-Temporal Prediction Based on Neural Architecture Search
Zhang, Dongran
Luo, Gang
Li, Jun
PROCEEDINGS OF 2023 18TH INTERNATIONAL SYMPOSIUM ON SPATIAL AND TEMPORAL DATA, SSTD 2023, 2023, : 21 - 30
[22] Accelerating Evolutionary Neural Architecture Search for Remaining Useful Life Prediction
Mo, Hyunho
Iacca, Giovanni
BIOINSPIRED OPTIMIZATION METHODS AND THEIR APPLICATIONS, 2022, 13627 : 15 - 30
[23] AutoST: Efficient Neural Architecture Search for Spatio-Temporal Prediction
Li, Ting
Zhang, Junbo
Bao, Kainan
Liang, Yuxuan
Li, Yexin
Zheng, Yu
KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 794 - 802
[24] Efficient graph neural architecture search using Monte Carlo Tree search and prediction network
Deng, TianJin
Wu, Jia
EXPERT SYSTEMS WITH APPLICATIONS, 2023, 213
[25] SaDENAS: A self-adaptive differential evolution algorithm for neural architecture search
Han, Xiaolong
Xue, Yu
Wang, Zehong
Zhang, Yong
Muravev, Anton
Gabbouj, Moncef
SWARM AND EVOLUTIONARY COMPUTATION, 2024, 91
[26] AdaBERT: Task-Adaptive BERT Compression with Differentiable Neural Architecture Search
Chen, Daoyuan
Li, Yaliang
Qiu, Minghui
Wang, Zhen
Li, Bofang
Ding, Bolin
Deng, Hongbo
Huang, Jun
Lin, Wei
Zhou, Jingren
PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 2463 - 2469
[27] Learn Basic Skills and Reuse: Modularized Adaptive Neural Architecture Search (MANAS)
Chen, Hanxiong
Li, Yunqi
Zhu, He
Zhang, Yongfeng
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 169 - 179
[28] Cascaded Multi-task Adaptive Learning Based on Neural Architecture Search
Gao, Yingying
Zhang, Shilei
Cui, Zihao
Deng, Chao
Feng, Junlan
INTERSPEECH 2023, 2023, : 246 - 250
[29] An Adaptive Neural Architecture Search Design for Collaborative Edge-Cloud Computing
Lu, Haodong
Du, Miao
He, Xiaoming
Qian, Kai
Chen, Jianli
Sun, Yanfei
Wang, Kun
IEEE NETWORK, 2021, 35 (05): : 83 - 89
[30] A Self-Adaptive Mutation Neural Architecture Search Algorithm Based on Blocks
Xue, Yu
Wang, Yankang
Liang, Jiayu
Slowik, Adam
IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE, 2021, 16 (03) : 67 - 78

← 1 2 3 4 5 →