Multihardware Adaptive Latency Prediction for Neural Architecture Search

被引:0
|
作者
Lin, Chengmin [1 ]
Yang, Pengfei [1 ]
Wang, Quan [1 ]
Guo, Yitong [1 ]
Wang, Zhenyi [1 ]
机构
[1] Xidian Univ, Sch Comp Sci & Technol, Xian 710126, Peoples R China
来源
IEEE INTERNET OF THINGS JOURNAL | 2025年 / 12卷 / 03期
关键词
Hardware; Predictive models; Adaptation models; Training; Accuracy; Network architecture; Computer architecture; Optimization; Performance evaluation; Data models; Dynamic sample allocation; few-shot learning; hardware-aware; latency predictor; neural architecture search (NAS); representative sample sampling; NETWORKS;
D O I
10.1109/JIOT.2024.3480990
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In hardware-aware neural architecture search (NAS), accurately assessing a model's inference efficiency is crucial for search optimization. Traditional approaches, which measure numerous samples to train proxy models, are impractical across varied platforms due to the extensive resources needed to remeasure and rebuild models for each platform. To address this challenge, we propose a multihardware-aware NAS method that enhances the generalizability of proxy models across different platforms while reducing the required sample size. Our method introduces a multihardware adaptive latency prediction (MHLP) model that leverages one-hot encoding for hardware parameters and multihead attention mechanisms to effectively capture the intricate interplay between hardware attributes and network architecture features. Additionally, we implement a two-stage sampling mechanism based on probability density weighting to ensure the representativeness and diversity of the sample set. By adopting a dynamic sample allocation mechanism, our method can adjust the adaptive sample size according to the initial model state, providing stronger data support for devices with significant deviations. Evaluations on NAS benchmarks demonstrate the MHLP predictor's excellent generalization accuracy using only 10 samples, guiding the NAS search process to identify optimal network architectures.
引用
收藏
页码:3385 / 3398
页数:14
相关论文
共 50 条
  • [21] Traffic Spatial-Temporal Prediction Based on Neural Architecture Search
    Zhang, Dongran
    Luo, Gang
    Li, Jun
    PROCEEDINGS OF 2023 18TH INTERNATIONAL SYMPOSIUM ON SPATIAL AND TEMPORAL DATA, SSTD 2023, 2023, : 21 - 30
  • [22] Accelerating Evolutionary Neural Architecture Search for Remaining Useful Life Prediction
    Mo, Hyunho
    Iacca, Giovanni
    BIOINSPIRED OPTIMIZATION METHODS AND THEIR APPLICATIONS, 2022, 13627 : 15 - 30
  • [23] AutoST: Efficient Neural Architecture Search for Spatio-Temporal Prediction
    Li, Ting
    Zhang, Junbo
    Bao, Kainan
    Liang, Yuxuan
    Li, Yexin
    Zheng, Yu
    KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 794 - 802
  • [24] Efficient graph neural architecture search using Monte Carlo Tree search and prediction network
    Deng, TianJin
    Wu, Jia
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 213
  • [25] SaDENAS: A self-adaptive differential evolution algorithm for neural architecture search
    Han, Xiaolong
    Xue, Yu
    Wang, Zehong
    Zhang, Yong
    Muravev, Anton
    Gabbouj, Moncef
    SWARM AND EVOLUTIONARY COMPUTATION, 2024, 91
  • [26] AdaBERT: Task-Adaptive BERT Compression with Differentiable Neural Architecture Search
    Chen, Daoyuan
    Li, Yaliang
    Qiu, Minghui
    Wang, Zhen
    Li, Bofang
    Ding, Bolin
    Deng, Hongbo
    Huang, Jun
    Lin, Wei
    Zhou, Jingren
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 2463 - 2469
  • [27] Learn Basic Skills and Reuse: Modularized Adaptive Neural Architecture Search (MANAS)
    Chen, Hanxiong
    Li, Yunqi
    Zhu, He
    Zhang, Yongfeng
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 169 - 179
  • [28] Cascaded Multi-task Adaptive Learning Based on Neural Architecture Search
    Gao, Yingying
    Zhang, Shilei
    Cui, Zihao
    Deng, Chao
    Feng, Junlan
    INTERSPEECH 2023, 2023, : 246 - 250
  • [29] An Adaptive Neural Architecture Search Design for Collaborative Edge-Cloud Computing
    Lu, Haodong
    Du, Miao
    He, Xiaoming
    Qian, Kai
    Chen, Jianli
    Sun, Yanfei
    Wang, Kun
    IEEE NETWORK, 2021, 35 (05): : 83 - 89
  • [30] A Self-Adaptive Mutation Neural Architecture Search Algorithm Based on Blocks
    Xue, Yu
    Wang, Yankang
    Liang, Jiayu
    Slowik, Adam
    IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE, 2021, 16 (03) : 67 - 78