HGNAS: <underline>H</underline>ardware-Aware <underline>G</underline>raph <underline>N</underline>eural <underline>A</underline>rchitecture <underline>S</underline>earch for Edge Devices

被引:0
|
作者
Zhou, Ao [1 ]
Yang, Jianlei [1 ]
Qi, Yingjie [1 ]
Qiao, Tong [1 ]
Shi, Yumeng [1 ]
Duan, Cenlin [2 ]
Zhao, Weisheng [2 ]
Hu, Chunming [1 ]
机构
[1] Beihang Univ, Sch Comp Sci & Engn, Beijing 100191, Peoples R China
[2] Beihang Univ, Sch Integrated Circuits & Engn, Beijing 100191, Peoples R China
基金
中国国家自然科学基金;
关键词
Graph neural networks; Aggregates; Hardware; Accuracy; Performance evaluation; Point cloud compression; Computer architecture; hardware-aware neural architecture search; edge devices; hardware efficiency prediction;
D O I
10.1109/TC.2024.3449108
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Graph Neural Networks (GNNs) are becoming increasingly popular for graph-based learning tasks such as point cloud processing due to their state-of-the-art (SOTA) performance. Nevertheless, the research community has primarily focused on improving model expressiveness, lacking consideration of how to design efficient GNN models for edge scenarios with real-time requirements and limited resources. Examining existing GNN models reveals varied execution across platforms and frequent Out-Of-Memory (OOM) problems, highlighting the need for hardware-aware GNN design. To address this challenge, this work proposes a novel hardware-aware graph neural architecture search framework tailored for resource constraint edge devices, namely HGNAS. To achieve hardware awareness, HGNAS integrates an efficient GNN hardware performance predictor that evaluates the latency and peak memory usage of GNNs in milliseconds. Meanwhile, we study GNN memory usage during inference and offer a peak memory estimation method, enhancing the robustness of architecture evaluations when combined with predictor outcomes. Furthermore, HGNAS constructs a fine-grained design space to enable the exploration of extreme performance architectures by decoupling the GNN paradigm. In addition, the multi-stage hierarchical search strategy is leveraged to facilitate the navigation of huge candidates, which can reduce the single search time to a few GPU hours. To the best of our knowledge, HGNAS is the first automated GNN design framework for edge devices, and also the first work to achieve hardware awareness of GNNs across different platforms. Extensive experiments across various applications and edge devices have proven the superiority of HGNAS. It can achieve up to a 10.6x speedup and an 82.5% peak memory reduction with negligible accuracy loss compared to DGCNN on ModelNet40.
引用
收藏
页码:2693 / 2707
页数:15
相关论文
共 50 条
  • [1] ViTeGNN: Towards <underline>V</underline>ersatile <underline>I</underline>nference of <underline>Te</underline>mporal <underline>G</underline>raph <underline>N</underline>eural <underline>N</underline>etworks on FPGA
    Zhou, Hongkuan
    Zhang, Bingyi
    Kannan, Rajgopal
    Busart, Carl
    Prasanna, Viktor K.
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2025, 36 (03) : 502 - 519
  • [2] LITE-SNN: <underline>L</underline>everaging <underline>I</underline>nherent Dynamics to <underline>T</underline>rain <underline>E</underline>nergy-Efficient <underline>S</underline>piking <underline>N</underline>eural <underline>N</underline>etworks for Sequential Learning
    Rathi, Nitin
    Roy, Kaushik
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2024, 16 (06) : 1905 - 1914
  • [3] <underline>P</underline>hysics-<underline>I</underline>nformed <underline>N</underline>eural <underline>O</underline>DE with <underline>H</underline>eterogeneous control <underline>I</underline>nputs (PINOHI) for quality prediction of composite adhesive joints
    Wang, Yifeng
    Mou, Shancong
    Shi, Jianjun
    Zhang, Chuck
    IISE TRANSACTIONS, 2024,
  • [4] SIMPNet: <underline>S</underline>patial-<underline>I</underline>nformed <underline>M</underline>otion <underline>P</underline>lanning <underline>Net</underline>work
    Soleymanzadeh, Davood
    Liang, Xiao
    Zheng, Minghui
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (03): : 2870 - 2877
  • [5] AUDIT: Function<underline>a</underline>l Q<underline>u</underline>alification in A<underline>d</underline>ditive Manufacturing Via Physical and Dig<underline>i</underline>tal <underline>T</underline>wins
    Biehler, Michael
    Mock, Reinaldo
    Kode, Shriyanshu
    Mehmood, Maham
    Bhardwaj, Palin
    Shi, Jianjun
    JOURNAL OF MANUFACTURING SCIENCE AND ENGINEERING-TRANSACTIONS OF THE ASME, 2024, 146 (02):
  • [6] dbAPIS: a database of <underline>a</underline>nti-<underline>p</underline>rokaryotic <underline>i</underline>mmune <underline>s</underline>ystem genes
    Yan, Yuchen
    Zheng, Jinfang
    Zhang, Xinpeng
    Yin, Yanbin
    NUCLEIC ACIDS RESEARCH, 2023, 52 (D1) : D419 - D425
  • [7] Patient-<underline>Selection</underline> of a Clinical Trial Primary <underline>Outcome</underline>: The ENHANCE-AF <underline>Outcomes</underline> <underline>Survey</underline>
    Stafford, Randall S.
    Rice, Eli N.
    Shah, Rushil
    Hills, Mellanie T.
    Nunes, Julio C.
    Desutter, Katie
    Lin, Amy
    Lhamo, Karma
    Lin, Bryant
    Lu, Ying
    Wang, Paul J.
    PLOS ONE, 2025, 20 (03):
  • [8] Esale: <underline>E</underline>nhancing Code-<underline>S</underline>ummary <underline>A</underline>lignment <underline>Le</underline>arning for Source Code Summarization
    Fang, Chunrong
    Sun, Weisong
    Chen, Yuchen
    Chen, Xiao
    Wei, Zhao
    Zhang, Quanjun
    You, Yudu
    Luo, Bin
    Liu, Yang
    Chen, Zhenyu
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2024, 50 (08) : 2077 - 2095
  • [9] L-NORM: <underline>L</underline>earning and <underline>N</underline>etwork <underline>O</underline>rchestration at the Edge for <underline>R</underline>obot Connectivity and <underline>M</underline>obility in Factory Floor Environments
    Mohanti, Subhramoy
    Roy, Debashri
    Eisen, Mark
    Cavalcanti, Dave
    Chowdhury, Kaushik
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (04) : 2898 - 2914
  • [10] HEARTS Study Protocol: <underline>H</underline>elping <underline>E</underline>nable <underline>A</underline>ccess and <underline>R</underline>emove Barriers <underline>T</underline>o <underline>S</underline>upport for Young Adults with Mental Health-Related Disabilities
    Rao, Sandy
    Dimitropoulos, Gina
    Milaney, Katrina
    Eurich, Dean T.
    Patten, Scott B.
    YOUTH, 2024, 4 (01): : 107 - 123