HGNAS: <underline>H</underline>ardware-Aware <underline>G</underline>raph <underline>N</underline>eural <underline>A</underline>rchitecture <underline>S</underline>earch for Edge Devices

被引:0
|
作者
Zhou, Ao [1 ]
Yang, Jianlei [1 ]
Qi, Yingjie [1 ]
Qiao, Tong [1 ]
Shi, Yumeng [1 ]
Duan, Cenlin [2 ]
Zhao, Weisheng [2 ]
Hu, Chunming [1 ]
机构
[1] Beihang Univ, Sch Comp Sci & Engn, Beijing 100191, Peoples R China
[2] Beihang Univ, Sch Integrated Circuits & Engn, Beijing 100191, Peoples R China
基金
中国国家自然科学基金;
关键词
Graph neural networks; Aggregates; Hardware; Accuracy; Performance evaluation; Point cloud compression; Computer architecture; hardware-aware neural architecture search; edge devices; hardware efficiency prediction;
D O I
10.1109/TC.2024.3449108
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Graph Neural Networks (GNNs) are becoming increasingly popular for graph-based learning tasks such as point cloud processing due to their state-of-the-art (SOTA) performance. Nevertheless, the research community has primarily focused on improving model expressiveness, lacking consideration of how to design efficient GNN models for edge scenarios with real-time requirements and limited resources. Examining existing GNN models reveals varied execution across platforms and frequent Out-Of-Memory (OOM) problems, highlighting the need for hardware-aware GNN design. To address this challenge, this work proposes a novel hardware-aware graph neural architecture search framework tailored for resource constraint edge devices, namely HGNAS. To achieve hardware awareness, HGNAS integrates an efficient GNN hardware performance predictor that evaluates the latency and peak memory usage of GNNs in milliseconds. Meanwhile, we study GNN memory usage during inference and offer a peak memory estimation method, enhancing the robustness of architecture evaluations when combined with predictor outcomes. Furthermore, HGNAS constructs a fine-grained design space to enable the exploration of extreme performance architectures by decoupling the GNN paradigm. In addition, the multi-stage hierarchical search strategy is leveraged to facilitate the navigation of huge candidates, which can reduce the single search time to a few GPU hours. To the best of our knowledge, HGNAS is the first automated GNN design framework for edge devices, and also the first work to achieve hardware awareness of GNNs across different platforms. Extensive experiments across various applications and edge devices have proven the superiority of HGNAS. It can achieve up to a 10.6x speedup and an 82.5% peak memory reduction with negligible accuracy loss compared to DGCNN on ModelNet40.
引用
收藏
页码:2693 / 2707
页数:15
相关论文
共 50 条
  • [31] HiMAL: Multimodal <underline>Hi</underline>erarchical <underline>M</underline>ulti-task <underline>A</underline>uxiliary <underline>L</underline>earning framework for predicting Alzheimer's disease progression
    Kumar, Sayantan
    Yu, Sean C.
    Michelson, Andrew
    Kannampallil, Thomas
    Payne, Philip R. O.
    JAMIA OPEN, 2024, 7 (03)
  • [32] CBANA: A Lightweight, Efficient, and Flexible <underline>C</underline>ache <underline>B</underline>ehavior <underline>Ana</underline>lysis Framework
    Hu, Qilin
    Ding, Yan
    Liu, Chubo
    Li, Keqin
    Li, Kenli
    Zomaya, Albert Y.
    IEEE TRANSACTIONS ON COMPUTERS, 2024, 73 (09) : 2262 - 2274
  • [33] Low dose <underline>T</underline>am<underline>O</underline>xifen and <underline>L</underline>ifestyl<underline>E</underline> changes for b<underline>R</underline>east c<underline>AN</underline>cer prevention (TOLERANT study): Study protocol of a randomized phase II biomarker trial in women at increased risk for breast cancer
    Guerrieri-Gonzaga, Aliana
    Serrano, Davide
    Gnagnarella, Patrizia
    Johansson, Harriet
    Zovato, Stefania
    Nardi, Mariateresa
    Pensabene, Matilde
    Buccolo, Simona
    DeCensi, Andrea
    Briata, Irene Maria
    Pistelli, Luigi
    Sansone, Clementina
    Mannucci, Sara
    Aristarco, Valentina
    Macis, Debora
    Lazzeroni, Matteo
    Aurilio, Gaetano
    Accornero, Chiara Arianna
    Gandini, Sara
    Bonanni, Bernardo
    PLOS ONE, 2024, 19 (09):
  • [34] <underline>Su</underline>stained inflation and chest comp<underline>r</underline>ession <underline>v</underline>ersus 3:<underline>1</underline> chest compression to <underline>v</underline>entilation ratio during cardiopulmonary resuscitation of asphyxiated n<underline>e</underline>wborns (SURV1VE): A cluster randomised controlled trial
    Schmolzer, Georg M.
    Pichler, Gerhard
    Solevag, Anne Lee
    Law, Brenda Hiu Yan
    Mitra, Souvik
    Wagner, Michael
    Pfurtscheller, Daniel
    Yaskina, Maryna
    Cheung, Po-Yin
    ARCHIVES OF DISEASE IN CHILDHOOD-FETAL AND NEONATAL EDITION, 2024, 109 (04): : 428 - 435
  • [35] <underline>A</underline>I-<underline>T</underline>echniques <underline>L</underline>oss-Based <underline>A</underline>lgorithm for <underline>S</underline>everity Classification (ATLAS): a novel approach for continuous quantification of exertional symptoms during incremental exercise testing
    Hijleh, Abed A.
    Wang, Sophia
    Berton, Danilo C.
    Neder-Serafini, Igor
    Vincent, Sandra
    James, Matthew
    Domnik, Nicolle
    Phillips, Devin
    Nery, Luiz E.
    O'Donnell, Denis E.
    Neder, J. Alberto
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2025,
  • [36] BuDDI: <underline>Bu</underline>lk <underline>D</underline>econvolution with <underline>D</underline>omain <underline>I</underline>nvariance to predict cell-type-specific perturbations from bulk
    Davidson, Natalie R.
    Zhang, Fan
    Greene, Casey S.
    PLOS COMPUTATIONAL BIOLOGY, 2025, 21 (01)
  • [37] <sc>CLAPnq</sc>: <underline>C</underline>ohesive <underline>L</underline>ong-form <underline>A</underline>nswers from <underline>P</underline>assages in Natural Questions for RAG systems
    Rosenthal, Sara
    Sil, Avirup
    Florian, Radu
    Roukos, Salim
    TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2025, 13 : 53 - 72
  • [38] Feasibility and accuracy of Dire<underline>C</underline>t <underline>L</underline>ung <underline>U</underline>ltrasound <underline>E</underline>valuation technique to monitor extravascular lung water in porcine lungs
    Buttar, Sana N.
    Moller-Sorensen, Hasse
    Perch, Michael
    Petersen, Rene H.
    Moller, Christian H.
    EUROPEAN JOURNAL OF CARDIO-THORACIC SURGERY, 2025, 67 (01)
  • [39] TKDP: <underline>T</underline>hreefold <underline>K</underline>nowledge-Enriched <underline>D</underline>eep <underline>P</underline>rompt Tuning for Few-Shot Named Entity Recognition
    Liu, Jiang
    Fei, Hao
    Li, Fei
    Li, Jingye
    Li, Bobo
    Zhao, Liang
    Teng, Chong
    Ji, Donghong
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (11) : 6397 - 6409
  • [40] <underline>P</underline>r<underline>E</underline>vention of posttraumatic contractu<underline>R</underline>es with <underline>K</underline>etotifen 2 (PERK 2) - protocol for a multicenter randomized clinical trial
    Ademola, Ayoola
    Hildebrand, Kevin A.
    Schneider, Prism S.
    Mohtadi, Nicholas G. H.
    White, Neil J.
    Bosse, Michael J.
    Garven, Alexandra
    Walker, Richard E. A.
    Sajobi, Tolulope T.
    BMC MUSCULOSKELETAL DISORDERS, 2020, 21 (01)