HAPI: Hardware-Aware Progressive Inference

被引:23
|
作者
Laskaridis, Stefanos [1 ]
Venieris, Stylianos, I [1 ]
Kim, Hyeji [1 ]
Lane, Nicholas D. [1 ,2 ]
机构
[1] Samsung AI Ctr, Cambridge, England
[2] Univ Cambridge, Cambridge, England
关键词
MULTIOBJECTIVE OPTIMIZATION;
D O I
10.1145/3400302.3415698
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Convolutional neural networks (CNNs) have recently become the state-of-the-art in a diversity of AI tasks. Despite their popularity, CNN inference still comes at a high computational cost. A growing body of work aims to alleviate this by exploiting the difference in the classification difficulty among samples and early-exiting at different stages of the network. Nevertheless, existing studies on early exiting have primarily focused on the training scheme, without considering the use-case requirements or the deployment platform. This work presents HAPI, a novel methodology for generating high-performance early-exit networks by co-optimising the placement of intermediate exits together with the early-exit strategy at inference time. Furthermore, we propose an efficient design space exploration algorithm which enables the faster traversal of a large number of alternative architectures and generates the highest-performing design, tailored to the use-case requirements and target hardware. Quantitative evaluation shows that our system consistently outperforms alternative search mechanisms and state-of-the-art early-exit schemes across various latency budgets. Moreover, it pushes further the performance of highly optimised hand-crafted early-exit CNNs, delivering up to 5.11x speedup over lightweight models on imposed latency-driven SLAs for embedded devices.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] MOSS-DB: A Hardware-Aware OLAP Database
    Zhang, Yansong
    Hu, Wei
    Wang, Shan
    WEB-AGE INFORMATION MANAGEMENT, PROCEEDINGS, 2010, 6184 : 582 - 594
  • [22] Hardware-aware approach to deep neural network optimization
    Li, Hengyi
    Meng, Lin
    NEUROCOMPUTING, 2023, 559
  • [23] Hardware-aware Moving Objects Detection in Satellite Image
    Lee, Pei-Jun
    Chiu, Zheng-Kai
    Liu, Kuang-Zhe
    Lin, Albert
    Chen, Chia-Ray
    2018 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS-TAIWAN (ICCE-TW), 2018,
  • [24] HARDWARE-AWARE MODEL OPTIMIZATION TOOL FOR EMBEDDED DEVICES
    Ozcinar, Cagri
    Kim, Dongsung
    Duckworth, Benjamin Rufus
    Joya, Shayan
    Di Perto, Nicolas Scotto
    Dusnoki, Attila
    Fabo, Marko
    Vince, Daniel
    Loki, Gabor
    Kiss, Akos
    Alder, Christopher
    2021 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2021,
  • [25] Hardware-Aware Softmax Approximation for Deep Neural Networks
    Geng, Xue
    Lin, Jie
    Zhao, Bin
    Kong, Anmin
    Aly, Mohamed M. Sabry
    Chandrasekhar, Vijay
    COMPUTER VISION - ACCV 2018, PT IV, 2019, 11364 : 107 - 122
  • [26] Evolution of Hardware-Aware Neural Architecture Search on the Edge
    Richey, Blake
    Clay, Mitchell
    Grecos, Christos
    Shirvaikar, Mukul
    REAL-TIME IMAGE PROCESSING AND DEEP LEARNING 2023, 2023, 12528
  • [27] Hardware-Aware Quantization for Multiplierless Neural Network Controllers
    Habermann, Tobias
    Kuehle, Jonas
    Kumm, Martin
    Volkova, Anastasia
    2022 IEEE ASIA PACIFIC CONFERENCE ON CIRCUITS AND SYSTEMS, APCCAS, 2022, : 541 - 545
  • [28] Hardware-Aware and Efficient Feature Fusion Network Search
    Guo J.-M.
    Zhang R.
    Zhi T.
    He D.-Y.
    Huang D.
    Chang M.
    Zhang X.-S.
    Guo Q.
    Jisuanji Xuebao/Chinese Journal of Computers, 2022, 45 (11): : 2420 - 2432
  • [29] A Hardware-Aware Heuristic for the Qubit Mapping Problem in the NISQ Era
    Niu S.
    Suau A.
    Staffelbach G.
    Todri-Sanial A.
    IEEE Transactions on Quantum Engineering, 2020, 1
  • [30] Hardware-Aware Sum-Product Decoding in the Decision Domain
    Yamada, Mizuki
    Takeuchi, Keigo
    Koike, Kiyoyuki
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2019, E102A (12) : 1980 - 1987