ClassyNet: Class-Aware Early-Exit Neural Networks for Edge Devices

被引:4
|
作者
Ayyat, Mohammed [1 ]
Nadeem, Tamer [1 ]
Krawczyk, Bartosz [2 ]
机构
[1] Virginia Commonwealth Univ, Dept Comp Sci, Richmond, VA 23220 USA
[2] Rochester Inst Technol, Chester F Carlson Ctr Imaging Sci, Rochester, NY 14623 USA
来源
IEEE INTERNET OF THINGS JOURNAL | 2024年 / 11卷 / 09期
基金
美国国家科学基金会;
关键词
Class-aware classification; dynamic deep neural network (DNN); early-exit models; edge computing; on-device machine learning;
D O I
10.1109/JIOT.2023.3344120
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Edge-based and IoT devices have seen phenomenal growth in recent years, driven by the surge in demand for emerging applications that leverage machine learning models, such as deep neural networks (DNNs). However, a primary drawback of DNNs is their substantial storage/memory needs and high computational overhead, making their adoption in edge devices challenging. This limitation prompted the development of early-exit models like BranchyNet, which enable decisions to be made at earlier stages by incorporating dedicated exits within the architecture's inner layers. Nonetheless, these existing early-exit models lack control over the specific class that should exit and when. The necessity for such class-aware models is evident in numerous edge applications, where particular high-priority classes must be detected earlier due to their time-sensitive nature. In this article, we introduce ClassyNet, the first early-exit architecture designed to return only selected classes at each exit. This feature facilitates faster inference times for critical classes, allowing the initial layers to operate on edge devices. This strategy conserves considerable computational time and resources on the edge without compromising accuracy. Through extensive experiments, we show the effectiveness of ClassyNet compared to other models under various scenarios.
引用
收藏
页码:15113 / 15127
页数:15
相关论文
共 50 条
  • [41] HGNAS: Hardware-Aware Graph Neural Architecture Search for Edge Devices
    Zhou, Ao
    Yang, Jianlei
    Qi, Yingjie
    Qiao, Tong
    Shi, Yumeng
    Duan, Cenlin
    Zhao, Weisheng
    Hu, Chunming
    IEEE Transactions on Computers, 2024, 73 (12) : 2693 - 2707
  • [42] EDGE-AWARE SUPERPIXEL SEGMENTATION WITH UNSUPERVISED CONVOLUTIONAL NEURAL NETWORKS
    Yu, Yue
    Yang, Yang
    Liu, Kezhao
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 1504 - 1508
  • [43] Zero Time Waste: Recycling Predictions in Early Exit Neural Networks
    Wolczyk, Maciej
    Wojcik, Bartosz
    Balazy, Klaudia
    Podolak, Igor
    Tabor, Jacek
    Smieja, Marek
    Trzcinski, Tomasz
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021,
  • [44] SLEXNet: Adaptive Inference Using Slimmable Early Exit Neural Networks
    Kutukcu, Basar
    Baidya, Sabur
    Dey, Sujit
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2024, 23 (06)
  • [45] Using Channel-Wise Attention for Deep CNN Based Real-Time Semantic Segmentation With Class-Aware Edge Information
    Han, Hsiang-Yu
    Chen, Yu-Chi
    Hsiao, Pei-Yung
    Fu, Li-Chen
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (02) : 1041 - 1051
  • [46] Convergence of a Class of Delayed Neural Networks with Real Memristor Devices
    Di Marco, Mauro
    Forti, Mauro
    Moretti, Riccardo
    Pancioni, Luca
    Innocenti, Giacomo
    Tesi, Alberto
    MATHEMATICS, 2022, 10 (14)
  • [47] Networks-on-Chip based Deep Neural Networks Accelerators for IoT Edge Devices
    Ascia, Giuseppe
    Catania, Vincenzo
    Monteleone, Salvatore
    Palesi, Maurizio
    Patti, Davide
    Jose, John
    2019 SIXTH INTERNATIONAL CONFERENCE ON INTERNET OF THINGS: SYSTEMS, MANAGEMENT AND SECURITY (IOTSMS), 2019, : 227 - 234
  • [48] Neural Networks Meet Physical Networks: Distributed Inference Between Edge Devices and the Cloud
    Chinchali, Sandeep P.
    Cidon, Eyal
    Pergament, Evgenya
    Chu, Tianshu
    Katti, Sachin
    HOTNETS-XVII: PROCEEDINGS OF THE 2018 ACM WORKSHOP ON HOT TOPICS IN NETWORKS, 2018, : 50 - 56
  • [49] Characterizing the Execution of Deep Neural Networks on Collaborative Robots and Edge Devices
    Merck, Matthew L.
    Wang, Bingyao
    Liu, Lixing
    Jia, Chunjun
    Siqueira, Arthur
    Huang, Qiusen
    Saraha, Abhijeet
    Lim, Dongsuk
    Cao, Jiashen
    Hadidi, Ramyad
    Kim, Hyesoon
    PEARC '19: PROCEEDINGS OF THE PRACTICE AND EXPERIENCE IN ADVANCED RESEARCH COMPUTING ON RISE OF THE MACHINES (LEARNING), 2019,
  • [50] Distributed Deep Neural Networks over the Cloud, the Edge and End Devices
    Teerapittayanon, Surat
    McDanel, Bradley
    Kung, H. T.
    2017 IEEE 37TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2017), 2017, : 328 - 339