Weightless Neural Networks for Efficient Edge Inference

被引:4
|
作者
Susskind, Zachary [1 ]
Arora, Aman [1 ]
Miranda, Igor D. S. [2 ]
Villon, Luis A. Q. [3 ]
Katopodis, Rafael F. [3 ]
de Araujo, Leandro S. [4 ]
Dutra, Diego L. C. [3 ]
Lima, Priscila M. V. [3 ]
Franca, Felipe M. G. [3 ,5 ]
Breternitz, Mauricio, Jr. [6 ]
John, Lizy K. [1 ]
机构
[1] Univ Texas Austin, Austin, TX 78712 USA
[2] Univ Fed Reconcavo Bahia, Cruz Das Almas, BA, Brazil
[3] Univ Fed Rio de Janeiro, Rio De Janeiro, RJ, Brazil
[4] Fluminense Fed Univ, Niteroi, RJ, Brazil
[5] Inst Telecomunicacoes, Porto, Portugal
[6] ISCTE Inst Univ Lisboa, Lisbon, Portugal
基金
美国国家科学基金会;
关键词
Weightless Neural Networks; WNN; WiSARD; Neural Networks; Hardware Acceleration; Inference; Edge Computing; ALGORITHM;
D O I
10.1145/3559009.3569680
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Weightless neural networks (WNNs) are a class of machine learning model which use table lookups to perform inference, rather than the multiply-accumulate operations typical of deep neural networks (DNNs). Individual weightless neurons are capable of learning non-linear functions of their inputs, a theoretical advantage over the linear neurons in DNNs, yet state-of-the-art WNN architectures still lag behind DNNs in accuracy on common classification tasks. Additionally, many existing WNN architectures suffer from high memory requirements, hindering implementation. In this paper, we propose a novel WNN architecture, BTHOWeN, with key algorithmic and architectural improvements over prior work, namely counting Bloom filters, hardware-friendly hashing, and Gaussian-based nonlinear thermometer encodings. These enhancements improve model accuracy while reducing size and energy per inference. BTHOWeN targets the large and growing edge computing sector by providing superior latency and energy efficiency to both prior WNNs and comparable quantized DNNs. Compared to state-of-the-art WNNs across nine classification datasets, BTHOWeN on average reduces error by more than 40% and model size by more than 50%. We demonstrate the viability of a hardware implementation of BTHOWeN by presenting an FPGA-based inference accelerator, and compare its latency and resource usage against similarly accurate quantized DNN inference accelerators, including multi-layer perceptron (MLP) and convolutional models. The proposed BTHOWeN models consume almost 80% less energy than the MLP models, with nearly 85% reduction in latency. In our quest for efficient ML on the edge, WNNs are clearly deserving of additional attention.
引用
收藏
页码:279 / 290
页数:12
相关论文
共 50 条
  • [1] OnceNAS: Discovering efficient on-device inference neural networks for edge devices
    Zhang, Yusen
    Qin, Yunchuan
    Zhang, Yufeng
    Zhou, Xu
    Jian, Songlei
    Tan, Yusong
    Li, Kenli
    [J]. INFORMATION SCIENCES, 2024, 669
  • [2] Scaling for edge inference of deep neural networks
    Xu, Xiaowei
    Ding, Yukun
    Hu, Sharon Xiaobo
    Niemier, Michael
    Cong, Jason
    Hu, Yu
    Shi, Yiyu
    [J]. NATURE ELECTRONICS, 2018, 1 (04): : 216 - 222
  • [3] Scaling for edge inference of deep neural networks
    Xiaowei Xu
    Yukun Ding
    Sharon Xiaobo Hu
    Michael Niemier
    Jason Cong
    Yu Hu
    Yiyu Shi
    [J]. Nature Electronics, 2018, 1 : 216 - 222
  • [4] Adaptive Neural Networks for Efficient Inference
    Bolukbasi, Tolga
    Wang, Joseph
    Dekel, Ofer
    Saligrama, Venkatesh
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [5] Change Detection with Weightless Neural Networks
    De Gregorio, Massimo
    Giordano, Maurizio
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2014, : 409 - +
  • [6] Ontology Alignment with Weightless Neural Networks
    Viana, Thais
    Delgado, Carla
    da Silva, Joao C. P.
    Lima, Priscila
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, PT II, 2017, 10614 : 376 - 384
  • [7] Weightless neural networks for face recognition
    Lauria, S
    Mitchell, R
    [J]. NEURAL NETS - WIRN VIETRI-99, 1999, : 206 - 212
  • [8] Background Modeling by Weightless Neural Networks
    De Gregorio, Massimo
    Giordano, Maurizio
    [J]. NEW TRENDS IN IMAGE ANALYSIS AND PROCESSING - ICIAP 2015 WORKSHOPS, 2015, 9281 : 493 - 501
  • [9] Background estimation by weightless neural networks
    De Gregorio, Massimo
    Giordano, Maurizio
    [J]. PATTERN RECOGNITION LETTERS, 2017, 96 : 55 - 65
  • [10] Efficient neural networks for edge devices
    Liu, Shiya
    Ha, Dong Sam
    Shen, Fangyang
    Yi, Yang
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2021, 92