On energy complexity of fully-connected layers

被引:0
|
作者
Sima, Jiri [1 ]
Cabessa, Jeremie [2 ]
Vidnerova, Petra [1 ]
机构
[1] Czech Acad Sci, Inst Comp Sci, Vodarenskou vezi 271-2, Prague 8, Czech Republic
[2] Univ Paris Saclay, Univ Versailles St Quentin UVSQ, DAVID Lab, 45 Ave Etats Unis, F-78035 Versailles, France
关键词
Deep neural networks; Convolutional neural networks; Fully-connected layer; Energy complexity; Energy consumption; Dataflow; NEURAL-NETWORKS;
D O I
10.1016/j.neunet.2024.106419
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The massive increase in the size of deep neural networks (DNNs) is accompanied by a significant increase in energy consumption of their hardware implementations which is critical for their widespread deployment in low -power mobile devices. In our previous work, an abstract hardware -independent model of energy complexity for convolutional neural networks (CNNs) has been proposed and experimentally validated. Based on this model, we provide a theoretical analysis of energy complexity related to the computation of a fullyconnected layer when its inputs, outputs, and weights are transferred between two kinds of memories (DRAM and Buffer). First, we establish a general lower bound on this energy complexity. Then, we present two dataflows and calculate their energy costs to achieve the corresponding upper bounds. In the case of a partitioned Buffer, we prove by the weak duality theorem from linear programming that the lower and upper bounds coincide up to an additive constant, and therefore establish the optimal energy complexity. Finally, the asymptotically optimal quadratic energy complexity of fully -connected layers is experimentally validated by estimating their energy consumption on the Simba and Eyeriss hardware.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Reaction-diffusion on the fully-connected lattice: A plus A → A
    Turban, Loic
    Fortin, Jean-Yves
    JOURNAL OF PHYSICS A-MATHEMATICAL AND THEORETICAL, 2018, 51 (14)
  • [22] A Fully-Connected Layered Model of Foreground and Background Flow
    Sun, Deqing
    Wulff, Jonas
    Sudderth, Erik B.
    Pfister, Hanspeter
    Black, Michael J.
    2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 2451 - 2458
  • [23] Active Reconfigurable Intelligent Surface: Fully-Connected or Sub-Connected?
    Liu, Kunzan
    Zhang, Zijian
    Dai, Linglong
    Xu, Shenheng
    Yang, Fan
    IEEE COMMUNICATIONS LETTERS, 2022, 26 (01) : 167 - 171
  • [24] Fully-connected semantic segmentation of hyperspectral and LiDAR data
    Aytaylan, Hakan
    Yuksel, Seniha Esen
    IET COMPUTER VISION, 2019, 13 (03) : 285 - 293
  • [25] Deep fully-connected networks for video compressive sensing
    Iliadis, Michael
    Spinoulas, Leonidas
    Katsaggelos, Aggelos K.
    DIGITAL SIGNAL PROCESSING, 2018, 72 : 9 - 18
  • [26] Fully-connected LSTM–CRF on medical concept extraction
    Jie Ji
    Bairui Chen
    Hongcheng Jiang
    International Journal of Machine Learning and Cybernetics, 2020, 11 : 1971 - 1979
  • [27] Storage capacity of a fully-connected parity machine with continuous weights
    Xiong, YS
    Kwon, C
    Oh, JH
    JOURNAL OF PHYSICS A-MATHEMATICAL AND GENERAL, 1998, 31 (34): : 7043 - 7049
  • [28] Fully-Connected Transformer for Multi-Source Image Fusion
    Wu, Xiao
    Cao, Zi-Han
    Huang, Ting-Zhu
    Deng, Liang-Jian
    Chanussot, Jocelyn
    Vivone, Gemine
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2025, 47 (03) : 2071 - 2088
  • [29] A fully-connected three-user quantum hyperentangled network
    Yiwen Huang
    Yuanhua Li
    Zhantong Qi
    Yilin Yang
    Yuanlin Zheng
    Xianfeng Chen
    Quantum Frontiers, 2 (1):
  • [30] Neural Window Fully-connected CRFs for Monocular Depth Estimation
    Yuan, Weihao
    Gu, Xiaodong
    Dai, Zuozhuo
    Zhu, Siyu
    Tan, Ping
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 3906 - 3915