Dataflow optimization with layer-wise design variables estimation method for enflame CNN accelerators

被引:0
|
作者
Chen, Tian [1 ]
Tan, Yu -an [1 ]
Zhang, Zheng [1 ]
Luo, Nan [1 ]
Li, Bin [2 ]
Li, Yuanzhang [1 ]
机构
[1] Beijing Inst Technol, Beijing 100081, Peoples R China
[2] Shanghai Enflame Technol Co Ltd, Shanghai 201203, Peoples R China
基金
中国国家自然科学基金;
关键词
Convolutional neural networks (CNNs); General computing unit (GCU); Programmable dataflow; Optimization; FLEXIBLE ACCELERATOR;
D O I
10.1016/j.jpdc.2024.104869
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
As convolution layers have been proved to be the most time-consuming operation in convolutional neural network (CNN) algorithms, many efficient CNN accelerators have been designed to boost the performance of convolution operations. Previous works on CNN acceleration usually use fixed design variables for diverse convolutional layers, which would lead to inefficient data movements and low utilization of computing resource. We tackle this issue by proposing a flexible dataflow optimization method with design variables estimation for different layers. The optimization method first narrows the design space by the priori constraints, and then enumerates all legal solutions to select the optimal design variables. We demonstrate the effectiveness of the proposed optimization method by implementing representative CNN models (VGG-16, ResNet-18 and MobileNet V1) on Enflame Technology's programmable CNN accelerator, General Computing Unit (GCU). The results indicate that our optimization can significantly enhance the throughput of the convolution layers in ResNet, VGG and MobileNet on GCU, with improvement of up to 1.84x. Furthermore, it achieves up to 2.08x of GCU utilization specifically for the convolution layers of ResNet on GCU.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] LCRM: Layer-Wise Complexity Reduction Method for CNN Model Optimization on End Devices
    Hussain, Hanan
    Tamizharasan, P. S.
    Yadav, Praveen Kumar
    [J]. IEEE ACCESS, 2023, 11 : 66838 - 66857
  • [2] Layer-Wise Data-Free CNN Compression
    Horton, Maxwell
    Jin, Yanzi
    Farhadi, Ali
    Rastegari, Mohammad
    [J]. 2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 2019 - 2026
  • [3] Dynamic concurrent topology optimization and design for layer-wise graded structures
    Liu, Baoshou
    Wang, Xiaoming
    Zhuang, Zhuo
    Cui, Yinan
    [J]. COMPOSITE STRUCTURES, 2023, 319
  • [4] Improved layer-wise optimization algorithm for the design of viscoelastic composite structures
    Akoussan, Komlan
    Hamdaoui, Mohamed
    Daya, El Mostafa
    [J]. COMPOSITE STRUCTURES, 2017, 176 : 342 - 358
  • [5] Layer-Wise Mixed-Modes CNN Processing Architecture With Double-Stationary Dataflow and Dimension-Reshape Strategy
    Liu, Bo
    Huang, Xinxiang
    Zhang, Yang
    Yang, Guang
    Yan, Han
    Zhang, Chen
    Li, Zejv
    Wang, Yuanhao
    Cai, Hao
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2024, : 4652 - 4664
  • [6] TOPOLOGICAL OPTIMIZATION DESIGN METHOD OF LAYER-WISE GRADED LATTICE STRUCTURES WITH HIGH LOAD-BEARING
    Huang K.
    Ding Z.
    Zhang Y.
    Li X.
    [J]. Lixue Xuebao/Chinese Journal of Theoretical and Applied Mechanics, 2023, 55 (02): : 433 - 444
  • [7] A layer-wise optimization method for the optimal stacking sequence design of symmetric VSCL beams in air and in water
    Meskine, Abdennacer
    Houmat, Abderrahim
    Guenanou, Ahmed
    [J]. MECHANICS OF ADVANCED MATERIALS AND STRUCTURES, 2022, 29 (03) : 350 - 361
  • [8] Topology optimization for concurrent design of layer-wise graded lattice materials and structures
    Zhang, Huikai
    Wang, Yaguang
    Kang, Zhan
    [J]. INTERNATIONAL JOURNAL OF ENGINEERING SCIENCE, 2019, 138 : 26 - 49
  • [9] Layer-Wise Invertibility for Extreme Memory Cost Reduction of CNN Training
    Hascoet, Tristan
    Febvre, Quentin
    Zhuang, Weihao
    Ariki, Yasuo
    Takiguchi, Tetusya
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 2049 - 2052
  • [10] Explaining CNN and RNN Using Selective Layer-Wise Relevance Propagation
    Jung, Yeon-Jee
    Han, Seung-Ho
    Choi, Ho-Jin
    [J]. IEEE ACCESS, 2021, 9 : 18670 - 18681