Accelerating CNN Algorithm with Fine-grained Dataflow Architectures

被引:3
|
作者
Xiang, Taoran [1 ,2 ]
Feng, Yujing [1 ]
Ye, Xiaochun [1 ]
Tan, Xu [1 ,2 ]
Li, Wenming [1 ]
Zhu, Yatao [1 ]
Wu, Meng [1 ]
Zhang, Hao [1 ]
Fan, Dongrui [1 ,2 ]
机构
[1] Chinese Acad Sci, ICT, State Key Lab Comp Architecture, Beijing, Peoples R China
[2] UCAS, Sch Comp & Control Engn, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
fine-grained dataflow; Convolutional Neural Network; general accelerator; data reuse; high parallel;
D O I
10.1109/HPCC/SmartCity/DSS.2018.00063
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Convolutional Neural Network(CNN) is a hot and state-of-the-art algorithm which is widely used in applications such as face recognition, intelligent monitoring, image recognition and text recognition. Because of its high computational complexity, many efficient hardware accelerators have been proposed to exploit high degree of parallel processing for CNN. However, accelerators which are implemented on FPGAs and ASICs usually sacrifice generality for higher performance and lower power consumption. Other accelerators, such as GPUs, are general enough, but they lead to higher power consumption. Fine-grained dataflow architectures, which break conventional Von Neumann architectures, show natural advantages in processing CNN-like algorithms with high computational efficiency and low power consumption. At the same time, it remains broadly applicable and adaptable. In this paper, we propose a scheme for implementing and optimizing CNN on fine-grained dataflow architecture based accelerators. The experiment results reveal that by using our scheme, the performance of AlexNet running on the dataflow accelerator is 3.11x higher than that on NVIDIA Tesla K80, and the power consumption of our hardware is 8.52x lower than that of K80.
引用
收藏
页码:243 / 251
页数:9
相关论文
共 50 条
  • [31] A Fine-Grained Source-Throttling Method for Mesh Architectures
    Zhao, Hongzhi
    Bagherzadeh, Nader
    Wang, Qiang
    Wang, Yongchang
    IEEE ACCESS, 2020, 8 : 33101 - 33112
  • [32] Termination detection for fine-grained message-passing architectures
    Naylor, Matthew
    Moore, Simon W.
    Mokhov, Andrey
    Thomas, David
    Beaumont, Jonathan R.
    Fleming, Shane
    Markettos, A. Theodore
    Bytheway, Thomas
    Brown, Andrew
    2020 IEEE 31ST INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS (ASAP 2020), 2020, : 17 - 24
  • [33] Active Microservice Fine-Grained Scaling Algorithm
    Peng, Kai
    Ma, Fangling
    Xu, Bo
    Guo, Jialu
    Hu, Menglan
    Computer Engineering and Applications, 2024, 60 (08) : 274 - 286
  • [34] FINE-GRAINED COLOUR DISCRIMINATION WITHOUT FINE-GRAINED COLOUR
    Gert, Joshua
    AUSTRALASIAN JOURNAL OF PHILOSOPHY, 2015, 93 (03) : 602 - 605
  • [35] Accelerating Lattice QCD Multigrid on GPUs Using Fine-Grained Parallelization
    Clark, M. A.
    Joo, Balint
    Strelchenko, Alexei
    Cheng, Michael
    Gambhir, Arjun
    Brower, Richard. C.
    SC '16: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2016, : 795 - 806
  • [36] A novel CNN structure for fine-grained classification of Chinese calligraphy styles
    Jiulong Zhang
    Mingtao Guo
    Jianping Fan
    International Journal on Document Analysis and Recognition (IJDAR), 2019, 22 : 177 - 188
  • [37] Learning a Discriminative Filter Bank within a CNN for Fine-grained Recognition
    Wang, Yaming
    Morariu, Vlad I.
    Davis, Larry S.
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 4148 - 4157
  • [38] A novel CNN structure for fine-grained classification of Chinese calligraphy styles
    Zhang, Jiulong
    Guo, Mingtao
    Fan, Jianping
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2019, 22 (02) : 177 - 188
  • [39] Two-Stream Contextualized CNN for Fine-Grained Image Classification
    Liu, Jiang
    Gao, Chenqiang
    Meng, Deyu
    Zuo, Wangmeng
    THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 4232 - 4233
  • [40] CNN-Transformer with Stepped Distillation for Fine-Grained Visual Classification
    Xu, Qin
    Liu, Peng
    Wang, Jiahui
    Huang, Lili
    Tang, Jin
    PATTERN RECOGNITION AND COMPUTER VISION, PT IX, PRCV 2024, 2025, 15039 : 364 - 377