Fast Algorithms for Convolutional Neural Networks

Cited by: 490
Authors
Lavin, Andrew [1]
Gray, Scott [1]
Affiliations
[1] Nervana Syst, San Diego, CA 92121 USA
DOI
10.1109/CVPR.2016.435
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Deep convolutional neural networks take GPU-days of computation to train on large data sets. Pedestrian detection for self-driving cars requires very low latency. Image recognition for mobile phones is constrained by limited processing resources. The success of convolutional neural networks in these situations is limited by how fast we can compute them. Conventional FFT-based convolution is fast for large filters, but state-of-the-art convolutional neural networks use small, 3 × 3 filters. We introduce a new class of fast algorithms for convolutional neural networks using Winograd's minimal filtering algorithms. The algorithms compute minimal-complexity convolution over small tiles, which makes them fast with small filters and small batch sizes. We benchmark a GPU implementation of our algorithm with the VGG network and show state-of-the-art throughput at batch sizes from 1 to 64.
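As a rough illustration of the minimal filtering idea named in the abstract, the sketch below shows the standard one-dimensional Winograd case F(2, 3): two outputs of a 3-tap filter computed with 4 data-dependent multiplications instead of the 6 used by the direct form. This is only a minimal sketch of the textbook 1D identity, not the authors' GPU implementation; the function names `direct_f23` and `winograd_f23` are hypothetical and chosen for this example.

```python
def direct_f23(d, g):
    """Reference form: 2 outputs of a 3-tap filter over 4 inputs (6 multiplications)."""
    return [
        d[0] * g[0] + d[1] * g[1] + d[2] * g[2],
        d[1] * g[0] + d[2] * g[1] + d[3] * g[2],
    ]


def winograd_f23(d, g):
    """Same 2 outputs via Winograd F(2, 3), using 4 data-dependent multiplications."""
    # Filter transform: depends only on g, so it can be computed once per filter.
    u0 = g[0]
    u1 = (g[0] + g[1] + g[2]) / 2
    u2 = (g[0] - g[1] + g[2]) / 2
    u3 = g[2]

    # Input transform followed by the four elementwise multiplications.
    m1 = (d[0] - d[2]) * u0
    m2 = (d[1] + d[2]) * u1
    m3 = (d[2] - d[1]) * u2
    m4 = (d[1] - d[3]) * u3

    # Output transform: additions and subtractions only.
    return [m1 + m2 + m3, m2 - m3 - m4]


if __name__ == "__main__":
    data = [1.0, 2.0, 3.0, 4.0]
    taps = [1.0, 0.0, -1.0]
    print(direct_f23(data, taps))
    print(winograd_f23(data, taps))
```

Both functions should print the same pair of values for the same input. The saving comes from moving work into the filter transform, which is reusable across all tiles that share a filter.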
Pages: 4013 - 4021
Number of pages: 9
Related papers
50 in total
  • [1] Evaluating Fast Algorithms for Convolutional Neural Networks on FPGAs
    Lu, Liqiang
    Liang, Yun
    Xiao, Qingcheng
    Yan, Shengen
    [J]. 2017 IEEE 25TH ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM 2017), 2017, : 101 - 108
  • [2] Evaluating Fast Algorithms for Convolutional Neural Networks on FPGAs
    Liang, Yun
    Lu, Liqiang
    Xiao, Qingcheng
    Yan, Shengen
    [J]. IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2020, 39 (04) : 857 - 870
  • [3] Fast Algorithms for Quaternion-Valued Convolutional Neural Networks
    Cariow, Aleksandr
    Cariowa, Galina
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (01) : 457 - 462
  • [4] Fast 2D Convolution Algorithms for Convolutional Neural Networks
    Cheng, Chao
    Parhi, Keshab K.
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2020, 67 (05) : 1678 - 1691
  • [5] FAST LEARNING ALGORITHMS FOR NEURAL NETWORKS
    KARAYIANNIS, NB
    VENETSANOPOULOS, AN
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-ANALOG AND DIGITAL SIGNAL PROCESSING, 1992, 39 (07) : 453 - 474
  • [6] FAST GRAPH CONVOLUTIONAL RECURRENT NEURAL NETWORKS
    Kadambari, Sai Kiran
    Chepuri, Sundeep Prabhakar
    [J]. CONFERENCE RECORD OF THE 2019 FIFTY-THIRD ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, 2019, : 467 - 471
  • [7] Fast Convolution Algorithm for Convolutional Neural Networks
    Kim, Tae Sun
    Bae, Jihoon
    Sunwoo, Myung Hoon
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE CIRCUITS AND SYSTEMS (AICAS 2019), 2019, : 258 - 261
  • [8] Fast Computing Framework for Convolutional Neural Networks
    Korytkowski, Marcin
    Staszewski, Pawel
    Woldan, Piotr
    Scherer, Rafal
    [J]. PROCEEDINGS OF 2016 IEEE INTERNATIONAL CONFERENCES ON BIG DATA AND CLOUD COMPUTING (BDCLOUD 2016) SOCIAL COMPUTING AND NETWORKING (SOCIALCOM 2016) SUSTAINABLE COMPUTING AND COMMUNICATIONS (SUSTAINCOM 2016) (BDCLOUD-SOCIALCOM-SUSTAINCOM 2016), 2016, : 118 - 123
  • [9] Towards Design Space Exploration and Optimization of Fast Algorithms for Convolutional Neural Networks (CNNs) on FPGAs
    Ahmad, Afzal
    Pasha, Muhammad Adeel
    [J]. 2019 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2019, : 1106 - 1111
  • [10] The effect of image enhancement algorithms on convolutional neural networks
    Rodriguez-Rodriguez, Jose A.
    Molina-Cabello, Miguel A.
    Benitez-Rochel, Rafaela
    Lopez-Rubio, Ezequiel
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 3084 - 3089