Reconfigurable Network-on-Chip based Convolutional Neural Network Accelerator

被引:4
|
作者
Firuzan, Arash [1 ]
Modarressi, Mehdi [2 ,3 ]
Reshadi, Midia [4 ]
机构
[1] Islamic Azad Univ, Dept Comp Engn, Sci & Res Branch, Tehran, Iran
[2] Univ Tehran, Coll Engn, Sch Elect & Comp Engn, Tehran, Iran
[3] Inst Studies Fundamental Sci IPM, Sch Comp Sci, Tehran, Iran
[4] Trinity Coll Dublin, Sch Comp Sci & Stat, Dublin, Ireland
关键词
CNN accelerator; Network-on-chip; Reconfigurable network-on-chip; PROCESSOR;
D O I
10.1016/j.sysarc.2022.102567
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Convolutional Neural Networks (CNNs) have a wide range of applications due to their superior performance in image and pattern classification. However, the performance of CNNs comes at the price of high computational load and memory bandwidth usage. Hardware acceleration has become the primary way to tackle this everincreasing complexity of CNNs. Most of the recent accelerators arrange processing units (PEs) as a many-core accelerator architecture, with the inter-PE connections tailored to the specific dataflow of the CNN layers. The performance of such accelerators is maximized if the input feature map and filter size/dimension matches that of the underlying accelerator. However, current fixed-size accelerator structures lead to sever resource underutilization because the same structure is used to compute CNN layers of varying dimensions. In this paper, we tackle this problem by presenting RC-CNN, a reconfigurable accelerator architecture for CNNs that can adapt the structure of accelerator to the size and dataflow pattern of the running CNN layer. RC-CNN relies on a reconfigurable on-chip interconnection fabric that can organize a sub-set of accelerator's PEs as a PE set with the same size/dimension of the target CNN layer and customize the inter-PE connections for the layer's dataflow pattern. Since the area/energy overhead does not justify using a full-fledged packet-switched network in accelerators with fine-grained PEs, we use a reconfigurable network with very simple switches in order to efficiently implement the dynamic reconfiguration capability for many-core fine-grained CNN accelerators. Experimental results show that, based on the CNN size and accelerator structure, RC-CNN yields 37% higher PE utilization over a baseline design, on average. It also improves the PE utilization of the state-of-the-art CNN accelerators we selected for comparison purpose by 18%, on average. The results show that these improvements translate to 9%-41% increase in the accelerator's throughput. Further, RC-CNN reduces the network latency and energy consumption by 28% and 22%, respectively, compared to the state-of-the-art utilization-aware methods that employ packet-switched networks-on-chip.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Network-on-Chip Evaluation for a Novel Neural Architecture
    Kynigos, Markos
    Navaridas, Javier
    Plana, Luis A.
    Furber, Steve
    [J]. 2018 ACM INTERNATIONAL CONFERENCE ON COMPUTING FRONTIERS, 2018, : 216 - 219
  • [42] Design and Implementation of a Hybrid Switching Router for the Reconfigurable Network-on-Chip
    Nguyen, Hung K.
    Xuan-Tu Tran
    [J]. PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR COMMUNICATIONS (ATC), 2016, : 328 - 333
  • [43] BARAN: Bimodal Adaptive Reconfigurable-Allocator Network-on-Chip
    Mirhosseini, Amirhossein
    Sadrosadati, Mohammad
    Aghamohammadi, Fatemeh
    Modarressi, Mehdi
    Sarbazi-Azad, Hamid
    [J]. ACM TRANSACTIONS ON PARALLEL COMPUTING, 2018, 5 (03)
  • [44] Hierarchical Multicast Network-On-Chip for Scalable Reconfigurable Neuromorphic Systems
    Hota, Gopabandhu
    Mysore, Nishant
    Deiss, Stephen
    Pedroni, Bruno
    Cauwenberghs, Gert
    [J]. 2022 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS 22), 2022, : 481 - 485
  • [45] Energy Optimization in a Network-on-Chip with Dynamically Reconfigurable Processing Nodes
    Nunez-Yanez, J. L.
    Hosseinabady, M.
    Nabina, A.
    Zaidi, I.
    [J]. 2009 IEEE CONTROL APPLICATIONS CCA & INTELLIGENT CONTROL (ISIC), VOLS 1-3, 2009, : 308 - 313
  • [46] The ReNoC Reconfigurable Network-on-Chip: Architecture, Configuration Algorithms, and Evaluation
    Stuart, Matthias Bo
    Stensgaard, Mikkel Bystrup
    Sparso, Jens
    [J]. ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2011, 10 (04)
  • [47] A Locally Reconfigurable Network-on-Chip Architecture and Application Mapping onto it
    Soumya, J.
    Sharma, Ashish
    Chattopadhyay, Santanu
    [J]. 18TH INTERNATIONAL SYMPOSIUM ON VLSI DESIGN AND TEST, 2014,
  • [48] Reconfigurable 100 Gb/s Silicon Photonic Network-on-Chip
    Dong, Po
    Chen, Young-Kai
    Gu, Tingyi
    Buhl, Lawrence L.
    Neilson, David T.
    Sinsky, Jeffrey H.
    [J]. JOURNAL OF OPTICAL COMMUNICATIONS AND NETWORKING, 2015, 7 (01) : A37 - A43
  • [49] Reconfigurable 100 Gb/s Silicon Photonic Network-on-Chip
    Dong, Po
    Chen, Young-Kai
    Gu, Tingyi
    Buhl, Lawrence L.
    Neilson, David T.
    Sinsky, Jeffrey H.
    [J]. 2014 OPTICAL FIBER COMMUNICATIONS CONFERENCE AND EXHIBITION (OFC), 2014,
  • [50] Ring based Optical Network-on-Chip
    Wang, Zhengyu
    Gu, Huaxi
    Yang, Yintang
    Li, Yonghui
    [J]. OPTICS COMMUNICATIONS, 2012, 285 (06) : 1010 - 1016