A General-Purpose and Configurable Planar Data Processor for Energy-Efficient Pooling Computation

Cited by: 0
Authors
Pan, Lunshuai [1]
Xue, Peng [2]
Li, Hongxing [1]
Sun, Litao [1]
Huang, Mingqiang [2]
Affiliations
[1] Southeast Univ, Sch Elect Sci & Engn, Minist Educ, SEU FEI Nanopico Ctr,Key Lab MEMS, Nanjing 210096, Peoples R China
[2] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen, Peoples R China
Keywords
Energy-Efficient; CNN accelerator; Pooling; ACCELERATOR;
DOI
10.1109/AICAS54282.2022.9869992
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Convolutional neural networks (CNNs) are widely used in artificial intelligence applications. A typical CNN contains both convolution and pooling layers: convolution detects local conjunctions of features, and pooling merges similar patterns into one. Pooling therefore plays an essential role in a CNN. Although CNN accelerators have been studied extensively, most previous work focuses only on accelerating the convolution layers, and dedicated studies of pooling units are still lacking. Moreover, existing pooling designs are usually constrained by either poor flexibility or low energy/area efficiency. In this work, we propose a general-purpose, energy-efficient planar data processor that supports the pooling operations of different CNN structures. Using a configurable data-path control method, the processor supports universal pooling with arbitrary input feature shape and arbitrary pooling kernel/stride/padding size. The processor also achieves a hardware utilization ratio near 100% during operation. Most importantly, it consumes 86% less power and occupies 62% less area than the separate pooling module of NVDLA (NVIDIA Deep Learning Accelerator), which makes it particularly suitable for resource-limited edge intelligence devices.
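For readers unfamiliar with the operation the abstract refers to, the following is a minimal software sketch of 2-D max pooling with configurable kernel, stride, and padding. It is not the authors' hardware design; the function names `pool_output_size` and `max_pool2d` are hypothetical, and the sketch assumes square windows, symmetric padding smaller than the kernel, and padded positions that never win the maximum.

```python
from typing import List


def pool_output_size(n: int, kernel: int, stride: int, padding: int) -> int:
    """Output length along one axis: floor((n + 2*padding - kernel) / stride) + 1."""
    return (n + 2 * padding - kernel) // stride + 1


def max_pool2d(x: List[List[float]], kernel: int = 2, stride: int = 2,
               padding: int = 0) -> List[List[float]]:
    """Max pooling over a 2-D feature map stored as a list of rows.

    Assumption (illustrative only): padded positions are skipped, which is
    equivalent to padding with negative infinity; requires padding < kernel.
    """
    h, w = len(x), len(x[0])
    out_h = pool_output_size(h, kernel, stride, padding)
    out_w = pool_output_size(w, kernel, stride, padding)
    out = []
    for i in range(out_h):
        row = []
        for j in range(out_w):
            best = float("-inf")
            for di in range(kernel):
                for dj in range(kernel):
                    # Map the window position back to input coordinates.
                    r = i * stride + di - padding
                    c = j * stride + dj - padding
                    if 0 <= r < h and 0 <= c < w:
                        best = max(best, x[r][c])
            row.append(best)
        out.append(row)
    return out


feature_map = [
    [1, 2, 3, 4],
    [5, 6, 7, 8],
    [9, 10, 11, 12],
    [13, 14, 15, 16],
]
pooled = max_pool2d(feature_map, kernel=2, stride=2, padding=0)
```

With the 4x4 input above, a 2x2 window with stride 2 and no padding yields a 2x2 output; changing `kernel`, `stride`, or `padding` changes the output shape according to `pool_output_size`.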
Pages: 33-36
Number of pages: 4