Kernel Shape Control for Row-Efficient Convolution on Processing-In-Memory Arrays

被引:2
|
作者
Rhe, Johnny [1 ]
Jeon, Kang Eun [1 ]
Lee, Joo Chan [2 ]
Jeong, Seongmoon [2 ]
Ko, Jong Hwan [3 ]
机构
[1] Sungkyunkwan Univ, Dept Elect & Comp Engn, Suwon, South Korea
[2] Sungkyunkwan Univ, Dept Artificial Intelligence, Suwon, South Korea
[3] Sungkyunkwan Univ, Coll Informat & Commun Engn, Suwon, South Korea
基金
新加坡国家研究基金会;
关键词
processing-in-memory; shift and duplicate (SDK) weight mapping; weight pruning; neural compression; ARCHITECTURE; PRECISION;
D O I
10.1109/ICCAD57390.2023.10323749
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Processing-in-memory (PIM) architectures have been highlighted as one of the viable solutions for faster and more power-efficient convolutional neural networks (CNNs) inference. Recently, shift and duplicate kernel (SDK) convolutional weight mapping scheme was proposed, achieving up to 50% throughput improvement over the prior arts. However, the traditional pattern-based pruning methods, which were adopted for row-skipping and computing cycle reduction, are not optimal for the latest SDK mapping due to structural irregularity caused by the shifted and duplicated kernels. To address this issue, we propose a method called kernel shape control (KERNTROL) that aims to promote structural regularity for achieving a high row-skipping ratio and model accuracy. Instead of pruning certain weight elements permanently, KERNTROL controls the kernel shapes through the omission of certain weights based on their mapped columns. In comparison to the latest pattern-based pruning approaches, KERNTROL achieves up to 36.4% improvement in the compression rate, and 38.6% in array utilization with maintaining the original model accuracy.
引用
收藏
页数:9
相关论文
共 50 条
  • [41] A 0.18μm CMOS implementation of an area efficient precise exception handling unit for processing-in-memory systems
    Mediratta, S
    Steele, C
    Singh, R
    Sondeen, J
    Draper, J
    2004 47TH MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL III, CONFERENCE PROCEEDINGS, 2004, : 455 - 458
  • [42] PIMA-Logic: A Novel Processing-in-Memory Architecture for Highly Flexible and Energy-Efficient Logic Computation
    Angizi, Shaahin
    He, Zhezhi
    Fan, Deliang
    2018 55TH ACM/ESDA/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2018,
  • [43] CMP-PIM: An Energy-Efficient Comparator-based Processing-In-Memory Neural Network Accelerator
    Angizi, Shaahin
    He, Zhezhi
    Rakin, Adnan Siraj
    Fan, Deliang
    2018 55TH ACM/ESDA/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2018,
  • [44] PIMA-LPN: Processing-in-memory Acceleration for Efficient LPN-based Post-Quantum Cryptography
    Ding, Lin
    Bian, Song
    Zhang, Jiliang
    2023 60TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC, 2023,
  • [45] An area-efficient standard-cell floating-point unit design for a processing-in-memory system
    Moon, JS
    Kwon, TJ
    Sondeen, J
    Draper, J
    ESSCIRC 2003: PROCEEDINGS OF THE 29TH EUROPEAN SOLID-STATE CIRCUITS CONFERENCE, 2003, : 57 - 60
  • [46] KERNTROL: Kernel Shape Control Toward Ultimate Memory Utilization for In-Memory Convolutional Weight Mapping
    Rhe, Johnny
    Jeon, Kang Eun
    Lee, Joo Chan
    Jeong, Seongmoon
    Ko, Jong Hwan
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2024, : 1 - 14
  • [47] T-PIM: An Energy-Efficient Processing-in-Memory Accelerator for End-to-End On-Device Training
    Heo, Jaehoon
    Kim, Junsoo
    Lim, Sukbin
    Han, Wontak
    Kim, Joo-Young
    IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2023, 58 (03) : 600 - 613
  • [48] Smart Multiple Wetting Control on ZnO Coated Shape Memory Polymer Arrays
    Wang Xiaonan
    Wang Bohan
    Lai Hua
    Cheng Zhongjun
    CHEMICAL RESEARCH IN CHINESE UNIVERSITIES, 2023, 39 (01) : 151 - 158
  • [49] Processing-in-memory in High Bandwidth Memory (PIM-HBM) Architecture with Energy-efficient and Low Latency Channels for High Bandwidth System
    Kim, Seongguk
    Kim, Subin
    Cho, Kyungjun
    Shin, Taein
    Park, Hyunwook
    Lho, Daehwan
    Park, Shinyoung
    Son, Kyungjune
    Park, Gapyeol
    Kim, Joungho
    2019 IEEE 28TH CONFERENCE ON ELECTRICAL PERFORMANCE OF ELECTRONIC PACKAGING AND SYSTEMS (EPEPS 2019), 2019,
  • [50] Smart Multiple Wetting Control on ZnO Coated Shape Memory Polymer Arrays
    Xiaonan Wang
    Bohan Wang
    Hua Lai
    Zhongjun Cheng
    Chemical Research in Chinese Universities, 2023, 39 : 151 - 158