Kernel Shape Control for Row-Efficient Convolution on Processing-In-Memory Arrays

Cited by: 2
Authors
Rhe, Johnny [1]
Jeon, Kang Eun [1]
Lee, Joo Chan [2]
Jeong, Seongmoon [2]
Ko, Jong Hwan [3]
Affiliations
[1] Sungkyunkwan Univ, Dept Elect & Comp Engn, Suwon, South Korea
[2] Sungkyunkwan Univ, Dept Artificial Intelligence, Suwon, South Korea
[3] Sungkyunkwan Univ, Coll Informat & Commun Engn, Suwon, South Korea
Funding
National Research Foundation of Singapore;
Keywords
processing-in-memory; shift and duplicate (SDK) weight mapping; weight pruning; neural compression; ARCHITECTURE; PRECISION;
DOI
10.1109/ICCAD57390.2023.10323749
Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
Processing-in-memory (PIM) architectures have been highlighted as one of the viable solutions for faster and more power-efficient convolutional neural network (CNN) inference. Recently, the shift and duplicate kernel (SDK) convolutional weight mapping scheme was proposed, achieving up to 50% throughput improvement over the prior art. However, traditional pattern-based pruning methods, which were adopted for row-skipping and computing cycle reduction, are not optimal for the latest SDK mapping because of the structural irregularity caused by the shifted and duplicated kernels. To address this issue, we propose a method called kernel shape control (KERNTROL) that aims to promote structural regularity in order to achieve a high row-skipping ratio and model accuracy. Instead of pruning certain weight elements permanently, KERNTROL controls the kernel shapes by omitting certain weights based on their mapped columns. In comparison to the latest pattern-based pruning approaches, KERNTROL achieves up to 36.4% improvement in the compression rate and 38.6% in array utilization while maintaining the original model accuracy.
Pages: 9
Related papers
50 in total
  • [21] CuckooPIM: An Efficient and Less-blocking Coherence Mechanism for Processing-in-Memory Systems
    Xu, Sheng
    Chen, Xiaoming
    Wang, Ying
    Han, Yinhe
    Li, Xiaowei
    24TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC 2019), 2019 : 140 - 145
  • [22] An Energy-Efficient Quantized and Regularized Training Framework For Processing-In-Memory Accelerators
    Sun, Hanbo
    Zhu, Zhenhua
    Cai, Yi
    Chen, Xiaoming
    Wang, Yu
    Yang, Huazhong
    2020 25TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE, ASP-DAC 2020, 2020 : 325 - 330
  • [23] Processing-in-memory based multilateration localization in wireless sensor networks using memristor crossbar arrays
    Siyad, B. Mohammed
    Mohan, Ramasundaram
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2024, 36 (12)
  • [24] Gibbon: An Efficient Co-Exploration Framework of NN Model and Processing-In-Memory Architecture
    Sun, Hanbo
    Zhu, Zhenhua
    Wang, Chenyu
    Ning, Xuefei
    Dai, Guohao
    Yang, Huazhong
    Wang, Yu
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2023, 42 (11) : 4075 - 4089
  • [25] Efficient and reconfigurable reservoir computing to realize alphabet pronunciation recognition based on processing-in-memory
    Liu, Shuang
    Wu, Yuancong
    Xiong, Canlong
    Liu, Yihe
    Yang, Jing
    Yu, Q.
    Hu, S. G.
    Chen, T. P.
    Liu, Y.
    APPLIED PHYSICS LETTERS, 2021, 119 (10)
  • [26] Efficient Processing-in-Memory System Based on RISC-V Instruction Set Architecture
    Lim, Jihwan
    Son, Jeonghun
    Yoo, Hoyoung
    ELECTRONICS, 2024, 13 (15)
  • [27] An Efficient GCNs Accelerator Using 3D-Stacked Processing-in-Memory Architectures
    Wang, Runze
    Hu, Ao
    Zheng, Long
    Wang, Qinggang
    Yuan, Jingrui
    Liu, Haifeng
    Yu, Linchen
    Liao, Xiaofei
    Jin, Hai
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2024, 43 (05) : 1360 - 1373
  • [28] SparseP: Towards Efficient Sparse Matrix Vector Multiplication on Real Processing-In-Memory Architectures
    Giannoula, Christina
    Fernandez, Ivan
    Luna, Juan Gomez
    Koziris, Nectarios
    Goumas, Georgios
    Mutlu, Onur
    PROCEEDINGS OF THE ACM ON MEASUREMENT AND ANALYSIS OF COMPUTING SYSTEMS, 2022, 6 (01)
  • [29] An Energy-efficient Processing-in-memory Architecture for Long Short Term Memory in Spin Orbit Torque MRAM
    Kim, Kyeonghan
    Shin, Hyein
    Sim, Jaehyeong
    Kang, Myeonggu
    Kim, Lee-Sup
    2019 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN (ICCAD), 2019
  • [30] Identification of a convolution kernel in a control problem for the heat equation with a boundary memory term
    Cecilia Cavaterra
    Davide Guidetti
    Annali di Matematica Pura ed Applicata, 2014, 193 : 779 - 816