Kernel Shape Control for Row-Efficient Convolution on Processing-In-Memory Arrays

被引：2

作者：

Rhe, Johnny ^{[1
]}

Jeon, Kang Eun ^{[1
]}

Lee, Joo Chan ^{[2
]}

Jeong, Seongmoon ^{[2
]}

Ko, Jong Hwan ^{[3
]}

机构：

[1] Sungkyunkwan Univ, Dept Elect & Comp Engn, Suwon, South Korea

[2] Sungkyunkwan Univ, Dept Artificial Intelligence, Suwon, South Korea

[3] Sungkyunkwan Univ, Coll Informat & Commun Engn, Suwon, South Korea

来源：

2023 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER AIDED DESIGN, ICCAD | 2023年

基金：

新加坡国家研究基金会;

关键词：

processing-in-memory; shift and duplicate (SDK) weight mapping; weight pruning; neural compression; ARCHITECTURE; PRECISION;

D O I：

10.1109/ICCAD57390.2023.10323749

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Processing-in-memory (PIM) architectures have been highlighted as one of the viable solutions for faster and more power-efficient convolutional neural networks (CNNs) inference. Recently, shift and duplicate kernel (SDK) convolutional weight mapping scheme was proposed, achieving up to 50% throughput improvement over the prior arts. However, the traditional pattern-based pruning methods, which were adopted for row-skipping and computing cycle reduction, are not optimal for the latest SDK mapping due to structural irregularity caused by the shifted and duplicated kernels. To address this issue, we propose a method called kernel shape control (KERNTROL) that aims to promote structural regularity for achieving a high row-skipping ratio and model accuracy. Instead of pruning certain weight elements permanently, KERNTROL controls the kernel shapes through the omission of certain weights based on their mapped columns. In comparison to the latest pattern-based pruning approaches, KERNTROL achieves up to 36.4% improvement in the compression rate, and 38.6% in array utilization with maintaining the original model accuracy.

引用

页数：9

共 50 条

[21] CuckooPIM: An Efficient and Less-blocking Coherence Mechanism for Processing-in-Memory Systems
Xu, Sheng
Chen, Xiaoming
Wang, Ying
Han, Yinhe
Li, Xiaowei
24TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC 2019), 2019, : 140 - 145
[22] An Energy-Efficient Quantized and Regularized Training Framework For Processing-In-Memory Accelerators
Sun, Hanbo
Zhu, Zhenhua
Cai, Yi
Chen, Xiaoming
Wang, Yu
Yang, Huazhong
2020 25TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE, ASP-DAC 2020, 2020, : 325 - 330
[23] Processing-in-memory based multilateration localization in wireless sensor networks using memristor crossbar arrays
Siyad, B. Mohammed
Mohan, Ramasundaram
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2024, 36 (12):
[24] Gibbon: An Efficient Co-Exploration Framework of NN Model and Processing-In-Memory Architecture
Sun, Hanbo
Zhu, Zhenhua
Wang, Chenyu
Ning, Xuefei
Dai, Guohao
Yang, Huazhong
Wang, Yu
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2023, 42 (11) : 4075 - 4089
[25] Efficient and reconfigurable reservoir computing to realize alphabet pronunciation recognition based on processing-in-memory
Liu, Shuang
Wu, Yuancong
Xiong, Canlong
Liu, Yihe
Yang, Jing
Yu, Q.
Hu, S. G.
Chen, T. P.
Liu, Y.
APPLIED PHYSICS LETTERS, 2021, 119 (10)
[26] Efficient Processing-in-Memory System Based on RISC-V Instruction Set Architecture
Lim, Jihwan
Son, Jeonghun
Yoo, Hoyoung
ELECTRONICS, 2024, 13 (15)
[27] An Efficient GCNs Accelerator Using 3D-Stacked Processing-in-Memory Architectures
Wang, Runze
Hu, Ao
Zheng, Long
Wang, Qinggang
Yuan, Jingrui
Liu, Haifeng
Yu, Linchen
Liao, Xiaofei
Jin, Hai
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2024, 43 (05) : 1360 - 1373
[28] SparseP: Towards Efficient Sparse Matrix Vector Multiplication on Real Processing-In-Memory Architectures
Giannoula, Christina
Fernandez, Ivan
Luna, Juan Gomez
Koziris, Nectarios
Goumas, Georgios
Mutlu, Onur
PROCEEDINGS OF THE ACM ON MEASUREMENT AND ANALYSIS OF COMPUTING SYSTEMS, 2022, 6 (01)
[29] An Energy-efficient Processing-in-memory Architecture for Long Short Term Memory in Spin Orbit Torque MRAM
Kim, Kyeonghan
Shin, Hyein
Sim, Jaehyeong
Kang, Myeonggu
Kim, Lee-Sup
2019 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN (ICCAD), 2019,
[30] Identification of a convolution kernel in a control problem for the heat equation with a boundary memory term
Cecilia Cavaterra
Davide Guidetti
Annali di Matematica Pura ed Applicata, 2014, 193 : 779 - 816

← 1 2 3 4 5 →