Towards Resilient Analog In-Memory Deep Learning via Data Layout Re-Organization

被引：1

作者：

Rashed, Muhammad Rashedul Haq ^{[1
]}

Awad, Amro ^{[2
]}

Jha, Sumit Kumar ^{[3
]}

Ewetz, Rickard ^{[1
]}

机构：

[1] Univ Cent Florida, Dept ECE, Orlando, FL 32816 USA

[2] North Carolina State Univ, Dept ECE, Raleigh, NC USA

[3] Univ Texas San Antonio, CS Dept, San Antonio, TX USA

来源：

PROCEEDINGS OF THE 59TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC 2022 | 2022年

关键词：

D O I：

10.1145/3489517.3530532

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Processing in-memory paves the way for neural network inference engines. An arising challenge is to develop the software/hardware interface to automatically compile deep learning models onto in memory computing platforms. In this paper, we observe that the data layout organization of a deep neural network (DNN) model directly impacts the model's classification accuracy. This stems from that the resistive parasitics within a crossbar introduces a dependency between the matrix data and the precision of the analog computation. To minimize the impact of the parasitics, we first perform a case study to understand the underlying matrix properties that result in computation with low and high precision, respectively. Next, we propose the XORG framework that performs data layout organization for DNNs deployed on in-memory computing platforms. The data layout organization improves precision by optimizing the weight matrix to crossbar assignments at compile time. The experimental results show that the XORG framework improves precision with up to 3.2X and 31% on the average. When accelerating DNNs using XORG, the write bit-accuracy requirements are relaxed with 1-bit and the robustness to random telegraph noise (RTN) is improved.

引用

页码：859 / 864

页数：6

共 17 条

[1] ALPINE: Analog In-Memory Acceleration With Tight Processor Integration for Deep Learning
Klein, Joshua
Boybat, Irem
Qureshi, Yasir Mahmood
Dazzi, Martino
Levisse, Alexandre
Ansaloni, Giovanni
Zapater, Marina
Sebastian, Abu
Atienza, David
[J]. IEEE TRANSACTIONS ON COMPUTERS, 2023, 72 (07) : 1985 - 1998
[2] Variation-Resilient FeFET-Based In-Memory Computing Leveraging Probabilistic Deep Learning
Manna, Bibhas
Saha, Arnob
Jiang, Zhouhang
Ni, Kai
Sengupta, Abhronil
[J]. IEEE TRANSACTIONS ON ELECTRON DEVICES, 2024, 71 (05) : 2963 - 2969
[3] Towards Decrypting the Art of Analog Layout: Placement Quality Prediction via Transfer Learning
Liu, Mingjie
Zhu, Keren
Gu, Jiaqi
Shen, Linxiao
Tang, Xiyuan
Sun, Nan
Pan, David Z.
[J]. PROCEEDINGS OF THE 2020 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2020), 2020, : 496 - 501
[4] Smarter Traffic Prediction Using Big Data, In-Memory Computing, Deep Learning and GPUs
Aqib, Muhammad
Mehmood, Rashid
Alzahrani, Ahmed
Katib, Iyad
Albeshri, Aiiad
Altowaijri, Saleh M.
[J]. SENSORS, 2019, 19 (09)
[5] PIM-DL: Boosting DNN Inference on Digital Processing In-Memory Architectures via Data Layout Optimizations
Zhou, Minxuan
Chen, Guoyang
Imani, Mohsen
Gupta, Saransh
Zhang, Weifeng
Rosing, Tajana
[J]. 30TH INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES (PACT 2021), 2021, : 186 - 198
[6] Providing Transaction Class-Based QoS in in-Memory Data Grids Via Machine Learning
Di Sanzo, Pierangelo
Molfese, Francesco Maria
Rughetti, Diego
Ciciani, Bruno
[J]. 2014 IEEE 3RD SYMPOSIUM ON NETWORK CLOUD COMPUTING AND APPLICATIONS (NCCA), 2014, : 46 - 53
[7] Auto-tuning of Cloud-based In-memory Transactional Data Grids via Machine Learning
Di Sanzo, Pierangelo
Rughetti, Diego
Ciciani, Bruno
Quaglia, Francesco
[J]. 2012 IEEE SECOND SYMPOSIUM ON NETWORK CLOUD COMPUTING AND APPLICATIONS (NCCA 2012), 2012, : 9 - 16
[8] Introduction to Analog Testing of Resistive Random Access Memory (RRAM) Devices Towards Scalable Analog Compute Technology for Deep Learning
Pujari, Ruturaj
Gasasira, Arthur
Kim, Youngseok
Katragadda, Veenadhar
Seo, Soon-Cheon
Kong, Dexin
Liu, Xuefeng
Teehan, Sean
Saulnier, Nicole
Ahsan, Ishtiaq
Narayanan, Vijay
Ando, Takashi
[J]. 2021 32ND ANNUAL SEMI ADVANCED SEMICONDUCTOR MANUFACTURING CONFERENCE (ASMC), 2021,
[9] Rapid Transit Systems: Smarter Urban Planning Using Big Data, In-Memory Computing, Deep Learning, and GPUs
Aqib, Muhammad
Mehmood, Rashid
Alzahrani, Ahmed
Katib, Iyad
Albeshri, Aiiad
Altowaijri, Saleh M.
[J]. SUSTAINABILITY, 2019, 11 (10)
[10] Memory-efficient deep learning inference with incremental weight loading and data layout reorganization on edge systems
Ji, Cheng
Wu, Fan
Zhu, Zongwei
Chang, Li-Pin
Liu, Huanghe
Zhai, Wenjie
[J]. JOURNAL OF SYSTEMS ARCHITECTURE, 2021, 118

← 1 2 →