HitM: High-Throughput ReRAM-based PIM for Multi-Modal Neural Networks

被引：6

作者：

Li, Bing ^{[1
]}

Wang, Ying ^{[2
]}

Chen, Yiran ^{[3
]}

机构：

[1] Capital Normal Univ, Beijing, Peoples R China

[2] Univ Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China

[3] Duke Univ, Dept Elect & Comp Engn, Durham, NC USA

来源：

2020 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER AIDED-DESIGN (ICCAD) | 2020年

基金：

中国国家自然科学基金;

关键词：

multi-modal neural networks; ReRAM; processing-in-memory; accelerator;

D O I：

10.1145/3400302.3415663

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

With the rapid progress of artificial intelligence (AI) algorithms, multi-modal deep neural networks (DNNs) have been applied to some challenging tasks, e.g., image and video description to process multi-modal information from vision and language. Resistive-memory-based processing-in-memory (ReRAM-based PIM) has been extensively studied to accelerate either convolutional neural network (CNN) or recurrent neural network (RNN). According to the requirements of their core layers, i.e. convolutional layers and linear layers, the existing ReRAM-based PIMs adopt different optimization schemes for them. Directly deploying multi-modal DNNs on the existing ReRAM-based PIMs, however, is inefficient because multi-modal DNNs have combined CNN and RNN where the primary layers differ depending on the specific tasks. Therefore, a high-efficiency ReRAM-based PIM design for multi-modal DNNs necessitates an adaptive optimization to the given network. In this work, we propose HitM, a high-throughput ReRAM-based PIM for multi-modal DNNs with a two-stage workflow, which consists of a static analysis and an adaptive optimization. The static analysis generates the layer-wise resource and computation information with the input multi-modal DNN description and the adaptive optimization produces a high-throughput ReRAM-based PIM design through the dynamic algorithm based on hardware resources and the information from the static analysis. We evaluated HitM using several popular multi-modal DNNs with different parameters and structures and compared it with a naive ReRAM-based PIM design and an optimal-throughput ReRAM-based PIM design that assumes no hardware resource limitations. The experimental results show that HitM averagely achieves 78.01% of the optimal throughput while consumes 64.52% of the total hardware resources.

引用

页数：7

共 50 条

[1] A Versatile ReRAM-based Accelerator for Convolutional Neural Networks
Mao, Manqing
Sun, Xiao Yu
Peng, Xiaochen
Yu, Shimeng
Chakrabarti, Chaitali
PROCEEDINGS OF THE 2018 IEEE INTERNATIONAL WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS), 2018, : 211 - 216
[2] On-Line Fault Protection for ReRAM-Based Neural Networks
Li, Wen
Wang, Ying
Liu, Cheng
He, Yintao
Liu, Lian
Li, Huawei
Li, Xiaowei
IEEE TRANSACTIONS ON COMPUTERS, 2023, 72 (02) : 423 - 437
[3] PHANES: ReRAM-based Photonic Accelerator for Deep Neural Networks
Liu, Yinyi
Liu, Jiaqi
Fu, Yuxiang
Chen, Shixi
Zhang, Jiaxu
Xu, Jiang
PROCEEDINGS OF THE 59TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC 2022, 2022, : 103 - 108
[4] High-Throughput Training of Deep CNNs on ReRAM-Based Heterogeneous Architectures via Optimized Normalization Layers
Joardar, Biresh Kumar
Deshwal, Aryan
Doppa, Janardhan Rao
Pande, Partha Pratim
Chakrabarty, Krishnendu
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2022, 41 (05) : 1537 - 1549
[5] Information content and analysis methods for Multi-Modal High-Throughput Biomedical Data
Ray, Bisakha
Henaff, Mikael
Ma, Sisi
Efstathiadis, Efstratios
Peskin, Eric R.
Picone, Marco
Poli, Tito
Aliferis, Constantin F.
Statnikov, Alexander
SCIENTIFIC REPORTS, 2014, 4
[6] Challenges in Developing Prediction Models for Multi-modal High-Throughput Biomedical Data
Alzubaidi, Abeer
INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 1, 2019, 868 : 1056 - 1069
[7] Information content and analysis methods for Multi-Modal High-Throughput Biomedical Data
Bisakha Ray
Mikael Henaff
Sisi Ma
Efstratios Efstathiadis
Eric R. Peskin
Marco Picone
Tito Poli
Constantin F. Aliferis
Alexander Statnikov
Scientific Reports, 4
[8] ReGNN: A ReRAM-based Heterogeneous Architecture for General Graph Neural Networks
Liu, Cong
Liu, Haikun
Jin, Hai
Liao, Xiaofei
Zhang, Yu
Duan, Zhuohui
Xu, Jiahong
Li, Huize
PROCEEDINGS OF THE 59TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC 2022, 2022, : 469 - 474
[9] A Survey of ReRAM-Based Architectures for Processing-In-Memory and Neural Networks
Mittal, Sparsh
MACHINE LEARNING AND KNOWLEDGE EXTRACTION, 2019, 1 (01): : 75 - 114
[10] Speech recognition with multi-modal features based on neural networks
Kim, Myung Won
Ryu, Joung Woo
Kim, Eun Ju
NEURAL INFORMATION PROCESSING, PT 2, PROCEEDINGS, 2006, 4233 : 489 - 498

← 1 2 3 4 5 →