HitM: High-Throughput ReRAM-based PIM for Multi-Modal Neural Networks

被引:6
|
作者
Li, Bing [1 ]
Wang, Ying [2 ]
Chen, Yiran [3 ]
机构
[1] Capital Normal Univ, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China
[3] Duke Univ, Dept Elect & Comp Engn, Durham, NC USA
来源
2020 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER AIDED-DESIGN (ICCAD) | 2020年
基金
中国国家自然科学基金;
关键词
multi-modal neural networks; ReRAM; processing-in-memory; accelerator;
D O I
10.1145/3400302.3415663
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
With the rapid progress of artificial intelligence (AI) algorithms, multi-modal deep neural networks (DNNs) have been applied to some challenging tasks, e.g., image and video description to process multi-modal information from vision and language. Resistive-memory-based processing-in-memory (ReRAM-based PIM) has been extensively studied to accelerate either convolutional neural network (CNN) or recurrent neural network (RNN). According to the requirements of their core layers, i.e. convolutional layers and linear layers, the existing ReRAM-based PIMs adopt different optimization schemes for them. Directly deploying multi-modal DNNs on the existing ReRAM-based PIMs, however, is inefficient because multi-modal DNNs have combined CNN and RNN where the primary layers differ depending on the specific tasks. Therefore, a high-efficiency ReRAM-based PIM design for multi-modal DNNs necessitates an adaptive optimization to the given network. In this work, we propose HitM, a high-throughput ReRAM-based PIM for multi-modal DNNs with a two-stage workflow, which consists of a static analysis and an adaptive optimization. The static analysis generates the layer-wise resource and computation information with the input multi-modal DNN description and the adaptive optimization produces a high-throughput ReRAM-based PIM design through the dynamic algorithm based on hardware resources and the information from the static analysis. We evaluated HitM using several popular multi-modal DNNs with different parameters and structures and compared it with a naive ReRAM-based PIM design and an optimal-throughput ReRAM-based PIM design that assumes no hardware resource limitations. The experimental results show that HitM averagely achieves 78.01% of the optimal throughput while consumes 64.52% of the total hardware resources.
引用
收藏
页数:7
相关论文
共 50 条
  • [31] Multi-Modal Reflection Removal Using Convolutional Neural Networks
    Sun, Jun
    Chang, Yakun
    Jung, Cheolkon
    Feng, Jiawei
    IEEE SIGNAL PROCESSING LETTERS, 2019, 26 (07) : 1011 - 1015
  • [32] Prediction of protein secondary structure by multi-modal neural networks
    Zhu, HX
    Yoshihara, I
    Yamamori, K
    Yasunaga, M
    RECENT ADVANCES IN SIMULATED EVOLUTION AND LEARNING, 2004, 2 : 682 - 697
  • [33] Multi-modal page stream segmentation with convolutional neural networks
    Gregor Wiedemann
    Gerhard Heyer
    Language Resources and Evaluation, 2021, 55 : 127 - 150
  • [34] Reinforced multi-modal cyberbullying detection with subgraph neural networks
    Luo, Kai
    Zheng, Ce
    Guan, Zhenyu
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2025, 16 (03) : 2161 - 2180
  • [35] Multi-Modal Depth Estimation Using Convolutional Neural Networks
    Siddiqui, Sadique Adnan
    Vierling, Axel
    Berns, Karsten
    2020 IEEE INTERNATIONAL SYMPOSIUM ON SAFETY, SECURITY, AND RESCUE ROBOTICS (SSRR 2020), 2020, : 354 - 359
  • [36] Learning multi-modal recurrent neural networks with target propagation
    Manchev, Nikolay
    Spratling, Michael
    COMPUTATIONAL INTELLIGENCE, 2024, 40 (04)
  • [37] A secure multi-modal biometrics using deep ConvGRU neural networks based hashing
    Sasikala, T. S.
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 235
  • [38] Multi-information Complementarity Neural Networks for Multi-Modal Action Recognition
    Ding, Chuan
    Tie, Yun
    Qi, Lin
    2019 8TH INTERNATIONAL SYMPOSIUM ON NEXT GENERATION ELECTRONICS (ISNE), 2019,
  • [39] Exploiting device-level non-idealities for adversarial attacks on ReRAM-based neural networks
    McLemore, Tyler
    Sunbury, Robert
    Brodzik, Seth
    Cronin, Zachary
    Timmons, Elias
    Chakraborty, Dwaipayan
    Memories - Materials, Devices, Circuits and Systems, 2023, 4
  • [40] A High-Throughput Reconfigurable Processing Array for Neural Networks
    Wu, Ephrem
    Zhang, Xiaoqian
    Berman, David
    Cho, Inkeun
    2017 27TH INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS (FPL), 2017,