A Weighted Current Summation Based Mixed Signal DRAM-PIM Architecture for Deep Neural Network Inference

被引:3
|
作者
Sudarshan, Chirag [1 ]
Soliman, Taha [2 ]
Lappas, Jan [1 ]
Weis, Christian [1 ]
Sadi, Mohammad Hassani [1 ]
Jung, Matthias [3 ]
Guntoro, Andre [2 ]
Wehn, Norbert [1 ]
机构
[1] TU Kaiserslautern, Microelect Syst Design Res Grp, D-67663 Kaiserslautern, Germany
[2] Robert Bosch GmbH Corp Res CR ADT2, D-70839 Stuttgart, Germany
[3] Fraunhofer Inst Expt Software Engn IESE, D-67663 Kaiserslautern, Germany
关键词
Computer architecture; Random access memory; Parallel processing; Optical wavelength conversion; Neural networks; Kernel; Performance evaluation; Processing-in-memory; PIM; compute-in-memory; CIM; DRAM; CNN; DNN; EFFICIENT;
D O I
10.1109/JETCAS.2022.3170235
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Processing-in-Memory (PIM) is an emerging approach to bridge the memory-computation gap. One of the major challenges of PIM architectures in the scope of Deep Neural Network (DNN) inference is the implementation of area-intensive Multiply-Accumulate (MAC) units in memory technologies, especially for DRAM-based PIMs. The DRAM architecture restricts the integration of DNN computation units near the area optimized commodity DRAM Sub-Array (SA) or Primary Sense Amplifier (PSA) region, where the data parallelism is maximum and the data movement cost is minimum. In this paper, we present a novel DRAM-based PIM architecture that is based on bit-decomposed MAC operation and Weighted Current Summation (WCS) technique to implement the MAC unit with minimal additional circuitry in the PSA region by leveraging on mixed-signal design. The architecture presents a two-stage design that employs light-weight current mirror based analog units near the SAs in the PSA region, whereas all the other substantial logic is integrated near the bank peripheral region. Hence, our architecture attains a balance between the data parallelism, data movement energy and area optimization. For an 8-bit CNN inference, our novel 8Gb DRAM PIM device achieves a peak performance of 142.8GOPS while consuming a power of 756.76mW, which results in an energy efficiency of 188.8GOPS/W. The area overhead of such an 8Gb device for a 2ynm DRAM technology is 12.63% in comparison to a commodity 8Gb DRAM device.
引用
收藏
页码:367 / 380
页数:14
相关论文
共 50 条
  • [21] Deep Neural Network Architecture for Anomaly Based Intrusion Detection System
    Behera, Sidharth
    Pradhan, Ayush
    Dash, Ratnakar
    [J]. 2018 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND INTEGRATED NETWORKS (SPIN), 2018, : 270 - 274
  • [22] Integrated processing method for microseismic signal based on deep neural network
    Zhang, Hang
    Ma, Chunchi
    Jiang, Yupeng
    Li, Tianbin
    Pazzi, Veronica
    Casagli, Nicola
    [J]. GEOPHYSICAL JOURNAL INTERNATIONAL, 2021, 226 (03) : 2145 - 2157
  • [23] LFM Signal Detection and Estimation Based on Deep Convolutional Neural Network
    Chen, Xiaolong
    Jiang, Qiaowen
    Su, Ningyuan
    Chen, Baoxin
    Guan, Jian
    [J]. 2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 753 - 758
  • [24] A Method of Underwater Acoustic Signal Classification Based on Deep Neural Network
    Wei, Zhengxian
    Ju, Yang
    Song, Min
    [J]. 2018 5TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND CONTROL ENGINEERING (ICISCE 2018), 2018, : 46 - 50
  • [25] Low Complexity Reconfigurable-Scalable Architecture Design Methodology for Deep Neural Network Inference Accelerator
    Nimbekar, Anagha
    Vatti, Chandrasekhara Srinivas
    Dinesh, Y. V. Sai
    Singh, Sunidhi
    Gupta, Tarun
    Chandrapu, Ramesh Reddy
    Acharyya, Amit
    [J]. 2022 IEEE 35TH INTERNATIONAL SYSTEM-ON-CHIP CONFERENCE (IEEE SOCC 2022), 2022, : 83 - 88
  • [26] Parallel Processing Method for Microseismic Signal Based on Deep Neural Network
    Ma, Chunchi
    Yan, Wenjin
    Xu, Weihao
    Li, Tianbin
    Ran, Xuefeng
    Wan, Jiangjun
    Tong, Ke
    Lin, Yu
    [J]. REMOTE SENSING, 2023, 15 (05)
  • [27] A Review of FPGA-Based Custom Computing Architecture for Convolutional Neural Network Inference
    Peng Xiyuan
    Yu Jinxiang
    Yao Bowen
    Liu Liansheng
    Peng Yu
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2021, 30 (01) : 1 - 17
  • [28] A Review of FPGA-Based Custom Computing Architecture for Convolutional Neural Network Inference
    PENG Xiyuan
    YU Jinxiang
    YAO Bowen
    LIU Liansheng
    PENG Yu
    [J]. Chinese Journal of Electronics, 2021, 30 (01) : 1 - 17
  • [29] Deep Neural Network and Weighted Ridge Regression Based Pixel Prediction Technique
    Chang, Jer-Ming
    Ding, Jian-Jiun
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN (ICCE-TAIWAN), 2020,
  • [30] Personalized Recommendation Algorithm Based on Deep Neural Network and Weighted Implicit Feedback
    基于深度神经网络和加权隐反馈的个性化推荐
    [J]. Xue, Feng (feng.xue@hfut.edu.cn), 1600, Science Press (33): : 295 - 302