A Potential Enabler for High-Performance In-Memory Multi-Bit Arithmetic Schemes With Unipolar Switching SOT-MRAM

被引:0
|
作者
Zhu, Haonan [1 ,2 ]
Wu, Bi [1 ,2 ]
Yu, Tianyang [1 ,2 ]
Chen, Ke [1 ,2 ]
Yan, Chenggang [1 ,2 ]
Liu, Weiqiang [1 ,2 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Integrated Circuits, Nanjing 211106, Peoples R China
[2] Minist Ind & Informat Technol, Key Lab Aerosp Integrated Circuits & Microsyst, Nanjing 211106, Peoples R China
基金
中国国家自然科学基金;
关键词
Switches; Arithmetic; Computer architecture; Resistance; Parallel processing; Magnetization; Magnetic tunneling; Emerging technology; SOT-MRAM; computing-in-memory; multi-bit addition; multi-bit multiplication; CNN;
D O I
10.1109/TCSI.2024.3395442
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Due to the physical separation of data processing and storage, the conventional Von Neumann architecture exists excessive data migration overhead to curtail the progress of data-intensive applications. In this way, the Computing-in-Memory (CiM) architecture is proposed. Due to the boolean property of the memory cell, the current CiM mainly focuses on single-bit logic design. For the multi-bit arithmetic design, a prevalent patchwork approach is employed using single-bit logic, leaving the design with insufficient parallelism. This paper proposes a high-performance in-memory multi-bit addition (M-Add) and multiplication (M-Mul) scheme based on unipolar switching SOT-MRAM. For the M-Add scheme, transmission logic-based circuit design is proposed to realize single-step inter-column XOR operations, which is logically fits perfectly the g operator of parallel prefix algorithm. Further, the oBK algorithm is presented to maximize the g operator occupancy. For the M-Mul scheme, mapping the Booth decoder to the control signal of the proposed modified flip-flop queue, only two steps are required to realize the decoding of three encoded signals in parallel. The simulation results indicate the proposed design reduces the latency of N-bit Add (N-bit Mul) by an average of 82.6% (31.5%) compared to state-of-the-art CiM designs. Further, a CNN application based on proposed operations achieves 1.23 TOPS/w on the CIFAR-10 dataset, with an average of 47.57% increase over other CiM designs.
引用
收藏
页码:3165 / 3178
页数:14
相关论文
共 19 条
  • [1] High-Performance in-memory Logic Scheme using Unipolar Switching SOT-MRAM
    Zhu, Haonan
    Wu, Bi
    Chen, Ke
    Yan, Chenggang
    Liu, Weiqiang
    [J]. 2022 IEEE 22ND INTERNATIONAL CONFERENCE ON NANOTECHNOLOGY (NANO), 2022, : 112 - 115
  • [2] MLiM: High-Performance Magnetic Logic in-Memory Scheme With Unipolar Switching SOT-MRAM
    Wu, Bi
    Zhu, Haonan
    Chen, Ke
    Yan, Chenggang
    Liu, Weiqiang
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2023, 70 (06) : 2412 - 2424
  • [3] High Performance and Energy-Efficient In-Memory Computing Architecture based on SOT-MRAM
    He, Zhezhi
    Angizi, Shaahin
    Parveen, Farhana
    Fan, Deliang
    [J]. PROCEEDINGS OF THE IEEE/ACM INTERNATIONAL SYMPOSIUM ON NANOSCALE ARCHITECTURES (NANOARCH 2017), 2017, : 97 - 102
  • [4] Reconfigurable Bit-Serial Operation Using Toggle SOT-MRAM for High-Performance Computing in Memory Architecture
    Wang, Jinkai
    Bai, Yining
    Wang, Hongyu
    Hao, Zuolei
    Wang, Guanda
    Zhang, Kun
    Zhang, Youguang
    Lv, Weifeng
    Zhang, Yue
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2022, 69 (11) : 4535 - 4545
  • [5] Adaptive Non-Uniform Compressive Sensing Using SOT-MRAM Multi-Bit Precision Crossbar Arrays
    Salehi, Soheil
    DeMara, Ronald F.
    [J]. IEEE TRANSACTIONS ON NANOTECHNOLOGY, 2021, 20 : 224 - 228
  • [6] System and Design Technology Co-Optimization of SOT-MRAM for High-Performance AI Accelerator Memory System
    Mishty, Kaniz
    Sadi, Mehdi
    [J]. IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2024, 43 (04) : 1065 - 1078
  • [7] Multi-bit MRAM based high performance neuromorphic accelerator for image classification
    Verma, Gaurav
    Soni, Sandeep
    Nisar, Arshid
    Kaushik, Brajesh Kumar
    [J]. NEUROMORPHIC COMPUTING AND ENGINEERING, 2024, 4 (01):
  • [8] Multi-bit carry chains for high-performance reconfigurable fabrics
    Frederick, Michael T.
    Somani, Arun K.
    [J]. 2006 INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS, PROCEEDINGS, 2006, : 275 - 280
  • [9] Structures and Characteristics of High-performance Multi-bit Streaming Multiplayers
    Nykolaychuk, Yaroslav
    Davletova, Alina
    Humennyi, Petro
    Vozna, Natalia
    Pitukh, Igor
    Zastavnyy, Oleg
    [J]. 2020 10TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER INFORMATION TECHNOLOGIES (ACIT), 2020, : 323 - 326
  • [10] A High Performance, Multi-Bit Output Logic-in-Memory Adder
    Talafy, Javad
    Zokaee, Farzaneh
    Zarandi, Hamid R.
    Bagherzadeh, Nader
    [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2021, 9 (04) : 2223 - 2233