All-Digital Computing-in-Memory Macro Supporting FP64-Based Fused Multiply-Add Operation

被引:3
|
作者
Li, Dejian [1 ]
Mo, Kefan [2 ]
Liu, Liang [1 ]
Pan, Biao [2 ]
Li, Weili [1 ]
Kang, Wang [2 ]
Li, Lei [1 ]
机构
[1] Beijing Smartchip Microelect Technol Co Ltd, Beijing 102299, Peoples R China
[2] Beihang Univ, Sch Integrated Circuit Sci & Engn, Beijing 100191, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 07期
基金
中国国家自然科学基金;
关键词
digital computing-in-memory; floating-point arithmetic; fused multiply-add; scientific computing; matrix-vector multiplication; FLOATING-POINT;
D O I
10.3390/app13074085
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Recently, frequent data movement between computing units and memory during floating-point arithmetic has become a major problem for scientific computing. Computing-in-memory (CIM) is a novel computing paradigm that merges computing logic into memory, which can address the data movement problem with excellent power efficiency. However, the previous CIM paradigm failed to support double-precision floating-point format (FP64) due to its computing complexity. This paper presents a novel all-digital CIM macro-DCIM-FF to complete FP64 based fused multiply-add (FMA) operation for the first time. With 16 sub-CIM cells integrating digital multipliers to complete mantissa multiplication, DCIM-FF is able to provide correct rounded implementations for normalized/denormalized inputs in round-to-nearest-even mode and round-to-zero mode, respectively. To evaluate our design, we synthesized and tested the DCIM-FF macro in 55-nm CMOS technology. With a minimum power efficiency of 0.12 mW and a maximum computing efficiency of 26.9 TOPS/W, we successfully demonstrated that DCIM-FF can run the FP64-based FMA operation without error. Compared to related works, the proposed DCIM-FF macro shows significant power efficiency improvement and less area overhead based on CIM technology. This work paves a novel pathway for high-performance implementation of an FP64-based matrix-vector multiplication (MVM) operation, which is essential for hyperscale scientific computing.
引用
收藏
页数:13
相关论文
共 15 条
  • [1] A Digital SRAM-Based Computing-in-Memory Macro Supporting Parallel Maintaining for Network Management
    Li, Geng
    Zheng, Hanqing
    Sun, Jiacong
    Jiao, Hailong
    IEEE SOLID-STATE CIRCUITS LETTERS, 2024, 7 : 327 - 330
  • [2] FP-IMC: A 28nm All-Digital Configurable Floating-Point In-Memory Computing Macro
    Saikia, Jyotishman
    Sridharan, Amitesh
    Yeo, Injune
    Venkataramanaiah, Shreyas
    Fan, Deliang
    Seo, Jae-Sun
    IEEE 49TH EUROPEAN SOLID STATE CIRCUITS CONFERENCE, ESSCIRC 2023, 2023, : 405 - 408
  • [3] A 9T-SRAM based computing-in-memory with redundant unit and digital operation for boolean logic and MAC
    Li, Xin
    Gao, Mengya
    Ren, Zihua
    Yu, Kefeng
    Lu, Wenjuan
    Dai, Chenghu
    Hu, Wei
    Peng, Chunyu
    Wu, Xiulong
    MICROELECTRONICS JOURNAL, 2024, 145
  • [4] A 28nm 64Kb SRAM based Inference-Training Tri-Mode Computing-in-Memory Macro
    Pan, Nanbing
    Cui, Xiaoxin
    Qiao, Xin
    Xiao, Kanglin
    Guo, Qingyu
    Wang, Yuan
    2022 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS 22), 2022, : 2561 - 2565
  • [5] A Computing-in-Memory SRAM Macro Based on Fully-Capacitive-Coupling With Hierarchical Capacity Attenuator for 4-b MAC Operation
    Xiao, Kanglin
    Cui, Xiaoxin
    Qiao, Xin
    Pan, Nanbing
    Wang, Xin'An
    Wang, Yuan
    2022 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS 22), 2022, : 2551 - 2555
  • [6] A Local Computing Cell and 6T SRAM-Based Computing-in-Memory Macro With 8-b MAC Operation for Edge AI Chips
    Si, Xin
    Tu, Yung-Ning
    Huang, Wei-Hsing
    Su, Jian-Wei
    Lu, Pei-Jung
    Wang, Jing-Hong
    Liu, Ta-Wei
    Wu, Ssu-Yen
    Liu, Ruhui
    Chou, Yen-Chi
    Chung, Yen-Lin
    Shih, William
    Lo, Chung-Chuan
    Liu, Ren-Shuo
    Hsieh, Chih-Cheng
    Tang, Kea-Tiong
    Lien, Nan-Chun
    Shih, Wei-Chiang
    He, Yajuan
    Li, Qiang
    Chang, Meng-Fan
    IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2021, 56 (09) : 2817 - 2831
  • [7] A 40-nm, 64-Kb, 56.67 TOPS/W Voltage-Sensing Computing-In-Memory/Digital RRAM Macro Supporting Iterative Write With Verification and Online Read-Disturb Detection
    Yoon, Jong-Hyeok
    Chang, Muya
    Khwa, Win-San
    Chih, Yu-Der
    Chang, Meng-Fan
    Raychowdhury, Arijit
    IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2022, 57 (01) : 68 - 79
  • [8] An RRAM-Based Digital Computing-in-Memory Macro With Dynamic Voltage Sense Amplifier and Sparse-Aware Approximate Adder Tree
    He, Yifan
    Yue, Jinshan
    Feng, Xiaoyu
    Huang, Yuxuan
    Jia, Hongyang
    Wang, Jingyu
    Zhang, Lu
    Sun, Wenyu
    Yang, Huazhong
    Liu, Yongpan
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2023, 70 (02) : 416 - 420
  • [9] A 10T2C Capacitive SRAM-based Computing-In-Memory Macro with Array-Embedded DAC and Shift-and-Add Functions
    Kim, Eunhwan
    Oh, Hyunmyung
    Lee, Jehun
    Park, Jihoon
    Kwon, Myeongeun
    Kim, Jae-Joon
    2024 IEEE CUSTOM INTEGRATED CIRCUITS CONFERENCE, CICC, 2024,
  • [10] 15.5 A 28nm 64Kb 6T SRAM Computing-in-Memory Macro with 8b MAC Operation for AI Edge Chips
    Si, Xin
    Tu, Yung-Ning
    Huang, Wei-Hsing
    Su, Jian-Wei
    Lu, Pei-Jung
    Wang, Jing-Hong
    Liu, Ta-Wei
    Wu, Ssu-Yen
    Liu, Ruhui
    Chou, Yen-Chi
    Zhang, Zhixiao
    Sie, Syuan-Hao
    Wei, Wei-Chen
    Lo, Yun-Chen
    Wen, Tai-Hsing
    Hsu, Tzu-Hsiang
    Chen, Yen-Kai
    Shih, William
    Lo, Chung-Chuan
    Liu, Ren-Shuo
    Hsieh, Chih-Cheng
    Tang, Kea-Tiong
    Lien, Nan-Chun
    Shih, Wei-Chiang
    He, Yajuan
    Li, Qiang
    Chang, Meng-Fan
    2020 IEEE INTERNATIONAL SOLID- STATE CIRCUITS CONFERENCE (ISSCC), 2020, : 246 - +