Hardware-software co-exploration with racetrack memory based in-memory computing for CNN inference in embedded systems

被引:2
|
作者
Choong, Benjamin Chen Ming [1 ]
Luo, Tao [2 ]
Liu, Cheng [3 ]
He, Bingsheng [4 ]
Zhang, Wei [5 ]
Zhou, Joey Tianyi [6 ]
机构
[1] Natl Univ Singapore, Dept Elect & Comp Engn, 4 Engn Dr 3, Singapore 117583, Singapore
[2] Agcy Sci Technol & Res, Inst High Performance Comp, 1 Fusionopolis Way,16-16 Connexis, Singapore 138632, Singapore
[3] Chinese Acad Sci, Inst Comp Technol, 6 Kexueyuan South Rd, Beijing 100190, Peoples R China
[4] Natl Univ Singapore, Sch Comp, COM1,13 Comp Dr, Singapore 117417, Singapore
[5] Hong Kong Univ Sci & Technol, Kowloon, Clear Water Bay, Hong Kong, Peoples R China
[6] ASTAR, Ctr Frontier AI Res, 1 Fusionopolis Way,16-16 Connexis, Singapore 138632, Singapore
关键词
Artificial intelligence; Hardware-software co-design; Deep learning; Embedded systems; Emerging memory; NEURAL-NETWORKS; ENERGY; ARCHITECTURE; ACCELERATOR; MACHINE; ADDER;
D O I
10.1016/j.sysarc.2022.102507
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Deep neural networks generate and process large volumes of data, posing challenges for low-resource embedded systems. In-memory computing has been demonstrated as an efficient computing infrastructure and shows promise for embedded AI applications. Among newly-researched memory technologies, racetrack memory is a non-volatile technology that allows high data density fabrication, making it a good fit for in memory computing. However, integrating in-memory arithmetic circuits with memory cells affects both the memory density and power efficiency. It remains challenging to build efficient in-memory arithmetic circuits on racetrack memory within area and energy constraints. To this end, we present an efficient in-memory convolutional neural network (CNN) accelerator optimized for use with racetrack memory. We design a series of fundamental arithmetic circuits as in-memory computing cells suited for multiply-and-accumulate operations. Moreover, we explore the design space of racetrack memory based systems and CNN model architectures, employing co-design to improve the efficiency and performance of performing CNN inference in racetrack memory while maintaining model accuracy. Our designed circuits and model-system co-optimization strategies achieve a small memory bank area with significant improvements in energy and performance for racetrack memory based embedded systems.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] Design Space and Memory Technology Co-exploration for In-Memory Computing Based Machine Learning Accelerators
    He, Kang
    Chakraborty, Indranil
    Wang, Cheng
    Roy, Kaushik
    2022 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER AIDED DESIGN, ICCAD, 2022,
  • [2] Optimizing hardware-software co-design based on non-ideality in memristor crossbars for in-memory computing
    Pinfeng JIANG
    Danzhe SONG
    Menghua HUANG
    Fan YANG
    Letian WANG
    Pan LIU
    Xiangshui MIAO
    Xingsheng WANG
    Science China(Information Sciences), 2025, 68 (02) : 354 - 369
  • [3] Optimizing hardware-software co-design based on non-ideality in memristor crossbars for in-memory computing
    Pinfeng Jiang
    Danzhe Song
    Menghua Huang
    Fan Yang
    Letian Wang
    Pan Liu
    Xiangshui Miao
    Xingsheng Wang
    Science China Information Sciences, 2025, 68 (2)
  • [4] Hardware-Software Co-Design of an In-Memory Transformer Network Accelerator
    Laguna, Ann Franchesca
    Sharifi, Mohammed Mehdi
    Kazemi, Arman
    Yin, Xunzhao
    Niemier, Michael
    Hu, X. Sharon
    FRONTIERS IN ELECTRONICS, 2022, 3
  • [5] Error Resilient In-Memory Computing Architecture for CNN Inference on the Edge
    Rios, Marco
    Ponzina, Flavio
    Ansaloni, Giovanni
    Levisse, Alexandre
    Atienza, David
    PROCEEDINGS OF THE 32ND GREAT LAKES SYMPOSIUM ON VLSI 2022, GLSVLSI 2022, 2022, : 249 - 254
  • [6] HARDWARE-SOFTWARE CO-DESIGN OF EMBEDDED SYSTEMS
    WOLF, WH
    PROCEEDINGS OF THE IEEE, 1994, 82 (07) : 967 - 989
  • [7] A Skyrmion Racetrack Memory based Computing In-memory Architecture for Binary Neural Convolutional Network
    Pan, Yu
    Ouyang, Peng
    Zhao, Yinglin
    Yin, Shouyi
    Zhang, Youguang
    Wei, Shaojun
    Zhao, Weisheng
    GLSVLSI '19 - PROCEEDINGS OF THE 2019 ON GREAT LAKES SYMPOSIUM ON VLSI, 2019, : 271 - 274
  • [8] COSYN: Hardware-software co-synthesis of embedded systems
    Dave, BP
    Lakshminarayana, G
    Jha, NK
    DESIGN AUTOMATION CONFERENCE - PROCEEDINGS 1997, 1997, : 703 - 708
  • [9] Hardware/Software Co-Design With ADC-Less In-Memory Computing Hardware for Spiking Neural Networks
    Apolinario, Marco Paul E.
    Kosta, Adarsh Kumar
    Saxena, Utkarsh
    Roy, Kaushik
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2024, 12 (01) : 35 - 47
  • [10] A hardware/software co-design methodology for in-memory processors
    Yantir, Hasan Erdem
    Eltawil, Ahmed M.
    Salama, Khaled N.
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2022, 161 : 63 - 71