NAS4RRAM: neural network architecture search for inference on RRAM-based accelerators

被引:17
|
作者
Yuan, Zhihang [1 ]
Liu, Jingze [1 ]
Li, Xingchen [1 ]
Yan, Longhao [2 ,3 ]
Chen, Haoxiang [1 ]
Wu, Bingzhe [1 ]
Yang, Yuchao [2 ,3 ]
Sun, Guangyu [1 ]
机构
[1] Peking Univ, Ctr Energy Efficient Comp & Applicat, Beijing 100871, Peoples R China
[2] Peking Univ, Dept Micro Nanoelect, Beijing 100871, Peoples R China
[3] Peking Univ, Ctr Brain Inspired Chips, Inst Artificial Intelligence, Beijing 100871, Peoples R China
基金
中国国家自然科学基金;
关键词
network architecture search (NAS); neural networks; RRAM-based accelerator; hardware noise; quantization;
D O I
10.1007/s11432-020-3245-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The RRAM-based accelerators enable fast and energy-efficient inference for neural networks. However, there are some requirements to deploy neural networks on RRAM-based accelerators, which are not considered in existing neural networks. (1) Because the noise problem and analog-digital converters/digital-analog converters (ADC/DAC) affect the prediction accuracy, they should be modeled in networks. (2) Because the weights are mapped to the RRAM cells, they should be quantized, and the number of weights is limited by the number of RRAM cells in the accelerator. These requirements motivate us to customize the hardware-friendly network for the RRAM-based accelerator. We take the idea of network architecture search (NAS) to design networks with high prediction accuracy that meet the requirements. We propose a framework called NAS4RRAM to search for the optimal network on the given RRAM-based accelerator. The experiments demonstrate that NAS4RRAM can apply to different RRAM-based accelerators with different scales. The performance of searched networks outperforms the manually designed ResNet.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] NAS4RRAM: neural network architecture search for inference on RRAM-based accelerators
    Zhihang Yuan
    Jingze Liu
    Xingchen Li
    Longhao Yan
    Haoxiang Chen
    Bingzhe Wu
    Yuchao Yang
    Guangyu Sun
    Science China Information Sciences, 2021, 64
  • [2] NAS4RRAM: neural network architecture search for inference on RRAM-based accelerators
    Zhihang YUAN
    Jingze LIU
    Xingchen LI
    Longhao YAN
    Haoxiang CHEN
    Bingzhe WU
    Yuchao YANG
    Guangyu SUN
    Science China(Information Sciences), 2021, 64 (06) : 89 - 99
  • [3] A RRAM-based Coarse Grain Reconfigurable Array for Neural Network Accelerators
    Chen, Zhengyu
    Zhou, Hai
    Gu, Jie
    2018 IEEE SOI-3D-SUBTHRESHOLD MICROELECTRONICS TECHNOLOGY UNIFIED CONFERENCE (S3S), 2018,
  • [4] An RRAM-Based Oscillatory Neural Network
    Jackson, Thomas C.
    Sharma, Abhishek A.
    Bain, James A.
    Weldon, Jeffrey A.
    Pileggi, Lawrence
    2015 IEEE 6TH LATIN AMERICAN SYMPOSIUM ON CIRCUITS & SYSTEMS (LASCAS), 2015,
  • [5] Mitigating read-program variation and IR drop by circuit architecture in RRAM-based neural network accelerators
    Lepri, Nicola
    Glukhov, Artem
    Ielmini, Daniele
    2022 IEEE INTERNATIONAL RELIABILITY PHYSICS SYMPOSIUM (IRPS), 2022,
  • [6] Mitigating Read-Program Variation and IR Drop by Circuit Architecture in RRAM-Based Neural Network Accelerators
    Lepri, Nicola
    Glukhov, Artem
    Ielmini, Daniele
    2022 IEEE INTERNATIONAL RELIABILITY PHYSICS SYMPOSIUM (IRPS), 2022,
  • [7] High-speed programming with threshold division for RRAM-based neural network accelerators
    Du, Xiangyu
    Chen, Taiping
    Su, Man
    Li, Zhen
    Tong, Peiwen
    Wang, Wei
    Cao, Rongrong
    AIP ADVANCES, 2024, 14 (12)
  • [8] RRAM-Based TCAMs for Pattern Search
    Zheng, Le
    Shin, Sangho
    Lloyd, Scott
    Gokhale, Maya
    Kim, Kyungmin
    Kang, Sung-Mo
    2016 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2016, : 1382 - 1385
  • [9] WRAP: Weight RemApping and Processing in RRAM-based Neural Network Accelerators Considering Thermal Effect
    Chen, Po-Yuan
    Gu, Fang-Yi
    Huang, Yu-Hong
    Lin, Ing-Chao
    PROCEEDINGS OF THE 2022 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2022), 2022, : 1245 - 1250
  • [10] On the Reliability of RRAM-Based Neural Networks
    Aziza, Hassen
    Zambelli, Cristian
    Hamdioui, Said
    Diware, Sumit
    Bishnoi, Rajendra
    Gebregiorgis, Anteneh
    2023 IFIP/IEEE 31ST INTERNATIONAL CONFERENCE ON VERY LARGE SCALE INTEGRATION, VLSI-SOC, 2023, : 13 - 20