Memristor-Based Multilayer Neural Networks With Online Gradient Descent Training

被引:185
|
作者
Soudry, Daniel [1 ,2 ]
Di Castro, Dotan [3 ]
Gal, Asaf [4 ]
Kolodny, Avinoam [5 ]
Kvatinsky, Shahar [6 ]
机构
[1] Columbia Univ, Dept Stat, New York, NY 10027 USA
[2] Columbia Univ, Grossman Ctr Stat Mind, Dept Stat, New York, NY 10027 USA
[3] Yahoo Labs, IL-31905 Haifa, Israel
[4] Technion Israel Inst Technol, Dept Elect Engn, Biol Networks Res Labs, IL-32000 Haifa, Israel
[5] Technion Israel Inst Technol, Dept Elect Engn, IL-32000 Haifa, Israel
[6] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
关键词
Backpropagation; hardware; memristive systems; memristor; multilayer neural networks (MNNs); stochastic gradient descent; synapse; ANALOG; PLASTICITY; DEVICES; DESIGN; MODEL;
D O I
10.1109/TNNLS.2014.2383395
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning in multilayer neural networks (MNNs) relies on continuous updating of large matrices of synaptic weights by local rules. Such locality can be exploited for massive parallelism when implementing MNNs in hardware. However, these update rules require a multiply and accumulate operation for each synaptic weight, which is challenging to implement compactly using CMOS. In this paper, a method for performing these update operations simultaneously (incremental outer products) using memristor-based arrays is proposed. The method is based on the fact that, approximately, given a voltage pulse, the conductivity of a memristor will increment proportionally to the pulse duration multiplied by the pulse magnitude if the increment is sufficiently small. The proposed method uses a synaptic circuit composed of a small number of components per synapse: one memristor and two CMOS transistors. This circuit is expected to consume between 2% and 8% of the area and static power of previous CMOS-only hardware alternatives. Such a circuit can compactly implement hardware MNNs trainable by scalable algorithms based on online gradient descent (e.g., backpropagation). The utility and robustness of the proposed memristor-based circuit are demonstrated on standard supervised learning tasks.
引用
收藏
页码:2408 / 2421
页数:14
相关论文
共 50 条
  • [1] A Fully Analog Memristor-Based Neural Network with Online Gradient Training
    Rosenthal, Eyal
    Greshnikov, Sergey
    Soudry, Daniel
    Kvatinsky, Shahar
    [J]. 2016 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2016, : 1394 - 1397
  • [2] Offline Training for Memristor-based Neural Networks
    Boquet, Guillem
    Macias, Edwar
    Morell, Antoni
    Serrano, Javier
    Miranda, Enrique
    Lopez Vicario, Jose
    [J]. 28TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2020), 2021, : 1547 - 1551
  • [3] Memristor-Based Circuit Design for Multilayer Neural Networks
    Zhang, Yang
    Wang, Xiaoping
    Friedman, Eby G.
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2018, 65 (02) : 677 - 686
  • [4] Memristor-based neural networks
    Thomas, Andy
    [J]. JOURNAL OF PHYSICS D-APPLIED PHYSICS, 2013, 46 (09)
  • [5] Memristor-based neural networks with weight simultaneous perturbation training
    Wang, Chunhua
    Xiong, Lin
    Sun, Jingru
    Yao, Wei
    [J]. NONLINEAR DYNAMICS, 2019, 95 (04) : 2893 - 2906
  • [6] Memristor-based neural networks with weight simultaneous perturbation training
    Chunhua Wang
    Lin Xiong
    Jingru Sun
    Wei Yao
    [J]. Nonlinear Dynamics, 2019, 95 : 2893 - 2906
  • [7] Advances in Memristor-Based Neural Networks
    Xu, Weilin
    Wang, Jingjuan
    Yan, Xiaobing
    [J]. FRONTIERS IN NANOTECHNOLOGY, 2021, 3
  • [8] One-Pass Online Learning Based on Gradient Descent for Multilayer Spiking Neural Networks
    Lin, Xianghong
    Hu, Tiandou
    Wang, Xiangwen
    [J]. IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2023, 15 (01) : 16 - 31
  • [9] Training memristor-based multilayer neuromorphic networks with SGD, momentum and adaptive learning rates
    Yan, Zheng
    Chen, Jiadong
    Hu, Rui
    Huang, Tingwen
    Chen, Yiran
    Wen, Shiping
    [J]. NEURAL NETWORKS, 2020, 128 : 142 - 149
  • [10] Adjusting Learning Rate of Memristor-Based Multilayer Neural Networks via Fuzzy Method
    Wen, Shiping
    Xiao, Shuixin
    Yang, Yin
    Yan, Zheng
    Zeng, Zhigang
    Huang, Tingwen
    [J]. IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2019, 38 (06) : 1084 - 1094