Memristor-Based Multilayer Neural Networks With Online Gradient Descent Training

Cited by: 185

Authors:

Soudry, Daniel [1,2]

Di Castro, Dotan [3]

Gal, Asaf [4]

Kolodny, Avinoam [5]

Kvatinsky, Shahar [6]

Affiliations:

[1] Columbia Univ, Dept Stat, New York, NY 10027 USA

[2] Columbia Univ, Grossman Ctr Stat Mind, Dept Stat, New York, NY 10027 USA

[3] Yahoo Labs, IL-31905 Haifa, Israel

[4] Technion Israel Inst Technol, Dept Elect Engn, Biol Networks Res Labs, IL-32000 Haifa, Israel

[5] Technion Israel Inst Technol, Dept Elect Engn, IL-32000 Haifa, Israel

[6] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA

Source:

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2015 / Vol. 26 / No. 10

Keywords:

Backpropagation; hardware; memristive systems; memristor; multilayer neural networks (MNNs); stochastic gradient descent; synapse; ANALOG; PLASTICITY; DEVICES; DESIGN; MODEL;

DOI:

10.1109/TNNLS.2014.2383395

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Learning in multilayer neural networks (MNNs) relies on continuous updating of large matrices of synaptic weights by local rules. Such locality can be exploited for massive parallelism when implementing MNNs in hardware. However, these update rules require a multiply and accumulate operation for each synaptic weight, which is challenging to implement compactly using CMOS. In this paper, a method for performing these update operations simultaneously (incremental outer products) using memristor-based arrays is proposed. The method is based on the fact that, approximately, given a voltage pulse, the conductivity of a memristor will increment proportionally to the pulse duration multiplied by the pulse magnitude if the increment is sufficiently small. The proposed method uses a synaptic circuit composed of a small number of components per synapse: one memristor and two CMOS transistors. This circuit is expected to consume between 2% and 8% of the area and static power of previous CMOS-only hardware alternatives. Such a circuit can compactly implement hardware MNNs trainable by scalable algorithms based on online gradient descent (e.g., backpropagation). The utility and robustness of the proposed memristor-based circuit are demonstrated on standard supervised learning tasks.
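The abstract's central claim is that, for small increments, a memristor's conductance change is approximately proportional to pulse duration times pulse magnitude, so a whole array can apply an outer-product weight update at once. The following is a minimal numerical sketch of that idea, not the authors' circuit. The function names, the `gain` constant, and the choice to encode the input `x` as pulse magnitude and the error signal `y` as pulse duration and polarity are all assumptions made for illustration.

```python
import numpy as np


def outer_product_update(W, x, y, eta):
    """Reference update: W += eta * outer(y, x), as in online gradient descent."""
    return W + eta * np.outer(y, x)


def pulse_update(W, x, y, eta, gain=1.0):
    """Same update written as one pulse per synapse (idealized, hypothetical model).

    Assumption: synapse (i, j) receives a pulse of magnitude x[j] lasting
    eta * |y[i]|, with polarity sign(y[i]), and its weight changes by
    gain * duration * magnitude (the small-increment approximation).
    """
    magnitude = x                   # one pulse magnitude per input column
    duration = eta * np.abs(y)      # one pulse duration per output row
    polarity = np.sign(y)           # direction of the conductance change
    delta = gain * (polarity * duration)[:, None] * magnitude[None, :]
    return W + delta


# Hypothetical sizes: 3 outputs, 4 inputs.
rng = np.random.default_rng(0)
W = rng.normal(size=(3, 4))
x = rng.normal(size=4)
y = rng.normal(size=3)

W_ref = outer_product_update(W, x, y, eta=0.01)
W_pulse = pulse_update(W, x, y, eta=0.01)
print(np.allclose(W_ref, W_pulse))
```

Under this idealized linear model the two updates coincide exactly. A real device would deviate from it as the increments grow, which is why the abstract restricts the claim to sufficiently small increments.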


Pages: 2408-2421

Page count: 14

50 items in total

[1] A Fully Analog Memristor-Based Neural Network with Online Gradient Training
Rosenthal, Eyal
Greshnikov, Sergey
Soudry, Daniel
Kvatinsky, Shahar
[J]. 2016 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2016, : 1394 - 1397
[2] Offline Training for Memristor-based Neural Networks
Boquet, Guillem
Macias, Edwar
Morell, Antoni
Serrano, Javier
Miranda, Enrique
Lopez Vicario, Jose
[J]. 28TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2020), 2021, : 1547 - 1551
[3] Memristor-Based Circuit Design for Multilayer Neural Networks
Zhang, Yang
Wang, Xiaoping
Friedman, Eby G.
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2018, 65 (02) : 677 - 686
[4] Memristor-based neural networks
Thomas, Andy
[J]. JOURNAL OF PHYSICS D-APPLIED PHYSICS, 2013, 46 (09)
[5] Memristor-based neural networks with weight simultaneous perturbation training
Wang, Chunhua
Xiong, Lin
Sun, Jingru
Yao, Wei
[J]. NONLINEAR DYNAMICS, 2019, 95 (04) : 2893 - 2906
[6] Memristor-based neural networks with weight simultaneous perturbation training
Chunhua Wang
Lin Xiong
Jingru Sun
Wei Yao
[J]. Nonlinear Dynamics, 2019, 95 : 2893 - 2906
[7] Advances in Memristor-Based Neural Networks
Xu, Weilin
Wang, Jingjuan
Yan, Xiaobing
[J]. FRONTIERS IN NANOTECHNOLOGY, 2021, 3
[8] One-Pass Online Learning Based on Gradient Descent for Multilayer Spiking Neural Networks
Lin, Xianghong
Hu, Tiandou
Wang, Xiangwen
[J]. IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2023, 15 (01) : 16 - 31
[9] Training memristor-based multilayer neuromorphic networks with SGD, momentum and adaptive learning rates
Yan, Zheng
Chen, Jiadong
Hu, Rui
Huang, Tingwen
Chen, Yiran
Wen, Shiping
[J]. NEURAL NETWORKS, 2020, 128 : 142 - 149
[10] Adjusting Learning Rate of Memristor-Based Multilayer Neural Networks via Fuzzy Method
Wen, Shiping
Xiao, Shuixin
Yang, Yin
Yan, Zheng
Zeng, Zhigang
Huang, Tingwen
[J]. IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2019, 38 (06) : 1084 - 1094
