Preliminary Results Towards Reinforcement Learning with Mixed-signal Memristive Neuromorphic Circuits

Authors
Wu, Nan [1]
Vincent, Adrien F. [1, 2]
Strukov, Dmitri [1]
Affiliations
[1] UC Santa Barbara, Santa Barbara, CA 93106 USA
[2] Univ Bordeaux, IMS, Bordeaux INP, CNRS, UMR 5218, Bordeaux, France
Keywords
Artificial neural networks; Reinforcement learning; Memristor; ReRAM; In-situ training; Hardware implementation; Actor-Critic model
DOI
10.1109/iscas.2019.8702229
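Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809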
Abstract
As the end of Moore's law seems to be imminent, emerging technologies that enable high-performance neuromorphic hardware systems are attracting increasing attention. A very promising approach is to utilize memristors, programmable nonvolatile memory devices, as synaptic weights in neuromorphic circuits. One of the challenges for memristive hardware with integrated learning capabilities is the prohibitively large number of write cycles that might be required during the learning process. In this work we propose a memristive neuromorphic hardware implementation for reinforcement learning based on the temporal-difference actor-critic algorithm. As a case study, we consider the task of balancing an inverted pendulum, a classical problem in both reinforcement learning and control theory. We introduce training techniques that significantly reduce the number of weight updates and are suitable for efficient in-situ learning hardware implementations. We believe that this study shows the promise of using memristor-based hardware neural networks for handling complex tasks through in-situ reinforcement learning.
Pages: 5