Preliminary Results Towards Reinforcement Learning with Mixed-signal Memristive Neuromorphic Circuits

被引:0
|
作者
Wu, Nan [1 ]
Vincent, Adrien F. [1 ,2 ]
Strukov, Dmitri [1 ]
机构
[1] UC Santa Barabra, Santa Barbara, CA 93106 USA
[2] Univ Bordeaux, IMS, Bordeaux INP, CNRS,UMR 5218, Bordeaux, France
关键词
Artificial neural networks; Reinforcement learning; Memristor; ReRAM; In-situ training; Hardware implementation; Actor-Critic model;
D O I
10.1109/iscas.2019.8702229
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
As the end of Moore's law seems to be imminent, emerging technologies that enable high performance neuromorphic hardware systems are attracting increasing attention. A very promising approach is to utilize memristors, programmable nonvolatile memory devices, as synaptic weights in neuromorphic circuits. One of the challenges for memristive hardware with integrated learning capabilities is prohibitively larger number of write cycles that might be required during learning process. In this work we propose a memristive neuromorphic hardware implementation for reinforcement learning based on temporal difference actor-critic algorithm. As a case study, we consider a task of balancing an inverted pendulum, a classical problem in both reinforcement learning and control theory. We introduce training techniques that significantly reduce the number of weight updates and are suitable for efficient in-situ learning hardware implementations. We believe that this study shows the promise of using memristor-based hardware neural networks for handling complex tasks through in-situ reinforcement learning.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] A Mixed-Signal Approach to Memristive Neuromorphic System Design
    Chakma, Gangotree
    Sayyaparaju, Sagarvarma
    Weiss, Ryan
    Rose, Garrett S.
    2017 IEEE 60TH INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS (MWSCAS), 2017, : 547 - 550
  • [2] Memristive Mixed-Signal Neuromorphic Systems: Energy-Efficient Learning at the Circuit-Level
    Chakma, Gangotree
    Adnan, Md Musabbir
    Wyer, Austin R.
    Weiss, Ryan
    Schuman, Catherine D.
    Rose, Garrett S.
    IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, 2018, 8 (01) : 125 - 136
  • [4] Towards a Machine Learning based Method for Indirect Test Generation of Mixed-Signal Circuits
    Ferreira, Allan F. G.
    Zilch, Lucas B.
    Navarro, Vinicius
    Lubaszewski, Marcelo S.
    Balen, Tiago R.
    2023 36TH SBC/SBMICRO/IEEE/ACM SYMPOSIUM ON INTEGRATED CIRCUITS AND SYSTEMS DESIGN, SBCCI, 2023, : 137 - 142
  • [5] Energy-Efficient CMOS Memristive Synapses for Mixed-Signal Neuromorphic System-on-a-Chip
    Saxena, Vishal
    Wu, Xinyu
    Zhu, Kehan
    2018 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2018,
  • [6] Mixed-Signal Neuromorphic Inference Accelerators: Recent Results and Future Prospects
    Bavandpour, M.
    Mahmoodi, M. R.
    Nili, H.
    Bayat, F. Merrikh
    Prezioso, M.
    Vincent, A.
    Strukov, D. B.
    Likharev, K. K.
    2018 IEEE INTERNATIONAL ELECTRON DEVICES MEETING (IEDM), 2018,
  • [7] Programmable mixed-signal circuits
    S. Tappertzhofen
    SN Applied Sciences, 2023, 5
  • [8] Programmable mixed-signal circuits
    Tappertzhofen, S.
    SN APPLIED SCIENCES, 2023, 5 (12):
  • [9] Mixed-Signal Neuromorphic Computing Circuits Using Hybrid CMOS-RRAM Integration
    Saxena, Vishal
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2021, 68 (02) : 581 - 586
  • [10] A Reconfigurable Mixed-signal Implementation of a Neuromorphic ADC
    Xu, Ying
    Thakur, Chetan Singh
    Hamilton, Tara Julia
    Tapson, Jonathan
    Wang, Runchun
    van Schaik, Andre
    2015 IEEE BIOMEDICAL CIRCUITS AND SYSTEMS CONFERENCE (BIOCAS), 2015, : 519 - 522