IMPACT OF COMPUTATION IN INTEGRAL REINFORCEMENT LEARNING FOR CONTINUOUS-TIME CONTROL

被引:0
|
作者
Cao, Wenhan [1 ,2 ]
Pan, Wei [1 ]
机构
[1] Department of Computer Science, University of Manchester, United Kingdom
[2] School of Vehicle and Mobility, Tsinghua University, China
关键词
Compendex;
D O I
暂无
中图分类号
学科分类号
摘要
Reinforcement learning
引用
收藏
相关论文
共 50 条
  • [21] Continuous-time reinforcement learning for robust control under worst-case uncertainty
    Perrusquia, Adolfo
    Yu, Wen
    INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2021, 52 (04) : 770 - 784
  • [22] Integral Reinforcement Learning for Continuous-Time Input-Affine Nonlinear Systems With Simultaneous Invariant Explorations
    Lee, Jae Young
    Park, Jin Bae
    Choi, Yoon Ho
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 26 (05) : 916 - 932
  • [23] Data-Driven Integral Reinforcement Learning for Continuous-Time Non-Zero-Sum Games
    Yang, Yongliang
    Wang, Liming
    Modares, Hamidreza
    Ding, Dawei
    Yin, Yixin
    Wunsch, Donald
    IEEE ACCESS, 2019, 7 : 82901 - 82912
  • [24] Reinforcement Learning for Linear Continuous-time Systems: an Incremental Learning Approach
    Tao Bian
    Zhong-Ping Jiang
    IEEE/CAA Journal of Automatica Sinica, 2019, 6 (02) : 433 - 440
  • [25] Reinforcement Learning for Linear Continuous-time Systems: an Incremental Learning Approach
    Bian, Tao
    Jiang, Zhong-Ping
    IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2019, 6 (02) : 433 - 440
  • [26] Integral Reinforcement Learning for Linear Continuous-Time Zero-Sum Games With Completely Unknown Dynamics
    Li, Hongliang
    Liu, Derong
    Wang, Ding
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2014, 11 (03) : 706 - 714
  • [27] Continuous-time reinforcement learning approach for portfolio management with time penalization
    Garcia-Galicia, Mauricio
    Carsteanu, Alin A.
    Clempner, Julio B.
    EXPERT SYSTEMS WITH APPLICATIONS, 2019, 129 : 27 - 36
  • [28] Efficient continuous-time reinforcement learning with adaptive state graphs
    Neumann, Gerhard
    Pfeiffer, Michael
    Maass, Wolfgang
    MACHINE LEARNING: ECML 2007, PROCEEDINGS, 2007, 4701 : 250 - +
  • [29] Integral reinforcement learning and experience replay for adaptive optimal control of partially-unknown constrained-input continuous-time systems
    Modares, Hamidreza
    Lewis, Frank L.
    Naghibi-Sistani, Mohammad-Bagher
    AUTOMATICA, 2014, 50 (01) : 193 - 202
  • [30] Continuous-time fuzzy control and learning methods
    Valtonen, M.
    Vainio, A. -M.
    Vanhala, J.
    2007 INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES, VOLS 1-3, 2007, : 346 - 351