Linear least-squares algorithms for temporal difference learning

被引:0
|
作者
机构
来源
Mach Learn | / 1-3卷 / 33期
关键词
D O I
暂无
中图分类号
学科分类号
摘要
引用
收藏
相关论文
共 50 条
  • [1] Linear least-squares algorithms for temporal difference learning
    Bradtke, SJ
    Barto, AG
    MACHINE LEARNING, 1996, 22 (1-3) : 33 - 57
  • [2] Least-Squares temporal difference learning
    Boyan, JA
    MACHINE LEARNING, PROCEEDINGS, 1999, : 49 - 56
  • [3] Least-Squares Temporal Difference Learning for the Linear Quadratic Regulator
    Tu, Stephen
    Recht, Benjamin
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [4] Technical Update: Least-Squares Temporal Difference Learning
    Justin A. Boyan
    Machine Learning, 2002, 49 : 233 - 246
  • [5] Technical update: Least-squares temporal difference learning
    Boyan, JA
    MACHINE LEARNING, 2002, 49 (2-3) : 233 - 246
  • [6] Multikernel Recursive Least-Squares Temporal Difference Learning
    Zhang, Chunyuan
    Zhu, Qingxin
    Niu, Xinzheng
    INTELLIGENT COMPUTING METHODOLOGIES, ICIC 2016, PT III, 2016, 9773 : 205 - 217
  • [7] Least-squares temporal difference learning based on an extreme learning machine
    Escandell-Montero, Pablo
    Martinez-Martinez, Jose M.
    Martin-Guerrero, Jose D.
    Soria-Olivas, Emilio
    Gomez-Sanchis, Juan
    NEUROCOMPUTING, 2014, 141 : 37 - 45
  • [8] Kernel Recursive Least-Squares Temporal Difference Algorithms with Sparsification and Regularization
    Zhang, Chunyuan
    Zhu, Qingxin
    Niu, Xinzheng
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2016, 2016
  • [9] Least-Squares SARSA(λ) Algorithms for Reinforcement Learning
    Chen, Sheng-Lei
    Wei, Yan-Mei
    ICNC 2008: FOURTH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 2, PROCEEDINGS, 2008, : 632 - +
  • [10] Deep reinforcement learning using least-squares truncated temporal-difference
    Ren, Junkai
    Lan, Yixing
    Xu, Xin
    Zhang, Yichuan
    Fang, Qiang
    Zeng, Yujun
    CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2024, 9 (02) : 425 - 439