Actor-critic algorithm as multi-time-scale stochastic approximation

Cited by: 0
Institution: Indian Inst of Science, Bangalore, India [1]
Source: Sadhana, pt 4, pp. 525-543
DOI: not available
Related papers (50 in total)
  • [1] The actor-critic algorithm as multi-time-scale stochastic approximation
    Borkar, VS
    Konda, VR
    SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 1997, 22 (4) : 525 - 543
  • [2] The actor-critic algorithm as multi-time-scale stochastic approximation
    Vivek S Borkar
    Vijaymohan R Konda
    Sadhana, 1997, 22 : 525 - 543
  • [3] An Actor-Critic Algorithm for the Stochastic Cutting Stock Problem
    Su, Jie-Ying
    Kang, Jia-Lin
    Jang, Shi-Shang
    PROCESSES, 2023, 11 (04)
  • [4] A Hessian Actor-Critic Algorithm
    Wang, Jing
    Paschalidis, Ioannis Ch
    2014 IEEE 53RD ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2014 : 1131 - 1136
  • [5] A simultaneous perturbation Stochastic approximation-based actor-critic algorithm for Markov decision processes
    Bhatnagar, S
    Kumar, S
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2004, 49 (04) : 592 - 598
  • [6] An Actor-Critic Algorithm With Second-Order Actor and Critic
    Wang, Jing
    Paschalidis, Ioannis Ch.
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2017, 62 (06) : 2689 - 2703
  • [7] On Finite-Time Convergence of Actor-Critic Algorithm
    Qiu S.
    Yang Z.
    Ye J.
    Wang Z.
    IEEE Journal on Selected Areas in Information Theory, 2021, 2 (02) : 652 - 664
  • [8] An actor-critic algorithm for multi-agent learning in queue-based stochastic games
    Sundar, D. Krishna
    Ravikumar, K.
    NEUROCOMPUTING, 2014, 127 : 258 - 265
  • [9] An Actor-Critic Algorithm for SVM Hyperparameters
    Kim, Chayoung
    Park, Jung-min
    Kim, Hye-young
    INFORMATION SCIENCE AND APPLICATIONS 2018, ICISA 2018, 2019, 514 : 653 - 661
  • [10] Actor-Critic or Critic-Actor? A Tale of Two Time Scales
    Bhatnagar, Shalabh
    Borkar, Vivek S.
    Guin, Soumyajit
    IEEE CONTROL SYSTEMS LETTERS, 2023, 7 : 2671 - 2676