Solving semi-Markov decision problems using average reward reinforcement learning

Cited by: 0
Authors
Dept. Indust. and Mgmt. Syst. Eng., University of South Florida, Tampa, FL 33620, United States [1 ]
Unknown [2]
Unknown [3]
Affiliations
Source
Manage Sci | / Vol. 4 / Issue 560-574
Keywords
DOI
Not available
Chinese Library Classification number
Subject classification code
Abstract
Related papers
50 items in total
  • [21] Semi-Markov decision processes with limiting ratio average rewards
    Sinha, Sagnik
    Mondal, Prasenjit
    JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 2017, 455 (01) : 864 - 871
  • [22] Model-based Reinforcement Learning for Semi-Markov Decision Processes with Neural ODEs
    Du, Jianzhun
    Futoma, Joseph
    Doshi-Velez, Finale
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [23] PERFORMABILITY ANALYSIS USING SEMI-MARKOV REWARD PROCESSES
    CIARDO, G
    MARIE, RA
    SERICOLA, B
    TRIVEDI, KS
    IEEE TRANSACTIONS ON COMPUTERS, 1990, 39 (10) : 1251 - 1264
  • [24] Semi-Markov Reinforcement Learning for Stochastic Resource Collection
    Schmoll, Sebastian
    Schubert, Matthias
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 3349 - 3355
  • [25] A reinforcement learning method based on an immune network adapted to a semi-Markov decision process
    Kogawa N.
    Obayashi M.
    Kobayashi K.
    Kuremoto T.
    Artificial Life and Robotics, 2009, 13 (2) : 538 - 542
  • [26] Semi-Markov decision problems and performance sensitivity analysis
    Cao, XR
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2003, 48 (05) : 758 - 769
  • [27] A reinforcement learning algorithm with fuzzy approximation for semi Markov decision problems
    Kula, Ufuk
    Ocaktan, Beyazit
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2015, 28 (04) : 1733 - 1744
  • [28] RELATIVE VALUE ITERATION FOR AVERAGE REWARD SEMI-MARKOV CONTROL VIA SIMULATION
    Gosavi, Abhijit
    2013 WINTER SIMULATION CONFERENCE (WSC), 2013, : 623 - 630
  • [29] Reward Algorithms for Semi-Markov Processes
    Dmitrii Silvestrov
    Raimondo Manca
    Methodology and Computing in Applied Probability, 2017, 19 : 1191 - 1209
  • [30] Reward Algorithms for Semi-Markov Processes
    Silvestrov, Dmitrii
    Manca, Raimondo
    METHODOLOGY AND COMPUTING IN APPLIED PROBABILITY, 2017, 19 (04) : 1191 - 1209