Solving semi-Markov decision problems using average reward reinforcement learning

Cited by: 0
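Authors
Dept. Indust. and Mgmt. Syst. Eng., University of South Florida, Tampa, FL 33620, United States [1]
Unknown [2]
Unknown [3]
Affiliations
Source
Manage Sci, Issue 4, pp. 560-574
Keywords
DOI
Not available
Chinese Library Classification number
Subject classification code
Abstract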
Related papers
50 in total
  • [41] PARTIALLY OBSERVABLE SEMI-MARKOV REWARD PROCESSES
    MASUDA, Y
    JOURNAL OF APPLIED PROBABILITY, 1993, 30 (03) : 548 - 560
  • [42] On mean reward variance in semi-Markov processes
    Sladky, K
    MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2005, 62 (03) : 387 - 397
  • [43] Using Semi-Markov Chains to Solve Semi-Markov Processes
    Bei Wu
    Brenda Ivette Garcia Maya
    Nikolaos Limnios
    Methodology and Computing in Applied Probability, 2021, 23 : 1419 - 1431
  • [44] Relations between discounted models and average models for semi-Markov decision processes
    Yin, Bao-Qun
    Li, Yan-Jie
    Tang, Hao
    Dai, Gui-Ping
    Xi, Hong-Sheng
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2006, 23 (01) : 65 - 68
  • [45] New Average Optimality Conditions for Semi-Markov Decision Processes in Borel Spaces
    Qingda Wei
    Xianping Guo
    Journal of Optimization Theory and Applications, 2012, 153 : 709 - 732
  • [46] TIME-AVERAGE OPTIMAL CONSTRAINED SEMI-MARKOV DECISION-PROCESSES
    BEUTLER, FJ
    ROSS, KW
    ADVANCES IN APPLIED PROBABILITY, 1986, 18 (02) : 341 - 359
  • [47] New Average Optimality Conditions for Semi-Markov Decision Processes in Borel Spaces
    Wei, Qingda
    Guo, Xianping
    JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2012, 153 (03) : 709 - 732
  • [48] Solving generalized semi-Markov decision processes using continuous phase-type distributions
    Younes, HLS
    Simmons, RG
    PROCEEDING OF THE NINETEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND THE SIXTEENTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2004 : 742 - 747
  • [49] Optimal replacement of a system according to a semi-Markov decision process in a semi-Markov environment
    Hu, QY
    Yue, WY
    OPTIMIZATION METHODS & SOFTWARE, 2003, 18 (02) : 181 - 196
  • [50] Zero-Sum Semi-Markov Games with the Risk-Sensitive Average Reward Criterion
    Chen, Fang
    Guo, Xin
    JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2025, 204 (03)