A UNIFIED APPROACH TO ADAPTIVE-CONTROL OF AVERAGE REWARD MARKOV DECISION-PROCESSES

Cited by: 2
Author
HUBNER, G [1]
Affiliation
[1] UNIV HAMBURG,INST MATH STOCHASTIK,D-2000 HAMBURG 13,FED REP GER
DOI
10.1007/BF01740510
Chinese Library Classification (CLC) number
C93 [Management Science]; O22 [Operations Research];
Subject classification code
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
Pages: 161 - 166
Page count: 6
Related papers
50 items in total
  • [41] MARKOV DECISION-PROCESSES WITH BOTH CONTINUOUS AND IMPULSIVE CONTROL
    YUSHKEVICH, AA
    LECTURE NOTES IN CONTROL AND INFORMATION SCIENCES, 1986, 81 : 234 - 246
  • [42] A Duality Approach for Regret Minimization in Average-Reward Ergodic Markov Decision Processes
    Gong, Hao
    Wang, Mengdi
    LEARNING FOR DYNAMICS AND CONTROL, VOL 120, 2020, 120 : 862 - 883
  • [43] BATCH POLICY LEARNING IN AVERAGE REWARD MARKOV DECISION PROCESSES
    Liao, Peng
    Qi, Zhengling
    Wan, Runzhe
    Klasnja, Predrag
    Murphy, Susan A.
    ANNALS OF STATISTICS, 2022, 50 (06) : 3364 - 3387
  • [44] Learning and Planning in Average-Reward Markov Decision Processes
    Wan, Yi
    Naik, Abhishek
    Sutton, Richard S.
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139 : 7665 - 7676
  • [45] Bounded parameter Markov decision processes with average reward criterion
    Tewari, Ambuj
    Bartlett, Peter L.
    LEARNING THEORY, PROCEEDINGS, 2007, 4539 : 263 - +
  • [46] Pseudometrics for state aggregation in average reward Markov decision processes
    Ortner, Ronald
    ALGORITHMIC LEARNING THEORY, PROCEEDINGS, 2007, 4754 : 373 - 387
  • [47] REVERSIBLE MARKOV DECISION PROCESSES WITH AN AVERAGE-REWARD CRITERION
    Cogill, Randy
    Peng, Cheng
    SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2013, 51 (01) : 402 - 418
  • [48] APPROXIMATING THE MARKOV PROPERTY IN MARKOV DECISION-PROCESSES
    WHITE, DJ
    INFORMATION AND DECISION TECHNOLOGIES, 1989, 15 (03) : 147 - 162
  • [49] A UNIFIED THEORY FOR THE CGT APPROACH TO ADAPTIVE-CONTROL
    WEI, S
    SOBEL, KM
    INTERNATIONAL JOURNAL OF CONTROL, 1992, 56 (01) : 143 - 171
  • [50] TIME-AVERAGE OPTIMAL CONSTRAINED SEMI-MARKOV DECISION-PROCESSES
    BEUTLER, FJ
    ROSS, KW
    ADVANCES IN APPLIED PROBABILITY, 1986, 18 (02) : 341 - 359