50 entries in total
- [1] Markov reward models and Markov decision processes in discrete and continuous time: Performance evaluation and optimization Gouberman, Alexander (alexander.gouberman@unibw.de), 1600, Springer Verlag (8453): 156 - 241
- [3] Qualitative Controller Synthesis for Consumption Markov Decision Processes COMPUTER AIDED VERIFICATION, PT II, 2020, 12225 : 421 - 447
- [4] RISK SENSITIVE DISCRETE- AND CONTINUOUS-TIME MARKOV REWARD PROCESSES PROCEEDINGS OF THE INTERNATIONAL CONFERENCE QUANTITATIVE METHODS IN ECONOMICS (MULTIPLE CRITERIA DECISION MAKING XIV), 2008, : 272 - 281
- [6] Markov Decision Processes with Arbitrary Reward Processes RECENT ADVANCES IN REINFORCEMENT LEARNING, 2008, 5323 : 268 - +
- [8] Approximate gradient methods in policy-space optimization of Markov reward processes DISCRETE EVENT DYNAMIC SYSTEMS-THEORY AND APPLICATIONS, 2003, 13 (1-2): 111 - 148
- [9] Approximate Gradient Methods in Policy-Space Optimization of Markov Reward Processes Discrete Event Dynamic Systems, 2003, 13 : 111 - 148
- [10] Online Markov Decision Processes Configuration with Continuous Decision Space THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 13, 2024, : 14315 - 14322