50 entries in total
- [1] Reinforcement Learning Algorithms for Regret Minimization in Structured Markov Decision Processes [J]. AAMAS'16: Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, 2016 : 1289 - 1290
- [3] A Duality Approach for Regret Minimization in Average-Reward Ergodic Markov Decision Processes [J]. Learning for Dynamics and Control, Vol 120, 2020, 120 : 862 - 883
- [4] Dynamic Regret of Online Markov Decision Processes [J]. International Conference on Machine Learning, Vol 162, 2022
- [5] Parametric Regret in Uncertain Markov Decision Processes [J]. Proceedings of the 48th IEEE Conference on Decision and Control, 2009 Held Jointly with the 2009 28th Chinese Control Conference (CDC/CCC 2009), 2009 : 3606 - 3613
- [6] Episodic task learning in Markov decision processes [J]. Artificial Intelligence Review, 2011, 36 : 87 - 98
- [8] Variance minimization of parameterized Markov decision processes [J]. Discrete Event Dynamic Systems-Theory and Applications, 2018, 28 (01) : 63 - 81