共 50 条
- [21] Cooperative Online Learning in Stochastic and Adversarial MDPs INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
- [22] Reduction Techniques for Model Checking and Learning in MDPs PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 4273 - 4279
- [23] Learning option MDPs from small data 2018 ANNUAL AMERICAN CONTROL CONFERENCE (ACC), 2018, : 252 - 257
- [24] Minimax Model Learning 24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130
- [26] Reinforcement Learning in Reward-Mixing MDPs ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [27] TeXDYNA: Hierarchical Reinforcement Learning in Factored MDPs FROM ANIMALS TO ANIMATS 11, 2010, 6226 : 489 - +
- [28] Safety-Constrained Reinforcement Learning for MDPs TOOLS AND ALGORITHMS FOR THE CONSTRUCTION AND ANALYSIS OF SYSTEMS (TACAS 2016), 2016, 9636 : 130 - 146
- [29] Learning to Act in Decentralized Partially Observable MDPs INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
- [30] Belief Propagation for MiniMax Weight Matching MODELLING, COMPUTATION AND OPTIMIZATION IN INFORMATION SYSTEMS AND MANAGEMENT SCIENCES - MCO 2015, PT 1, 2015, 359 : 37 - 45