50 items in total
- [41] Simultaneously Learning Stochastic and Adversarial Episodic MDPs with Known Transition ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
- [42] Near-optimal Reinforcement Learning in Factored MDPs ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27
- [43] Reinforcement learning for MDPs using temporal difference schemes PROCEEDINGS OF THE 36TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-5, 1997 : 577 - 583
- [44] Exploiting Additive Structure in Factored MDPs for Reinforcement Learning RECENT ADVANCES IN REINFORCEMENT LEARNING, 2008, 5323 : 15 - 26
- [45] Minimax Lower Bounds via f-divergences 2010 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY, 2010 : 1340 - 1344
- [46] Lower Bounds on the Minimax Risk for the Source Localization Problem 2017 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2017
- [48] Learning with Safety Constraints: Sample Complexity of Reinforcement Learning for Constrained MDPs THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 7667 - 7674
- [49] Memory Lower Bounds of Reductions Revisited ADVANCES IN CRYPTOLOGY - EUROCRYPT 2018, PT I, 2018, 10820 : 61 - 90
- [50] Minimax Lower Bounds for Transfer Learning with Linear and One-hidden Layer Neural Networks ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33