50 items in total
- [34] Advantage Based Value Iteration for Markov Decision Processes with Unknown Rewards [J]. 2016 International Joint Conference on Neural Networks (IJCNN), 2016: 3837-3844
- [35] The Variance of Discounted Markov Decision-Processes [J]. Journal of Applied Probability, 1982, 19(4): 794-802
- [38] Asynchronous Value Iteration for Markov Decision Processes with Continuous State Spaces [J]. 2020 Winter Simulation Conference (WSC), 2020: 2856-2866
- [40] AsyncQVI: Asynchronous-Parallel Q-Value Iteration for Discounted Markov Decision Processes with Near-Optimal Sample Complexity [J]. International Conference on Artificial Intelligence and Statistics, Vol 108, 2020, 108: 713-722