共 50 条
- [1] Approximate Relative Value Learning for Average-reward Continuous State MDPs 35TH UNCERTAINTY IN ARTIFICIAL INTELLIGENCE CONFERENCE (UAI 2019), 2020, 115 : 956 - 964
- [3] An Approximately Optimal Relative Value Learning Algorithm for Averaged MDPs with Continuous States and Actions 2019 57TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2019, : 734 - 740
- [4] An Empirical Algorithm for Relative Value Iteration for Average-cost MDPs 2015 54TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2015, : 5079 - 5084
- [7] NON-PARAMETRIC EMPIRICAL BAYES PROCEDURES ANNALS OF MATHEMATICAL STATISTICS, 1957, 28 (03): : 649 - 669
- [10] Non-parametric manifold learning ELECTRONIC JOURNAL OF STATISTICS, 2024, 18 (02): : 3903 - 3930