共 50 条
- [41] Accountable Off-Policy Evaluation With Kernel Bellman Statistics INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
- [42] Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
- [43] Off-Policy Proximal Policy Optimization THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 8, 2023, : 9162 - 9170
- [44] Average-Reward Off-Policy Policy Evaluation with Function Approximation INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
- [45] A Nonparametric Off-Policy Policy Gradient INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108
- [46] Boosted Off-Policy Learning INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 206, 2023, 206
- [47] Supervised Off-Policy Ranking INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022, : 10323 - 10339
- [48] Q(λ) with Off-Policy Corrections ALGORITHMIC LEARNING THEORY, (ALT 2016), 2016, 9925 : 305 - 320
- [49] On the Relation between Policy Improvement and Off-Policy Minimum-Variance Policy Evaluation UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2023, 216 : 1423 - 1433
- [50] Asymptotically Efficient Off-Policy Evaluation for Tabular Reinforcement Learning INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108