共 50 条
- [1] Conditional Importance Sampling for Off-Policy Learning [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 45 - 54
- [2] Towards Optimal Off-Policy Evaluation for Reinforcement Learning with Marginalized Importance Sampling [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
- [3] Off-Policy Differentiable Logic Reinforcement Learning [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2021: RESEARCH TRACK, PT II, 2021, 12976 : 617 - 632
- [4] Weighted importance sampling for off-policy learning with linear function approximation [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27
- [6] Off-policy learning based on weighted importance sampling with linear computational complexity [J]. UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2015, : 552 - 561
- [8] Off-Policy Evaluation via Off-Policy Classification [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
- [10] A perspective on off-policy evaluation in reinforcement learning [J]. Frontiers of Computer Science, 2019, 13 : 911 - 912