共 50 条
- [41] Off-Policy Deep Reinforcement Learning without Exploration INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
- [42] Modified Retrace for Off-Policy Temporal Difference Learning UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2023, 216 : 303 - 312
- [44] Safe Optimal Design with Applications in Off-Policy Learning INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151
- [45] Interpretable Off-Policy Learning via Hyperbox Search INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
- [46] Pessimistic Reward Models for Off-Policy Learning in Recommendation 15TH ACM CONFERENCE ON RECOMMENDER SYSTEMS (RECSYS 2021), 2021, : 63 - 74
- [48] Off-policy Learning for Remote Electrical Tilt Optimization 2020 IEEE 92ND VEHICULAR TECHNOLOGY CONFERENCE (VTC2020-FALL), 2020,
- [49] Adaptive Trade-Offs in Off-Policy Learning INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 34 - 43