50 results in total
- [1] Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
- [3] Optimistic Sampling Strategy for Data-Efficient Reinforcement Learning [J]. IEEE ACCESS, 2019, 7 : 55763 - 55769
- [4] Off-policy and on-policy reinforcement learning with the Tsetlin machine [J]. Applied Intelligence, 2023, 53 : 8596 - 8613
- [5] Data-Efficient Policy Evaluation Through Behavior Policy Search [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
- [7] Tabu search exploration for on-policy reinforcement learning [J]. PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS 2003, VOLS 1-4, 2003 : 2910 - 2915
- [8] Double Reinforcement Learning for Efficient and Robust Off-Policy Evaluation [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
- [10] Data-Efficient Hierarchical Reinforcement Learning [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31