共 50 条
- [1] Safe and efficient off-policy reinforcement learning [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
- [2] Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
- [3] Asymptotically Efficient Off-Policy Evaluation for Tabular Reinforcement Learning [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108
- [4] OPIRL: Sample Efficient Off-Policy Inverse Reinforcement Learning via Distribution Matching [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022), 2022,
- [5] Double Reinforcement Learning for Efficient and Robust Off-Policy Evaluation [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
- [6] Efficient Off-Policy Meta-Reinforcement Learning via Probabilistic Context Variables [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
- [7] Brain-inspired computing and machine learning [J]. NEURAL COMPUTING & APPLICATIONS, 2020, 32 (11): : 6641 - 6643
- [8] Brain-inspired computing and machine learning [J]. Neural Computing and Applications, 2020, 32 : 6641 - 6643
- [9] Intrinsically Efficient, Stable, and Bounded Off-Policy Evaluation for Reinforcement Learning [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32