共 50 条
- [41] Unified Off-Policy Learning to Rank: a Reinforcement Learning Perspective [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [42] Towards Optimal Off-Policy Evaluation for Reinforcement Learning with Marginalized Importance Sampling [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
- [44] More Efficient Off-Policy Evaluation through Regularized Targeted Learning [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
- [45] Off-policy Evaluation in Infinite-horizon Reinforcement Learning with Latent Confounders [J]. 24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130
- [46] Learning Action Embeddings for Off-Policy Evaluation [J]. ADVANCES IN INFORMATION RETRIEVAL, ECIR 2024, PT I, 2024, 14608 : 108 - 122
- [47] Distributed off-Policy Actor-Critic Reinforcement Learning with Policy Consensus [J]. 2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC), 2019, : 4674 - 4679
- [48] Statistically Efficient Off-Policy Policy Gradients [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
- [49] OPIRL: Sample Efficient Off-Policy Inverse Reinforcement Learning via Distribution Matching [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022), 2022,
- [50] Sample-Efficient Model-Free Reinforcement Learning with Off-Policy Critics [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2019, PT III, 2020, 11908 : 19 - 34