50 entries in total
- [32] Deep Off-Policy Iterative Learning Control. LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 211, 2023, 211
- [33] Control Variates for Slate Off-Policy Evaluation. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [34] Temporal-difference learning and applications in finance. COMPUTATIONAL FINANCE 1999, 2000: 447-461
- [35] Average cost temporal-difference learning. PROCEEDINGS OF THE 36TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-5, 1997: 498-502
- [40] Boosted Off-Policy Learning. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 206, 2023, 206