共 50 条
- [1] Temporal-difference emphasis learning with regularized correction for off-policy evaluation and control Applied Intelligence, 2023, 53 : 20917 - 20937
- [4] Distributed Consensus-Based Multi-Agent Off-Policy Temporal-Difference Learning 2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 5976 - 5981
- [5] Modified Retrace for Off-Policy Temporal Difference Learning UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2023, 216 : 303 - 312
- [7] Gradient Temporal-Difference Learning with Regularized Corrections INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
- [8] Gradient Temporal-Difference Learning with Regularized Corrections 25TH AMERICAS CONFERENCE ON INFORMATION SYSTEMS (AMCIS 2019), 2019,
- [9] Generalized gradient emphasis learning for off-policy evaluation and control with function approximation Neural Computing and Applications, 2023, 35 : 23599 - 23616