50 items in total
- [2] Adaptive Temporal-Difference Learning for Policy Evaluation with Per-State Uncertainty Estimates. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
- [4] Fuzzy interpretation for temporal-difference learning in anomaly detection problems. Sukhanov, A.V., 1600, Polska Akademia Nauk (64): 625-632
- [6] Gaussian Process Temporal-Difference Learning with Scalability and Worst-Case Performance Guarantees. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021: 3485-3489
- [7] Striatal and Tegmental Neurons Code Critical Signals for Temporal-Difference Learning of State Value in Domestic Chicks. FRONTIERS IN NEUROSCIENCE, 2016, 10
- [8] On the Convergence of Reinforcement Learning in Nonlinear Continuous State Space Problems. 2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021: 2969-2975
- [9] Temporal difference learning in continuous time and space. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 8: PROCEEDINGS OF THE 1995 CONFERENCE, 1996, 8: 1073-1079
- [10] Intentionally-underestimated value function at terminal state for temporal-difference learning with mis-designed reward. RESULTS IN CONTROL AND OPTIMIZATION, 2025, 18