共 50 条
- [32] A temporal-difference learning method using gaussian state representation for continuous state space problems 1600, Japanese Society for Artificial Intelligence (29):
- [35] Optimal Active Fault Diagnosis by Temporal-Difference Learning 2016 IEEE 55TH CONFERENCE ON DECISION AND CONTROL (CDC), 2016, : 2146 - 2151
- [36] On Average Versus Discounted Reward Temporal-Difference Learning Machine Learning, 2002, 49 : 179 - 191
- [37] Temporal-Difference Learning with Sampling Baseline for Image Captioning THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 6706 - 6713
- [38] On the Convergence of Temporal-Difference Learning with Linear Function Approximation Machine Learning, 2001, 42 : 241 - 267
- [39] Neural Temporal-Difference Learning Converges to Global Optima ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
- [40] On the asymptotic behavior of a constant stepsize temporal-difference learning algorithm COMPUTATIONAL LEARNING THEORY, 1999, 1572 : 126 - 137