50 entries in total
- [41] Peer Learning: Learning Complex Policies in Groups from Scratch via Action Recommendations. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 10, 2024: 11766 - 11774
- [42] Revisiting Smoothed Online Learning. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [44] Value Iteration and Action ε-Approximation of Optimal Policies in Discounted Markov Decision Processes. RECENT ADVANCES IN APPLIED MATHEMATICS, 2009: 213 - +
- [45] Genetic learning using adaptive action value tables. ADVANCED TOPICS ON EVOLUTIONARY COMPUTING, 2008: 136 - +
- [46] Chaining Value Functions for Off-Policy Learning. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELFTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022: 8187 - 8195