50 entries in total
- [31] Parallel reinforcement learning using multiple reward signals [J]. NEUROCOMPUTING, 2006, 69 (16-18) : 2171 - 2179
- [32] An Average-Reward Reinforcement Learning Algorithm based on Schweitzer's Transformation [J]. PROCEEDINGS OF THE 31ST CHINESE CONTROL CONFERENCE, 2012 : 2966 - 2970
- [33] Average Reward Reinforcement Learning for Optimal On-route Charging of Electric Buses [J]. 2020 IEEE 92ND VEHICULAR TECHNOLOGY CONFERENCE (VTC2020-FALL), 2020
- [35] Scalable Multi-Agent Reinforcement Learning for Networked Systems with Average Reward [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
- [36] RVI Reinforcement Learning for Semi-Markov Decision Processes with Average Reward [J]. 2010 8TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2010 : 1674 - 1679
- [37] Performance Bounds for Policy-Based Average Reward Reinforcement Learning Algorithms [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023
- [40] Reward contrast in delay and probability discounting [J]. Learning & Behavior, 2009, 37 : 281 - 288