50 items in total
- [2] Expected Policy Gradients. THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018: 2868 - 2875
- [4] Human-Machine Coadaptation Based on Reinforcement Learning with Policy Gradients. 2019 8TH INTERNATIONAL CONFERENCE ON SYSTEMS AND CONTROL (ICSC'19), 2019: 247 - 251
- [6] A Survey of Actor-Critic Reinforcement Learning: Standard and Natural Policy Gradients. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2012, 42 (06): 1291 - 1307
- [8] Beyond Expected Return: Accounting for Policy Reproducibility When Evaluating Reinforcement Learning Algorithms. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 11, 2024: 12024 - 12032
- [9] Delay of Reinforcement Gradients in Children's Learning. PSYCHONOMIC SCIENCE, 1964, 1 (10): 307 - 308
- [10] Batch Reinforcement Learning with Hyperparameter Gradients. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119