共 50 条
- [1] Node Constraint Routing Algorithm based on Reinforcement Learning [J]. PROCEEDINGS OF 2016 IEEE 13TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP 2016), 2016, : 1752 - 1756
- [2] Robust Imitation via Mirror Descent Inverse Reinforcement Learning [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
- [3] Efficient Model-Based Concave Utility Reinforcement Learning through Greedy Mirror Descent [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238
- [5] Mirror Descent Learning in Continuous Games [J]. 2017 IEEE 56TH ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2017,
- [6] Analysis of Online Composite Mirror Descent Algorithm [J]. NEURAL COMPUTATION, 2017, 29 (03) : 825 - 860
- [7] Gradient descent for general reinforcement learning [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 11, 1999, 11 : 968 - 974
- [9] Policy mirror descent for reinforcement learning: linear convergence, new sampling complexity, and generalized problem classes [J]. Mathematical Programming, 2023, 198 : 1059 - 1106
- [10] Energy-Based Policy Constraint for Offline Reinforcement Learning [J]. ARTIFICIAL INTELLIGENCE, CICAI 2023, PT II, 2024, 14474 : 335 - 346