共 10 条
- [1] Policy mirror descent for reinforcement learning: linear convergence, new sampling complexity, and generalized problem classes [J]. Mathematical Programming, 2023, 198 : 1059 - 1106
- [4] Convergence and Iteration Complexity of Policy Gradient Method for Infinite-horizon Reinforcement Learning [J]. 2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC), 2019, : 7415 - 7422
- [5] Off-policy learning based on weighted importance sampling with linear computational complexity [J]. UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2015, : 552 - 561
- [6] Reinforcement Learning in Linear Quadratic Deep Structured Teams: Global Convergence of Policy Gradient Methods [J]. 2020 59TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2020, : 4927 - 4932
- [7] Reinforcement Learning in Nonzero-sum Linear Quadratic Deep Structured Games: Global Convergence of Policy Optimization [J]. 2020 59TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2020, : 512 - 517
- [8] Reinforcement Learning of Control Policy for Linear Temporal Logic Specifications Using Limit-Deterministic Generalized Buchi Automata [J]. IEEE CONTROL SYSTEMS LETTERS, 2020, 4 (03): : 761 - 766
- [10] Policy Gradient Reinforcement Learning Method for Discrete-Time Linear Quadratic Regulation Problem Using Estimated State Value Function [J]. 2017 56TH ANNUAL CONFERENCE OF THE SOCIETY OF INSTRUMENT AND CONTROL ENGINEERS OF JAPAN (SICE), 2017, : 653 - 657