共 50 条
- [4] Actor-critic algorithms for hierarchical Markov decision processes [J]. AUTOMATICA, 2006, 42 (04) : 637 - 644
- [5] An Online Actor–Critic Algorithm with Function Approximation for Constrained Markov Decision Processes [J]. Journal of Optimization Theory and Applications, 2012, 153 : 688 - 708
- [6] Improved Simultaneous Perturbation Stochastic Approximation-based Consensus Algorithm for Tracking* [J]. 2023 31ST MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION, MED, 2023, : 850 - 855
- [7] The actor-critic algorithm as multi-time-scale stochastic approximation [J]. Sadhana, 1997, 22 : 525 - 543
- [8] The actor-critic algorithm as multi-time-scale stochastic approximation [J]. SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 1997, 22 (4): : 525 - 543
- [9] A simultaneous deterministic perturbation actor-critic algorithm with an application to optimal mortgage refinancing [J]. PROCEEDINGS OF THE 45TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-14, 2006, : 4151 - 4156