共 50 条
- [31] Reinforcement Learning with Unbiased Policy Evaluation and Linear Function Approximation 2022 IEEE 61ST CONFERENCE ON DECISION AND CONTROL (CDC), 2022, : 801 - 806
- [32] Provably Fast Convergence of Independent Natural Policy Gradient for Markov Potential Games ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [33] Effective Linear Policy Gradient Search through Primal-Dual Approximation 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
- [34] Anchor-Changing Regularized Natural Policy Gradient for Multi-Objective Reinforcement Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
- [35] Finite-Sample Analysis of Off-Policy Natural Actor-Critic With Linear Function Approximation IEEE CONTROL SYSTEMS LETTERS, 2022, 6 : 2611 - 2616
- [36] Convergence of Batch Gradient Method Based on the Entropy Error Function for Feedforward Neural Networks Neural Processing Letters, 2020, 52 : 2687 - 2695
- [38] Independent Policy Gradient for Large-Scale Markov Potential Games: Sharper Rates, Function Approximation, and Game-Agnostic Convergence INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
- [39] Rates of convergence of performance gradient estimates using function approximation and bias in reinforcement learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 14, VOLS 1 AND 2, 2002, 14 : 1515 - 1522
- [40] CESARO CONVERGENCE OF GRADIENT METHOD OF CONVEX-CONCAVE FUNCTION SADDLE POINT APPROXIMATION DOKLADY AKADEMII NAUK SSSR, 1978, 239 (05): : 1056 - 1059