共 42 条
- [31] Convergence of Policy Gradient Methods for Nash Equilibria in General-sum Stochastic Games IFAC PAPERSONLINE, 2023, 56 (02): : 3435 - 3440
- [33] Learning Nash Equilibria in Zero-Sum Stochastic Games via Entropy-Regularized Policy Approximation PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 2462 - 2468
- [34] Local Analysis of Entropy-Regularized Stochastic Soft-Max Policy Gradient Methods 2023 EUROPEAN CONTROL CONFERENCE, ECC, 2023,
- [36] Synthesising results of meta-analyses to inform policy: a comparison of fast-track methods Environmental Evidence, 12
- [39] Entropy-Driven Stochastic Policy for Fast Federated Learning in Beyond 5G Edge-RAN 2021 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2021,
- [40] Performance Bounds for Policy-Based Reinforcement Learning Methods in Zero-Sum Markov Games With Linear Function Approximation 2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 7144 - 7149