共 50 条
- [33] Regularization of the Policy Updates for Stabilizing Mean Field Games ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2023, PT II, 2023, 13936 : 361 - 372
- [34] Deterministic Policy Gradient: Convergence Analysis UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, VOL 180, 2022, 180 : 2159 - 2169
- [37] Symmetric (Optimistic) Natural Policy Gradient for Multi-agent Learning with Parameter Convergence INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 206, 2023, 206