共 50 条
- [1] Improving Stochastic Policy Gradients in Continuous Control with Deep Reinforcement Learning using the Beta Distribution [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
- [2] Learning Continuous-Action Control Policies [J]. ADPRL: 2009 IEEE SYMPOSIUM ON ADAPTIVE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING, 2009, : 169 - 176
- [3] Learning Stochastic Parametric Differentiable Predictive Control Policies [J]. IFAC PAPERSONLINE, 2022, 55 (25): : 121 - 126
- [4] Reinforcement learning for continuous stochastic control problems [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 10, 1998, 10 : 1029 - 1035
- [5] Autoregressive Policies for Continuous Control Deep Reinforcement Learning [J]. PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 2754 - 2762
- [6] Learning Policies for Continuous Control via Transition Models [J]. ACTIVE INFERENCE, IWAI 2022, 2023, 1721 : 162 - 178
- [8] Continuous-Review Tracking Policies for Dynamic Control of Stochastic Networks [J]. Queueing Systems, 2003, 43 : 43 - 80
- [9] Learning Policies from Self-Play with Policy Gradients and MCTS Value Estimates [J]. 2019 IEEE CONFERENCE ON GAMES (COG), 2019,
- [10] Learning First-to-Spike Policies for Neuromorphic Control Using Policy Gradients [J]. 2019 IEEE 20TH INTERNATIONAL WORKSHOP ON SIGNAL PROCESSING ADVANCES IN WIRELESS COMMUNICATIONS (SPAWC 2019), 2019,