共 50 条
- [11] Neural Network Compatible Off-Policy Natural Actor-Critic Algorithm [J]. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
- [12] An Optimistic Approach to the Temporal Difference Error in Off-Policy Actor-Critic Algorithms [J]. 2022 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2022, : 875 - 883
- [13] Finite-Sample Analysis of Off-Policy Natural Actor-Critic Algorithm [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
- [15] Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
- [17] An Improved Off-Policy Actor-Critic Algorithm with Historical Behaviors Reusing for Robotic Control [J]. INTELLIGENT ROBOTICS AND APPLICATIONS (ICIRA 2022), PT IV, 2022, 13458 : 449 - 458
- [18] A Multi-Agent Off-Policy Actor-Critic Algorithm for Distributed Reinforcement Learning [J]. IFAC PAPERSONLINE, 2020, 53 (02): : 1549 - 1554
- [19] Finite-Sample Analysis of Off-Policy Natural Actor-Critic With Linear Function Approximation [J]. IEEE CONTROL SYSTEMS LETTERS, 2022, 6 : 2611 - 2616