共 50 条
- [21] A MULTIAGENT REINFORCEMENT LEARNING FRAMEWORK FOR OFF-POLICY EVALUATION IN TWO-SIDED MARKETS ANNALS OF APPLIED STATISTICS, 2023, 17 (04): : 2701 - 2722
- [25] Optimal tracking control for non-zero-sum games of linear discrete-time systems via off-policy reinforcement learning OPTIMAL CONTROL APPLICATIONS & METHODS, 2020, 41 (04): : 1233 - 1250
- [26] Off-Policy Reinforcement Learning for Optimal Preview Tracking Control of Linear Discrete-Time systems with unknown dynamics 2018 CHINESE AUTOMATION CONGRESS (CAC), 2018, : 1402 - 1407
- [27] Safe and efficient off-policy reinforcement learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
- [29] Off-Policy Reinforcement Learning with Delayed Rewards INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
- [30] Bounds for Off-policy Prediction in Reinforcement Learning 2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 3991 - 3997