共 50 条
- [42] Pessimistic Q-Learning for Offline Reinforcement Learning: Towards Optimal Sample Complexity INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
- [44] PROJECTED STATE-ACTION BALANCING WEIGHTS FOR OFFLINE REINFORCEMENT LEARNING ANNALS OF STATISTICS, 2023, 51 (04): : 1639 - 1665
- [46] Leveraging Factored Action Spaces for Efficient Offline Reinforcement Learning in Healthcare ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
- [47] Safe Exploration of State and Action Spaces in Reinforcement Learning JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2012, 45 : 515 - 564
- [48] Self-Regulating Action Exploration in Reinforcement Learning PROCEEDINGS OF THE INTERNATIONAL NEURAL NETWORK SOCIETY WINTER CONFERENCE (INNS-WC2012), 2012, 13 : 18 - 30
- [49] #Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30