50 items in total
- [1] Learning Reward Machines for Partially Observable Reinforcement Learning [J]. Advances in Neural Information Processing Systems 32 (NIPS 2019), 2019, 32
- [2] Inverse reinforcement learning in partially observable environments [J]. Journal of Machine Learning Research, 2011, 12 : 691 - 730
- [4] Reinforcement Learning with Stochastic Reward Machines [J]. Thirty-Sixth AAAI Conference on Artificial Intelligence / Thirty-Fourth Conference on Innovative Applications of Artificial Intelligence / The Twelfth Symposium on Educational Advances in Artificial Intelligence, 2022 : 6429 - 6436
- [5] Blockwise Sequential Model Learning for Partially Observable Reinforcement Learning [J]. Thirty-Sixth AAAI Conference on Artificial Intelligence / Thirty-Fourth Conference on Innovative Applications of Artificial Intelligence / Twelfth Symposium on Educational Advances in Artificial Intelligence, 2022 : 7941 - 7948
- [6] Inverse Reinforcement Learning in Partially Observable Environments [J]. 21st International Joint Conference on Artificial Intelligence (IJCAI-09), Proceedings, 2009 : 1028 - 1033
- [7] Partially Observable Reinforcement Learning for Sustainable Active Surveillance [J]. Knowledge Science, Engineering and Management, KSEM 2018, Pt II, 2018, 11062 : 425 - 437
- [8] Regret Minimization for Partially Observable Deep Reinforcement Learning [J]. International Conference on Machine Learning, Vol 80, 2018, 80
- [9] Reward Machines: Exploiting Reward Function Structure in Reinforcement Learning [J]. Journal of Artificial Intelligence Research, 2022, 73 : 173 - 208