共 50 条
- [1] A Deep Hierarchical Reinforcement Learning Algorithm in Partially Observable Markov Decision Processes [J]. IEEE ACCESS, 2018, 6 : 49089 - 49102
- [2] A pulse neural network reinforcement learning algorithm for partially observable Markov decision processes [J]. Systems and Computers in Japan, 2005, 36 (03): : 42 - 52
- [3] Fuzzy Reinforcement Learning Control for Decentralized Partially Observable Markov Decision Processes [J]. IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ 2011), 2011, : 1422 - 1429
- [4] Provably Efficient Offline Reinforcement Learning for Partially Observable Markov Decision Processes [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
- [5] Active learning in partially observable Markov decision processes [J]. MACHINE LEARNING: ECML 2005, PROCEEDINGS, 2005, 3720 : 601 - 608
- [6] Mixed reinforcement learning for partially observable Markov decision process [J]. 2007 INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN ROBOTICS AND AUTOMATION, 2007, : 436 - +
- [7] CHQ: A multi-agent reinforcement learning scheme for partially observable Markov decision processes [J]. IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON INTELLIGENT AGENT TECHNOLOGY, PROCEEDINGS, 2004, : 17 - 23
- [8] CHQ: A multi-agent reinforcement learning scheme for partially observable Markov decision processes [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2005, E88D (05): : 1004 - 1011
- [9] Learning deterministic policies in partially observable Markov decision processes [J]. INTELLIGENT AUTONOMOUS SYSTEMS: IAS-5, 1998, : 250 - 257
- [10] Learning factored representations for partially observable Markov decision processes [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 12, 2000, 12 : 1050 - 1056