50 items in total
- [1] Maximizing the average reward in episodic reinforcement learning tasks. 2015 INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATICS AND BIOMEDICAL SCIENCES (ICIIBMS), 2015: 420-421
- [2] Reinforcement Learning of Pareto-Optimal Multiobjective Policies Using Steering. AI 2015: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2015, 9457: 596-608
- [4] Behaviour-Conditioned Policies for Cooperative Reinforcement Learning Tasks. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT IV, 2021, 12894: 493-504
- [5] NEFRL: A new neuro-fuzzy system for episodic reinforcement learning tasks. PROCEEDINGS OF THE FRONTIERS IN THE CONVERGENCE OF BIOSCIENCE AND INFORMATION TECHNOLOGIES, 2007: 819-824
- [6] Towards Interpretable Policies in Multi-agent Reinforcement Learning Tasks. BIOINSPIRED OPTIMIZATION METHODS AND THEIR APPLICATIONS, 2022, 13627: 262-276
- [9] Learning Options in Multiobjective Reinforcement Learning. THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017: 4907-4908