共 50 条
- [1] General discounting versus average reward [J]. ALGORITHMIC LEARNING THEORY, PROCEEDINGS, 2006, 4264 : 244 - 258
- [2] Hierarchical average reward reinforcement learning [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2007, 8 : 2629 - 2669
- [3] Hierarchical average reward reinforcement learning [J]. Journal of Machine Learning Research, 2007, 8 : 2629 - 2669
- [4] Robust Average-Reward Reinforcement Learning [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2024, 80 : 719 - 803
- [5] Robust Average-Reward Reinforcement Learning [J]. Journal of Artificial Intelligence Research, 2024, 80 : 719 - 803
- [6] Inverse Reinforcement Learning with the Average Reward Criterion [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [7] A Modified Average Reward Reinforcement Learning Based on Fuzzy Reward Function [J]. IMECS 2009: INTERNATIONAL MULTI-CONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, VOLS I AND II, 2009, : 113 - 117
- [9] Maximizing the average reward in episodic reinforcement learning tasks [J]. 2015 INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATICS AND BIOMEDICAL SCIENCES (ICIIBMS), 2015, : 420 - 421
- [10] Auto-exploratory average reward Reinforcement Learning [J]. PROCEEDINGS OF THE THIRTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND THE EIGHTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE, VOLS 1 AND 2, 1996, : 881 - 887