50 entries in total
- [21] Constraints Penalized Q-learning for Safe Offline Reinforcement Learning [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELFTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022: 8753 - 8760
- [24] Swarm Reinforcement Learning Method Based on Hierarchical Q-Learning [J]. 2021 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2021), 2021
- [25] Reinforcement Learning for Taxi-out Time Prediction: An improved Q-learning Approach [J]. 2015 INTERNATIONAL CONFERENCE ON COMPUTING AND NETWORK COMMUNICATIONS (COCONET), 2015: 757 - 764
- [26] Tightening the Dependence on Horizon in the Sample Complexity of Q-Learning [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
- [28] INTERNALLY DRIVEN Q-LEARNING: Convergence and Generalization Results [J]. ICAART: PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 1, 2012: 491 - 494
- [29] Q-learning agents in a Cournot oligopoly model [J]. JOURNAL OF ECONOMIC DYNAMICS & CONTROL, 2008, 32 (10): 3275 - 3293